TopBraid EDG Data Governance Packages
TopBraid Enterprise Data Governance (TopBraid EDG) is a solution for comprehensive data governance. EDG makes it possible for you to manage a complete range of assets and use cases.
We recognize that in ramping up a data governance program, different organizations may have differing priorities and starting points. With EDG, you can start incrementally. For example, you may start using EDG for just business glossaries or just reference data or you may first focus on metadata management. After the initial start, you can always extend your scope to governing other assets when you are ready to do so.
To support this comprehensive but staged approach, TopBraid EDG provides the following packages, available for use as an initial configuration of EDG. Each package can also be used in any combination with the other packages toward your targeted scope of data governance.
Governance Packages Available in TopBraid EDG
Reference Data Management
*Customer defined asset collections are also supported. See the add-on module Customer Defined Asset Collection Types.
TOPBRAID EDG PACKAGES
TopBraid EDG includes an operational Governance Model as part of all packages. The Governance Model lets you define your organization's governance charter, governance/subject areas and domains, organizational structure (roles, users) policies, workflows, metrics, dashboards, and other governance assets.
The Vocabulary Management package lets organizations create, connect and use taxonomies (vocabularies that are based on SKOS, the W3C standard for managing taxonomies and thesauruses) and ontologies (defined using SHACL, RDFS and/or OWL). This package is often used in combination with Tagger and AutoClassifier to categorize documents.
*Note: The TopBraid EDG Vocabulary Management Package (TopBraid EDG-VM) offers the same capabilities (and uses the same codebase) as the “stand-alone” TopBraid Enterprise Vocabulary Management (TopBraid EVN) product that was released prior to EDG. It also has additional features. Unlike TopBraid EVN, TopBraid EDG-VM is combinable with other TopBraid EDG packages. Further, TopBraid EDG-VM (as do all EDG packages) offers “Search the EDG” functionality that is not available in TopBraid EVN. Starting with 5.5, users of TopBraid EDG (all packages) will be able to easily configure a variety of personalized dashboards. There are no other differences. TopBraid EDG-VM is the next generation of TopBraid EVN.
|Big Data Assets|
The Metadata Management lets organizations govern technical metadata – information about their databases, datasets, logical and physical data models and other data assets. When combined with the Business Glossaries package, this provides capability to map key data elements to business terms. When combined with the Reference Data package, this provides capability to specify permissible data values as reference data. Additionally, it lets organizations govern information about their business applications, infrastructure, business capabilities and processes. This package also includes the ability to define and trace detailed data lineage as data flows between applications.
The Reference Data Management package makes it easy to bring consistency and accuracy to the use of reference data across enterprise applications. It gives end users flexible capabilities to profile, govern, update and provision reference datasets, and supports comprehensive metadata to document the meaning (semantics) of reference data.
The Business Glossary package provides a simple starting place for a data governance initiative. It lets you define and connect business terms. It is included with the Metadata Management package, letting you establish connections between terms and technical metadata. Glossaries can also be included as an add-on module to other packages.
Add on Modules Available for TopBraid EDG
List of Asset Collection Types Available in TopBraid EDG
|Big Data Assets support governance of assets that make up a big data ecosystem. Big Data assets include metadata for Avro and other Schemas, Catalogs (e.g. HCatalog), Clusters, Configurations, Containers, Files, HBase, HDFS, Jobs, Nodes, Property-Value Pairs, Recipes, Region Servers, Regions, Resilient Distributed Datasets (RDDs), Schedulers, Sqoop and other Scripts, Tables, and Trackers. Included in EDG Package: Metadata Management||Business Glossaries let you define and connect business terms. Creating a Business Glossary can provide a simple starting place for a data governance initiative. When used in combination with the metadata management packages in EDG, a glossary lets you establish connections between terms and technical metadata. Included in EDG Packages: Business Glossary and Metadata Management. Also available as an add-on Asset Collection type with the Vocabulary Management and Reference Data Management packages.||Content Corpora is a collection of read-only textual items, such as documents, excerpts, websites, etc.—along with associated metadata. The original items are imported from external sources, such as CMSs or web sites and are not created nor edited within EDG. Hundreds of document formats are supported. The textual content of Corpus items provides the foundation for manual or automated tagging and annotation with Content Tag Sets. Included in Add-on Module: TopBraid Tagger with AutoClassifier used in conjunction with EDG Package: Vocabulary Management|
|Content Tag Sets are used for tagging content using vocabularies managed in Vocabulary Management Package. Use of Content Tag Sets requires the add-on module TopBraid Tagger. Users can tag (assign metadata to) content through a visual user interface that displays the context for both the content and the vocabulary. They can also run Tagger's auto-classification capability to automatically assign relevant tags to content and review the results. Included in Add-on Module: TopBraid Tagger with AutoClassifier used in conjunction with EDG Package: Vocabulary Management||Crosswalks let you create connections between resources in two different asset collections. This is especially useful for defining connections between two different standard vocabularies or between a standard one and a specialized local one. They also commonly used to map two reference datasets. Applications can use saved crosswalk connection data to enhance the use of either vocabulary by taking advantage of the connected data and metadata for search, classification, and other operations. Included in EDG Packages: Reference Data Management, Vocabulary Management||Data Assets support cataloging and governing assets that make up a data ecosystem. Data assets include databases, database columns and tables, data elements, datasets and their schemas, and logical and physical models. Included in EDG Packages: Metadata Management|
|Datatypes support the specification of scalar data types, structured datatypes, scales and code lists. Scalar datatypes include all of the ORACLE data types. Structured data types provide for the definition of arrays, lists and other composite data types. Code lists are used to specify enumerated values that need not be governed as reference data, such as status values. Included in EDG Packages: Metadata Management||Enterprise Assets cataloging and governing enterprise assets such as business functions, activities, roles, capabilities, processes, and information assets including forms, documents and reports. Included in EDG Package: Metadata Management||EDG Enumerations are a light-weight form of reference data. There can be hundreds of enumerations as pick lists or code lists. Included in EDG Packages: Metadata Management|
|Lineage Models support capturing information about how data flows from databases, applications to users, for reports, in support of business activities and functions, and the enablement of enterprise capabilities. Included in EDG Package: Metadata Management||Ontologies are knowledge models that can be used to generate data models (for example) or are simply used as for various purposes as important documentation and reference for the business stakeholders. In EDG, they can be used as conceptual models that can connect terms with the technical metadata (e.g., tables and columns). Included in EDG Packages: Reference Data Management, Vocabulary Management for specific package requirements. Also included in ALL EDG Packages for customization of that package (EDG is an ontology(model)-driven system.||Reference Datasets are controlled datasets of industry defined codes. Reference datasets are essentially lists such as country codes, currency codes and product types. Reference data is found in practically every enterprise application including back-end systems, front-end commerce applications, and data warehouses. In EDG, the main classes for respective reference datasets are defined in corresponding ontologies. Included in EDG Package: Reference Data Management|
|Requirements allow cataloging, capturing and connecting variety of requirements such as data requirements, regulatory requirements and security requirements and support traceability of requirements to other EDG-managed assets. Included in EDG Package: Metadata Management||SKOS Taxonomies are vocabularies that are based on SKOS, the W3C standard for managing taxonomies and thesauruses. These information models contain hierarchies of terms connected using broader/narrower relationships. In addition to standard SKOS attributes, they can contain custom attributes and relationships. Included in EDG Package: Vocabulary Management||Technical Assets cataloging and governing information about software systems, business applications as well as technical infrastructure like servers and networks. Included in EDG Package: Metadata Management|