Governing and curating data (Watson Knowledge Catalog)

With the Watson Knowledge Catalog service, you can create catalogs of curated assets that are supported by a governance framework.

Service This service is not available by default. An administrator must install this service on the IBM Cloud Pak for Data platform. To determine whether the service is installed, open the Services catalog and check whether the service is enabled.

Watch the following video for an overview of Watson Knowledge Catalog features.

This video provides a visual method as an alternative to following the written steps in this documentation.

How you get started depends on your user role and permissions and your goal. To see which roles and permissions you have, click your user avatar, select Profile and settings, and then view the Permissions page. If you need more permissions, contact your Cloud Pak for Data administrator.

Get started with Watson Knowledge Catalog
Role Goal
Administrator Prepare for governance
Data Scientist Find data assets in a catalog
Business Analyst Find data assets in a catalog
View information assets
Data Steward or Data Engineer Curate data
Create governance artifacts
Data Quality Analyst Curate data
Create governance artifacts
Analyze data quality
Developer Use Watson Knowledge Catalog APIs

Prepare for governance

As a user with the Administrator role, you must prepare for governance before users with other roles can start working.

To prepare for governance, complete these tasks:

  1. Create governance categories.
  2. Assign Watson Knowledge Catalog roles to users.
  3. Add users to categories.
  4. Set up workflow configurations.

See Preparing for governance. You can also perform other administrative tasks to customize and control Watson Knowledge Catalog. See Managing Watson Knowledge Catalog.

Create governance artifacts to govern data assets

To view or create governance artifacts, open the main menu and choose Governance, and then the artifact type.

Data Stewards and Data Quality Analysts can govern data assets in catalogs by using governance artifacts in these ways:

Administrators can configure workflows for governance artifacts to require explicit approvals for new or updated artifacts.

See Governance artifacts.

Curate data assets

To curate data, open the main menu and choose Governance and then Data discovery or Data quality.

Basic data curation tools are available to all catalog collaborators. Basic data curation tools help you develop valuable data assets in these ways:

Advanced data curation tools require the Data Steward and Data Quality Analyst roles. Advanced data curation tools help you develop valuable data assets in these ways:

See Curate data.

Find assets

You can search across all catalogs that you are a member of by entering one or more words in the global search field.

To open a catalog, open the main menu and choose Catalogs > All catalogs, and then click the name of a catalog.

You can find assets within a catalog in these ways:

If you have the Business Analyst, Data Steward, or Data Quality Analyst role, you can view more information about assets that are in the default catalog in the Information assets view:

See Find and view assets in catalogs and Information assets view.

Work with assets in analytics projects

To discover insights by working with data, you need to move the assets to an analytics project. You can also use a project as a staging area to curate data assets before publishing them to the catalog. Projects contain a select subset of catalog collaborators.

To open or create a project, open the main menu and choose Projects.

You have these capabilities for working with assets in projects with Watson Knowledge Catalog:

See Analytics projects.

If you have the Watson Studio service installed, you can analyze data and build models.

Use Watson Knowledge Catalog APIs

To use Watson Knowledge Catalog APIs in your application, you can call endpoints with a request URL in this format:

https://{web-client}/{API-path}?{API-query}

Replace these variables:

Variables for calling API endpoints
Variable Replace with
{web-client} The IP address or name of your Cloud Pak for Data web client.
{API-path} The path for the API.
For example, use /v2/catalogs to return the list of catalogs.
{API-query} The query string for the API, if applicable.
For example, use /v2/asset_types?catalog_id=5 to return the list of asset types in the catalog with the ID of 5.

You can view the API documentation at this URL:

https://{web-client}/data-api/api-explorer/

Learn more