Creating a duplicate identification infoset

Use the Duplicate pane to identify duplicate objects. A new duplicate identification infoset is created to include the duplicate objects.

Procedure

  1. From the Infoset list, select an infoset as a source infoset that you want to identify its duplicate objects.
  2. Click Create > Duplicate.
  3. In the Available Operation Infosets pane, select an infoset as a primary infoset to compare to the source infoset.
  4. Click Create Infoset.
  5. Enter a name and a description for this infoset.
    When you name the infoset, avoid using commas (,). Although a comma is a valid character, it causes queries to fail when specified as part of the search text.

    If the infoset is published to the governance catalog, the description will be part of the asset details. Therefore, make it as detailed as possible.

  6. Decide whether you want to create the infoset with or without the data map.
    • To generate the data map for this infoset, leave the Include Data Map for the Refine tab checkbox selected. The infoset takes longer to generate, but you can use the data map and its function later to refine the infoset.
    • To create the infoset without the data map, clear the Include Data Map for the Refine tab checkbox. In this case, the infoset generates more quickly, but no visualization of the data is available later to you for refining the infoset.
      Note: If the Compliance, Data Assessment, Data Topology, Overlay, and Customized Reports are run against an infoset without a data map, they contain no data.
  7. To include this infoset in the set of objects that are synchronized with the governance catalog, select Publish to catalog.
    Even if you do not select this option, the infoset might be published to the catalog as an inferred object. This type of synchronization happens when the infoset serves as direct input to an object that is selected for publishing. The infoset is then required for proper object modeling in the governance catalog.
  8. If you want specific users to be notified when the infoset is created, select Send email when the process is complete and enter the appropriate email addresses.
  9. To determine the overlays that later can be applied to the infoset within the data map, click Select Overlays and add up to seven overlays.
    You can select overlays only if you select to generate the data map.
  10. Click Save and View to go to the Ancestry pane, or click Save to return to the Filters pane.

Results

A new duplicate identification infoset is created to consist of all duplicate objects except any objects from the primary infoset. It is ready for you to start the data deduplication process.