High-level integration language programs, called flows, are made up of deterministic and probabilistic rules. Each rule controls a specific step in the entity resolution process. The rules consist of matching and filtering criteria.
In deterministic rules, both matching and filtering are done deterministically by using a combination of exact equalities and various matching functions (for example, a geo-proximity function or a function that assesses how common the name is according to US Census data). In probabilistic rules, the matching is done with a PME engine that uses standard PME matching algorithms to drive comparison. Filtering of the probabilistic matching results is done deterministically.
Creating a comprehensive set of linkages requires both probabilistic and deterministic matching flows. The two types of flows complement each other.
The language joins the results of both flows to create linkages between enterprise customer and social profiles. The matching process uses attribute filtering, while the linking and join processes use grouping and cardinality definitions to resolve conflicts. (See the related links to learn more about filtering attributes for better results and about cardinality and its use in conflict resolution policies.)
Thresholds are used by both deterministic and probabilistic flows. Changes to thresholds for deterministic rules are made in the code. Any changes to the deterministic thresholds should be done with the guidance of a data analyst. Changes to probabilistic thresholds (auto-link threshold) are made in the PME algorithm through the Big Match Console or MDM Workbench. See the related topics for specific information about adjusting thresholds.