Defining the survivor validation flow - 6.5

Deduplication

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio

Procedure

  1. Double-click tRuleSurvivorship to open its Basic settings view.
  2. Click the Sync columns button to retrieve the schema from the previous component.
  3. From the Group Identifier list, select the column to use as the group identifier.
  4. In the Rule package name field, enter the name of the rule package to create for the survivor validation flow. In this example, the name is org.talend.survivorship.sample.
  5. In the Rule table, click the [+] button to add as many rows as required and complete them using the corresponding rule definitions.
  6. Next to Generate rules and survivorship flow, click the icon to generate the rule package with the contents you have defined.

    The generated rule package is stored in the Metadata > Rules Management > Survivorship Rules directory of the Talend Studio Repository. From there, you can open the survivor validation flow created in this example and view its diagram.
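
    The procedure above is carried out in the Studio interface, but the idea behind a survivor validation flow can be illustrated outside it. The following Python snippet is a minimal conceptual sketch only. It is not the rule package that Talend Studio generates and it does not use any Talend API. The sample rows, the column names (gid, name, city, updated) and the helper functions (longest, most_common, most_recent, build_survivors) are all hypothetical. It assumes that each rule pairs one column with one selection function, and that rows sharing the same group identifier are merged into a single survivor record.

```python
from collections import Counter, defaultdict

# Hypothetical sample rows; "gid" plays the role of the Group Identifier column.
ROWS = [
    {"gid": "g1", "name": "Ann Lee", "city": "Paris", "updated": "2017-03-01"},
    {"gid": "g1", "name": "Ann M. Lee", "city": "Paris", "updated": "2016-11-20"},
    {"gid": "g1", "name": "A. Lee", "city": "Lyon", "updated": "2017-06-15"},
    {"gid": "g2", "name": "Bob Ray", "city": "Rome", "updated": "2015-01-05"},
]


def longest(rows, column):
    """Return the longest value found in the column."""
    return max(rows, key=lambda row: len(row[column]))[column]


def most_common(rows, column):
    """Return the most frequent value found in the column."""
    counts = Counter(row[column] for row in rows)
    return counts.most_common(1)[0][0]


def most_recent(rows, column):
    """Return the latest value; ISO 8601 dates sort correctly as strings."""
    return max(rows, key=lambda row: row[column])[column]


# Hypothetical rule table: one (column, selection function) pair per row.
RULES = [("name", longest), ("city", most_common), ("updated", most_recent)]


def build_survivors(rows, group_column, rules):
    """Group rows by the group identifier and apply every rule to each group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_column]].append(row)
    return {
        gid: {column: pick(members, column) for column, pick in rules}
        for gid, members in groups.items()
    }


survivors = build_survivors(ROWS, "gid", RULES)
for gid, record in survivors.items():
    print(gid, record)
```

    In this sketch, each entry in RULES corresponds loosely to one row of the Rule table, and the group column corresponds to the column selected as the group identifier. How the generated rule package actually evaluates and orders its rules may differ.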