Skip to main content Skip to complementary content

Define the match analysis

Procedure

  1. From the Profiling perspective, right-click Metadata and create a file connection to the duplicated_records output file generated by the Job.
    For further information, check the Data Profiling part in the Talend Studio User Guide.
  2. Expand the new file connection under Metadata and select Analyze matches.
  3. Follow the steps in the wizard to define the analysis metadata and click Finish to open the analysis editor.
  4. In the Matching Key table, define a match key on the Code column to group records by their identification, records which have the same code are grouped together.
  5. Click Chart below the table to show the duplicates generated according to the Bernoulli distribution selected previously in the Job.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!