About this task
In this scenario, the main input schema is already stored in the Repository. For more information about storing schema metadata in the repository, see the Talend Studio User Guide.
In the Repository tree view, expand
Metadata - DB
Connections where you have stored the main input schema and
drop the database table onto the design workspace. The input table used in
this scenario is called customer.
A dialog box is displayed with a list of components.
- Select the relevant database component, tMysqlInput in this example, and then click OK.
- Drop two tGenKey components, two tMatchGroup components, a tMap and a tLogRow components from Palette onto the design workspace.
- Link the input component to the tGenKey and tMap components using Main links.
In the two tMatchGroup components, select the
Output distance details check boxes in the
Advanced settings view of both components
before linking them together.
This will provide the MATCHING_DISTANCES column in the output schema of each tMatchGroup.If the two tMatchGroup components are already linked to each other, you must select the Output distance details check box in the second component in the Job flow first otherwise you may have an issue.
- Link the two tMatchGroup components and the tLogRow component using Main links.
If needed, give the components specific labels to reflect their usage in
For further information about how to label a component, see Talend Studio User Guide.