Setting up the Job - 7.1

Processing (Integration)

English (United States)
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
Talend Studio
Data Governance > Third-party systems > Processing components (Integration)
Data Quality and Preparation > Third-party systems > Processing components (Integration)
Design and Development > Third-party systems > Processing components (Integration)


  1. Drop these components from the Palette to the design workspace: tFileInputDelimited, tExtractDynamicFields, tUniqRow, tFileOutputDelimited, and tLogRow, and name the components as shown above to better identify their roles in the Job.
  2. Connect the component labelled People, the component labelled Split_Column, and the component labelled Deduplicate using Row > Main connections.
  3. Connect the component labelled Deduplicate and the component labelled Unique_Families using a Main > Uniques connection.
  4. Connect the component labelled Deduplicate and the component labelled Duplicated_Families using a Main > Duplicates connection.