Setting up the Job - 7.3
Processing (Integration)
- Version
- 7.3
- Language
- English
- Product
- Talend Big Data
- Talend Big Data Platform
- Talend Data Fabric
- Talend Data Integration
- Talend Data Management Platform
- Talend Data Services Platform
- Talend ESB
- Talend MDM Platform
- Talend Open Studio for Big Data
- Talend Open Studio for Data Integration
- Talend Open Studio for ESB
- Talend Real-Time Big Data Platform
- Module
- Talend Studio
- Last publication date
- 2023-09-12
Procedure
- Drop these components from the Palette to the design workspace: tFileInputDelimited, tExtractDynamicFields, tUniqRow, tFileOutputDelimited, and tLogRow, and name the components as shown above to better identify their roles in the Job.
- Connect the component labelled People, the component labelled Split_Column, and the component labelled Deduplicate using Row > Main connections.
- Connect the component labelled Deduplicate and the component labelled Unique_Families using a Main > Uniques connection.
- Connect the component labelled Deduplicate and the component labelled Duplicated_Families using a Main > Duplicates connection.
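
The data flow wired up in the steps above can be sketched in plain Java to show what each connection carries. This is an illustrative sketch only, not the code Talend Studio generates. It assumes a hypothetical input where each row is `first;last` and the family name is the deduplication key; neither the column layout nor the key is stated in this procedure. The class name `DedupSketch`, the `Result` holder, and the sample rows are invented for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class DedupSketch {

    /** Holds the two output flows of the Deduplicate step. */
    static final class Result {
        final List<String> uniques = new ArrayList<>();     // would feed Unique_Families
        final List<String> duplicates = new ArrayList<>();  // would feed Duplicated_Families
    }

    /**
     * Splits each row (Split_Column) and routes the family name (Deduplicate):
     * the first occurrence of a key goes to the uniques flow,
     * every later occurrence goes to the duplicates flow.
     */
    static Result deduplicate(List<String> rows) {
        Result result = new Result();
        Set<String> seen = new HashSet<>();
        for (String row : rows) {
            // Assumed layout: "first;last" (hypothetical, not from the procedure)
            String[] fields = row.split(";");
            String family = fields[1].trim();
            if (seen.add(family)) {
                result.uniques.add(family);
            } else {
                result.duplicates.add(family);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Stand-in for the rows read by the People component
        List<String> people = Arrays.asList("Ann;Smith", "Bob;Jones", "Cid;Smith");
        Result r = deduplicate(people);
        System.out.println("Unique_Families: " + r.uniques);
        System.out.println("Duplicated_Families: " + r.duplicates);
    }
}
```

In the Job itself, the two lists correspond to the Uniques and Duplicates connections leaving the Deduplicate component, so each incoming row ends up in exactly one of the two outputs.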