Finding duplicate files between two folders - Cloud - 8.0

tFileList

Version: Cloud 8.0
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Last publication date: 2024-02-20

This scenario describes a Job that iterates over the files in two folders, converts the iteration results into data flows to build a list of filenames, and then picks out all duplicates from that list and displays them on the Run console. Such a Job can serve, for example, as a preparation step before merging the two folders.
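
To make the logic of the Job concrete, here is a minimal plain-Java sketch of the same idea: list the filenames in two folders and print the names that appear in both. It is not the code that Talend Studio generates, and it does not use tFileList. The class name `DuplicateFileNames`, the helper methods, and the default folder names `folderA` and `folderB` are hypothetical. The sketch also assumes that a duplicate means an identical filename, that only the top level of each folder is scanned, and that Java 11 or later is available.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Set;
import java.util.TreeSet;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class DuplicateFileNames {

    // Collect the names of regular files directly inside a folder (no recursion).
    static Set<String> listFileNames(Path folder) throws IOException {
        try (Stream<Path> entries = Files.list(folder)) {
            return entries
                    .filter(Files::isRegularFile)
                    .map(p -> p.getFileName().toString())
                    .collect(Collectors.toCollection(TreeSet::new));
        }
    }

    // Return the filenames present in both folders, in sorted order.
    static Set<String> findDuplicates(Path first, Path second) throws IOException {
        Set<String> duplicates = listFileNames(first);
        duplicates.retainAll(listFileNames(second));
        return duplicates;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical default folders; pass real paths as arguments instead.
        Path first = Path.of(args.length > 0 ? args[0] : "folderA");
        Path second = Path.of(args.length > 1 ? args[1] : "folderB");
        // Print each duplicate filename on its own line.
        findDuplicates(first, second).forEach(System.out::println);
    }
}
```

Comparing by name only is cheap but says nothing about file contents; two files with the same name may still differ, so a merge would need a further check (size or checksum, for instance) before one copy is discarded.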

For more technologies supported by Talend, see Talend components.