This scenario applies only to Talend products with Big Data.
For more technologies supported by Talend, see Talend components.
This scenario describes a four-component Job that filters a list of customers to find out customers from a particular country, and saves the result list to a local file. Before the input data is filtered, duplicate entries are first removed from the list.
The input file contains three columns: Name, Country, and Age, and it has some duplicate entries, as shown below:
Mario;PuertoRico;49 Mike;USA;22 Ricky;PuertoRico;37 Silvia;Spain;20 Billy;Canada;21 Ricky;PuertoRico;37 Romeo;UK;19 Natasha;Russia;25 Juan;Cuba;23 Bob;Jamaica;55 Mario;PuertoRico;49