This scenario aims at helping you set up and use connectors in a pipeline. You are
advised to adapt it to your environment and use case.
Procedure
-
Click .
-
In the panel that opens, select the type of connection you
want to create.
Example
FTP
-
Select your engine
in the Engine list.
Note:
- It is recommended to use the Remote Engine Gen2 rather than
the Cloud Engine for Design for advanced
processing of data.
- If no Remote Engine Gen2 has been created from Talend Management Console or if it exists but appears as unavailable
which means it is not up and running, you will not be able to select
a Connection type in the list nor to
save the new connection.
- The list of available connection types depends on the engine you
have selected.
-
Select the type of connection you want to create.
Here, select FTP.
-
Fill in the connection properties to access your FTP server as described in FTP properties, check the connection and click Add
dataset.
-
In the Add a new dataset panel, fill in the required
properties to point to the FTP directory in which your file is located and click
View sample to see a preview of your dataset sample.
Here, the file to be retrieved is a CSV file listing restaurants in Baltimore
located in a
Talend/Files folder:
-
Click Validate to save your dataset.
-
On the same FTP connection, add another dataset that will be
used as destination in your pipeline. Here you are pointing to a
Talend/Out folder.
-
Click Add
pipeline on the Pipelines page. Your new pipeline opens.
-
Give the pipeline a meaningful name.
Example
Processing and moving files on FTP
server
-
Click ADD SOURCE and
select your source dataset, restaurant on FTP
dir in the panel that opens.
-
Click to
add processors to the pipeline, for example an Aggregate processor to list all the restaurant addresses.
-
Configure the processor. In the Operations area:
-
Select .location in the Field
path list.
-
Select List in the
Operation list.
-
Enter the name of the Output field name, here
address.
-
Save your configuration.
The restaurant addresses have been aggregated in one single record.
-
Click to
add a Normalize processor to the pipeline in order to flatten the address record
and split every entry into a separate record.
-
Configure the processor. In the Operations area:
-
Select .address in the Field path to
normalize list.
-
Enable the Is list option.
-
Save your configuration.
-
Click the ADD
DESTINATION item on the pipeline to open the panel allowing to
select the FTP output directory in which your output file will be
uploaded.
-
Give a meaningful name to the destination; addresses on FTP out dir for example.
-
In the Configuration tab of the destination, check that the
file you want to upload does not exceed the size limit.
-
Click Save to
save your configuration.
-
On the top toolbar of Talend Cloud Pipeline Designer,
click the Run button to open the panel allowing you to select
your run profile.
-
Select your run profile in the list (for more information, see Run profiles), then click Run to
run your pipeline.
Results
Your pipeline is being executed, the restaurant data that was stored on an FTP directory
has been processed and the output file is uploaded to the FTP target directory you have specified: