Replicating a list of leads and processing the two output flows differently
Before you begin
-
You have previously created a connection to the system storing your source data.
Here, a database connection.
-
You have previously added the dataset holding your source data.
Download and extract the file: filter-python-customers.zip. It contains lead data including ID, name, revenue, etc.
-
You also have created the connection and the related dataset that will hold the processed data.
Here, a file stored on Amazon S3 and a file stored on HDFS.
Procedure
Results
Your pipeline is being executed, the records are duplicated and filtered, and the output flows are sent to the target systems you have indicated.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!