Before you begin
-
Make sure your user or user group has the correct permissions to access the Amazon S3 resources.
If you do not have these permissions you can try one of the following options.- (recommended) Ask the administrator who manages your Amazon account to give you/your user the correct S3 permissions.
- Implement your access policy yourself by following the Amazon documentation if you are allowed to do so.
- (not recommended) Attach the AmazonS3FullAccess policy to your group/your user through the IAM console. This allows you to read and write to S3 resources without restrictions to a specific bucket. However this is a quick fix that is not recommended by Talend.
Note: The default error that displays when trying to access S3 resources without sufficient permissions is Bad Gateway. -
Retrieve the financial_transactions.avro file from the Downloads tab in the left panel of this page.
- Create a Remote Engine Gen2 and its run profile from Talend Cloud Management Console.
The Cloud Engine for Design and its corresponding run profile come embedded by default in Talend Cloud Management Console to help users quickly get started with the app, but it is recommended to install the secure Remote Engine Gen2 for advanced processing of data.
Procedure
Results
On the Datasets page, the new dataset is added to the list and
can be used to reproduce the use
case you have created previously.
Before executing this pipeline, select whether you want to overwrite the existing
data on S3 or merge them in the configuration tab of the destination dataset:
Once your pipeline is executed, the updated data will be visible in the file located on Amazon S3.