Starting a new Amazon EMR cluster - 7.0

Amazon EMR

Author: Talend Documentation Team
Version: 7.0
Products: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Open Studio for Big Data, Talend Open Studio for Data Integration, Talend Open Studio for ESB, Talend Open Studio for MDM, Talend Real-Time Big Data Platform
Platform: Talend Studio

Configure the tAmazonEMRManage component to start a new Amazon EMR cluster.

Procedure

  1. Double-click the tAmazonEMRManage component to open its Basic settings view.
  2. In the Access Key and Secret Key fields, enter the AWS authentication credentials of the account used to start the cluster.
  3. From the Action list, select Start to start a cluster.
  4. Select the AWS region from the Region drop-down list. In this example, it is Asia Pacific (Tokyo).
  5. In the Cluster name field, enter the name of the cluster to be started. In this example, it is talend-doc-emr-cluster.
  6. From the Cluster version and Application drop-down lists, select the version of the cluster and the application to be installed on it.
  7. Select the Enable log check box and, in the field that appears, specify the path to the S3 bucket or folder where you want Amazon EMR to write the log data. In this example, it is s3://talend-doc-emr-bucket.
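
For readers who want to see what these settings correspond to outside Talend Studio, the following is a minimal Python sketch that expresses the same choices as an Amazon EMR RunJobFlow request sent through the boto3 library. It is an illustration under stated assumptions, not a description of how tAmazonEMRManage works internally. The region code ap-northeast-1 is the AWS code for Asia Pacific (Tokyo); the release label, application name, instance types, instance count, and the helper names build_run_job_flow_request and start_cluster are hypothetical values chosen for the example, and the two role names are the AWS default EMR roles, which may differ in your account.

```python
"""Sketch: the procedure's settings expressed as an EMR RunJobFlow request."""


def build_run_job_flow_request(cluster_name, release_label, applications, log_uri=None):
    """Return keyword arguments for the boto3 EMR client's run_job_flow call."""
    request = {
        "Name": cluster_name,                       # step 5: cluster name
        "ReleaseLabel": release_label,              # step 6: cluster version
        "Applications": [{"Name": app} for app in applications],  # step 6
        "Instances": {
            # Assumed sizing; the procedure above does not specify it.
            "MasterInstanceType": "m4.large",
            "SlaveInstanceType": "m4.large",
            "InstanceCount": 3,
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        # AWS default EMR role names; replace them if your account differs.
        "JobFlowRole": "EMR_EC2_DefaultRole",
        "ServiceRole": "EMR_DefaultRole",
    }
    if log_uri is not None:
        request["LogUri"] = log_uri                 # step 7: log location
    return request


def start_cluster(access_key, secret_key, region, request):
    """Send the request to Amazon EMR and return the new cluster ID."""
    import boto3  # third-party package, imported only when a cluster is started

    client = boto3.client(
        "emr",
        region_name=region,                         # step 4: region
        aws_access_key_id=access_key,               # step 2: access key
        aws_secret_access_key=secret_key,           # step 2: secret key
    )
    response = client.run_job_flow(**request)
    return response["JobFlowId"]


REGION = "ap-northeast-1"  # Asia Pacific (Tokyo)

request = build_run_job_flow_request(
    cluster_name="talend-doc-emr-cluster",
    release_label="emr-5.15.0",       # hypothetical version
    applications=["Hadoop"],          # hypothetical application
    log_uri="s3://talend-doc-emr-bucket",
)
```

Building the request is kept separate from sending it so that the settings can be inspected without contacting AWS. Calling start_cluster(access_key, secret_key, REGION, request) with valid credentials would submit the request and start a billable cluster.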