Creating an Amazon EMR cluster management Job - 6.5

Amazon EMR

author: Talend Documentation Team
EnrichVersion: 6.5
EnrichProdName:
  Talend Big Data
  Talend Big Data Platform
  Talend Data Fabric
  Talend Data Integration
  Talend Data Management Platform
  Talend Data Services Platform
  Talend ESB
  Talend MDM Platform
  Talend Open Studio for Big Data
  Talend Open Studio for Data Integration
  Talend Open Studio for ESB
  Talend Open Studio for MDM
  Talend Real-Time Big Data Platform
task:
  Data Governance > Third-party systems > Amazon services (Integration) > Amazon EMR components
  Data Quality and Preparation > Third-party systems > Amazon services (Integration) > Amazon EMR components
  Design and Development > Third-party systems > Amazon services (Integration) > Amazon EMR components
EnrichPlatform: Talend Studio

Create a Job that starts a new Amazon EMR cluster, resizes it, and then lists the ID and name of each instance group in the cluster.

Procedure

  1. Create a new Job and add a tAmazonEMRManage component, a tAmazonEMRResize component, a tAmazonEMRListInstances component, and a tJava component by typing their names in the design workspace or dropping them from the Palette.
  2. Link the tAmazonEMRManage component to the tAmazonEMRResize component using a Trigger > OnSubjobOk connection.
  3. Link the tAmazonEMRResize component to the tAmazonEMRListInstances component using a Trigger > OnSubjobOk connection.
  4. Link the tAmazonEMRListInstances component to the tJava component using a Row > Iterate connection.
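
The Row > Iterate connection in step 4 means that the tJava code runs once for each instance group that tAmazonEMRListInstances returns. The following is a minimal standalone sketch of that behavior, not code generated by the Studio. It simulates globalMap with a plain HashMap, and the two key names (tAmazonEMRListInstances_1_CURRENT_GROUP_ID and tAmazonEMRListInstances_1_CURRENT_GROUP_NAME), the class name, and the sample group values are assumptions made for illustration. Check the Outline view of your Job for the variables your component actually exposes.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

class EmrIterateSketch {

    // Assumed globalMap keys; verify the real variable names in your Studio.
    static final String GROUP_ID_KEY = "tAmazonEMRListInstances_1_CURRENT_GROUP_ID";
    static final String GROUP_NAME_KEY = "tAmazonEMRListInstances_1_CURRENT_GROUP_NAME";

    // Mirrors what a tJava body could do on each iteration:
    // read the current group's values from globalMap and format them.
    static String describeCurrentGroup(Map<String, Object> globalMap) {
        String id = (String) globalMap.get(GROUP_ID_KEY);
        String name = (String) globalMap.get(GROUP_NAME_KEY);
        return "Instance group " + id + ": " + name;
    }

    public static void main(String[] args) {
        // Hypothetical instance groups standing in for the cluster's real ones.
        Map<String, String> groups = new LinkedHashMap<>();
        groups.put("ig-EXAMPLE1", "Master");
        groups.put("ig-EXAMPLE2", "Core");

        Map<String, Object> globalMap = new HashMap<>();
        for (Map.Entry<String, String> group : groups.entrySet()) {
            // Simulated Row > Iterate: the upstream component updates
            // globalMap, then the downstream code runs once.
            globalMap.put(GROUP_ID_KEY, group.getKey());
            globalMap.put(GROUP_NAME_KEY, group.getValue());
            System.out.println(describeCurrentGroup(globalMap));
        }
    }
}
```

In an actual Job, only the equivalent of the describeCurrentGroup body would go in the tJava component, because the Studio supplies globalMap and drives the iteration itself.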