tAmazonEMRResize - 6.3

Talend Components Reference Guide

Function

tAmazonEMRResize adds or resizes a task instance group in a cluster on Amazon EMR (Elastic MapReduce).

Purpose

tAmazonEMRResize allows you to resize a cluster on Amazon EMR.

tAmazonEMRResize properties

Component family

Cloud/Amazon/EMR

Basic settings

Access key and Secret key

Specify the access keys required to access Amazon Web Services: enter the access key ID in the Access Key field and the secret access key in the Secret Key field. For more information about AWS access keys, see Access keys (access key ID and secret access key).

To enter the secret key, click the [...] button next to the Secret Key field, enter the key between double quotes in the pop-up dialog box, and click OK to save the settings.

Inherit credentials from AWS role

Select this check box to use the instance profile credentials. These credentials can be used on Amazon EC2 instances and are delivered through the Amazon EC2 metadata service. To use this option, your Job must be running within Amazon EC2 or another service that can use IAM roles to access resources. For more information, see Using an IAM Role to Grant Permissions to Applications Running on Amazon EC2 Instances.

Assume role

Select this check box and specify the values for the following parameters used to create a new assumed role session.

  • Role ARN: the Amazon Resource Name (ARN) of the role to assume.

  • Role session name: an identifier for the assumed role session.

  • Session duration (minutes): the duration (in minutes) for which the assumed role session remains active.

For more information about assuming roles, see AssumeRole.
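
The Session duration (minutes) parameter is expressed in minutes, whereas the AWS STS AssumeRole operation takes its duration in seconds and does not accept sessions shorter than 900 seconds (15 minutes). The following is a minimal sketch of that conversion, assuming the component simply multiplies the field value by 60; the class and method names are hypothetical and are not part of Talend or the AWS SDK.

```java
import java.time.Duration;

// Hypothetical helper for illustration only; not part of Talend or the AWS SDK.
class SessionDurationSketch {
    // AWS STS AssumeRole does not accept sessions shorter than 900 seconds (15 minutes).
    static final int MIN_MINUTES = 15;

    // Converts a "Session duration (minutes)" value to seconds,
    // rejecting values below the STS lower bound.
    static int toDurationSeconds(int minutes) {
        if (minutes < MIN_MINUTES) {
            throw new IllegalArgumentException(
                "Session duration must be at least " + MIN_MINUTES + " minutes, got " + minutes);
        }
        return Math.toIntExact(Duration.ofMinutes(minutes).getSeconds());
    }
}
```

For example, SessionDurationSketch.toDurationSeconds(60) returns 3600. The upper bound depends on the maximum session duration configured on the role itself, so it is not checked here.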

Configuration

Action

Select an action to be performed from the drop-down list.

  • Add task instance group: add a task instance group in a cluster.

  • Resize task instance group: resize a task instance group in a cluster.

Region

Specify the AWS region by selecting a region name from the list or entering a region between double quotation marks (for example "us-east-1"). For more information about how to specify the AWS region, see Choose an AWS Region.

Cluster id

Enter the ID of the cluster to be resized.

Group name

Enter the name of the task instance group to be added.

This field is available only when Add task instance group is selected from the Action drop-down list.

Group id

Enter the ID of the task instance group to be resized.

This field is available only when Resize task instance group is selected from the Action drop-down list.
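
The relationship between the selected action and the identifier fields described above can be summarized in a short sketch. This is only an illustration of the availability rules stated in this section; the enum and class names are hypothetical and do not correspond to any Talend API.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical model of the Action drop-down list; names are illustrative only.
class ResizeActionSketch {
    enum Action { ADD_TASK_INSTANCE_GROUP, RESIZE_TASK_INSTANCE_GROUP }

    // Returns the identifier fields that apply to the given action:
    // Cluster id is always needed, Group name applies only when adding,
    // and Group id applies only when resizing.
    static List<String> identifierFields(Action action) {
        switch (action) {
            case ADD_TASK_INSTANCE_GROUP:
                return Arrays.asList("Cluster id", "Group name");
            case RESIZE_TASK_INSTANCE_GROUP:
                return Arrays.asList("Cluster id", "Group id");
            default:
                throw new IllegalArgumentException("Unknown action: " + action);
        }
    }
}
```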

Instance Configuration

Instance count

Enter the number of instances for the task instance group.

Task instance type

From the drop-down list, select the instance type for all instances in the task instance group to be added.

This list is available only when Add task instance group is selected from the Action drop-down list.

Request spot

Select this check box to launch Spot instances, and in the Bid price($) field displayed, enter the maximum hourly rate (in dollars) you are willing to pay per instance.

This check box is available only when Add task instance group is selected from the Action drop-down list.
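
Because the bid price is a maximum hourly rate per instance, multiplying it by the instance count gives an upper bound on the hourly spend for the task instance group. The sketch below shows that arithmetic; the class and method names are hypothetical, and the result is a ceiling under the assumption that every instance in the group is a Spot instance, not a forecast of the actual charge.

```java
import java.math.BigDecimal;

// Hypothetical helper for illustration only; not part of Talend or the AWS SDK.
class SpotBidSketch {
    // Upper bound on hourly spend: bid price per instance times instance count.
    // The bid is passed as a string so the decimal value is represented exactly.
    static BigDecimal maxHourlySpend(String bidPriceDollars, int instanceCount) {
        if (instanceCount < 0) {
            throw new IllegalArgumentException("Instance count must not be negative");
        }
        return new BigDecimal(bidPriceDollars).multiply(BigDecimal.valueOf(instanceCount));
    }
}
```

For example, a bid of "0.05" with an instance count of 4 gives a ceiling of 0.20 dollars per hour.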

Advanced settings

STS Endpoint

Select this check box and, in the field displayed, specify the AWS Security Token Service endpoint from which session credentials are retrieved.

This check box is available only when the Assume role check box is selected.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at the Job level as well as at each component level.

Global Variables

TASK_GROUP_ID: the ID of the task instance group. This is an After variable and it returns a string.

TASK_GROUP_NAME: the name of the task instance group. This is an After variable and it returns a string.

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the component has a Die on error check box and that check box is cleared.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio User Guide.
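
As an illustration of reading the After variables listed above, the sketch below follows the usual Studio convention of storing each variable in the Job's globalMap under a key made of the component label, an underscore, and the variable name. The label tAmazonEMRResize_1, the sample values, and the class itself are assumptions made for this example; a plain java.util.Map stands in for the globalMap of a generated Job.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for code you might place in a tJava component
// that runs after tAmazonEMRResize; not generated by the Studio.
class GlobalVariablesSketch {
    // Builds the globalMap key, for example "tAmazonEMRResize_1_TASK_GROUP_ID".
    static String key(String componentLabel, String variable) {
        return componentLabel + "_" + variable;
    }

    // Summarizes the outcome using the three After variables of the component.
    static String describe(Map<String, Object> globalMap, String componentLabel) {
        String error = (String) globalMap.get(key(componentLabel, "ERROR_MESSAGE"));
        if (error != null) {
            return "failed: " + error;
        }
        String id = (String) globalMap.get(key(componentLabel, "TASK_GROUP_ID"));
        String name = (String) globalMap.get(key(componentLabel, "TASK_GROUP_NAME"));
        return "task group " + name + " (" + id + ")";
    }

    // Sample map with made-up values, standing in for a real Job's globalMap.
    static Map<String, Object> sampleGlobalMap() {
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tAmazonEMRResize_1_TASK_GROUP_ID", "ig-EXAMPLE");
        globalMap.put("tAmazonEMRResize_1_TASK_GROUP_NAME", "extra-tasks");
        return globalMap;
    }
}
```

Inside a real Job, the equivalent read would be an expression such as ((String) globalMap.get("tAmazonEMRResize_1_TASK_GROUP_ID")), with the label adjusted to match the component instance in your design workspace.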

Usage

tAmazonEMRResize is usually used as a standalone component.

Log4j

If you are using a subscription-based version of the Studio, the activity of this component can be logged using the log4j feature. For more information on this feature, see Talend Studio User Guide.

For more information on the log4j logging levels, see the Apache documentation at http://logging.apache.org/log4j/1.2/apidocs/org/apache/log4j/Level.html.

Related scenario

No scenario is available for the Standard version of this component yet.