Configuring and running your Spark Job with CDP Public Cloud Data Hub on AWS
Talend Studio allows you to deploy and execute your Spark Streaming and Spark Batch Jobs on a remote Talend JobServer with a CDP Public Cloud Data Hub on AWS instance.
Before you begin
- The Talend JobServer settings are defined correctly in Talend Studio to run your Job remotely. For more information see, Configuring remote execution (Talend > Run/Debug).
- The AWS instance environment is defined in Cloudera Management Console. For more information, see Register an AWS environment from the official Cloudera documentation.
- The cluster on AWS is defined in the Cloudera Management Console. For more information, see Create a custom cluster on AWS from the official Cloudera documentation.
Procedure
Results
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!