Joining movie and director information using an Apache Spark Batch Job - 7.2

Talend Data Fabric Getting Started Guide

author
Talend Documentation Team
EnrichVersion
7.2
EnrichProdName
Talend Data Fabric
task
Data Quality and Preparation > Cleansing data
Data Quality and Preparation > Profiling data
Design and Development
Installation and Upgrade
EnrichPlatform
Talend Administration Center
Talend DQ Portal
Talend Installer
Talend Runtime
Talend Studio
This scenario demonstrates:
  1. How to create a Talend Job for Apache Spark Batch. See Creating the Spark Batch Job for details.

  2. How to drop and link the components to be used in a Spark Batch Job. See Dropping and linking Spark components for details.

  3. How to configure the input components using the related metadata from the Repository. See Configuring the input data for details.

  4. How to configure the transformation to join the input data. See Configuring the data transformation for details.

  5. How to write the transformed data to ADLS. See Writing the output to Azure ADLS Gen1 for details.