Joining movie and director information using an Apache Spark Batch Job - 8.0

Talend Real-Time Big Data Platform Getting Started Guide

Version
8.0
Language
English
Operating system
Real-Time Big Data Platform
Product
Talend Real-Time Big Data Platform
Module
Talend Administration Center
Talend Installer
Talend Runtime
Talend Studio
Content
Data Quality and Preparation > Cleansing data
Data Quality and Preparation > Profiling data
Design and Development
Installation and Upgrade
Last publication date
2024-03-13
This scenario demonstrates:
  1. How to create a Talend Job for Apache Spark Batch. See Creating the Spark Batch Job for details.

  2. How to drop and link the components to be used in a Spark Batch Job. See Dropping and linking Spark components for details.

  3. How to configure the input components using the related metadata from the Repository. See Configuring the input data for details.

  4. How to configure the transformation to join the input data. See Configuring the data transformation for details.

  5. How to write the transformed data to ADLS. See Writing the output to Azure ADLS Gen1 for details.