Create the first Job - 7.0

Big Data Job Examples

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Design and Development > Designing Jobs
Design and Development > Designing Jobs > Hadoop distributions
Design and Development > Designing Jobs > Job Frameworks > Standard
EnrichPlatform
Talend Studio
Follow these steps to create the first Job, which will set up an HCatalog database to manage the access log file to be analyzed.

Procedure

  1. In the Repository tree view, expand the Job Designs node, right-click Standard Jobs and select Create folder to create a new folder to group the Jobs that you will create.
  2. In the Repository tree view, right-click Job Designs and select Create folder to create a new folder to group the Jobs that you will create.
  3. Right-click the folder you just created, and select Create job to create your first Job. Name it A_HCatalog_Create to identify its role and execution order among the example Jobs.

    You can also provide a short description for your Job, which will appear as a tooltip when you move your mouse over the Job.

  4. Drop a tHDFSDelete and two tHCatalogOperation components from the Palette onto the design workspace.
  5. Connect the three components using Trigger > On Subjob Ok connections.

    The HDFS subjob will be used to remove any previous results of this demo example, if any, to prevent possible errors in Job execution, and the two HCatalog subjobs will be used to create an HCatalog database and set up an HCatalog table and partition in the created HCatalog table, respectively.

  6. Label these components to better identify their functionality.