How to recover Job execution in case of failure - 6.3

Talend Data Fabric Studio User Guide

English (United States)
Talend Data Fabric
Talend Studio
Data Quality and Preparation
Design and Development

Talend Studio along with Talend Administration Center offer the concept of "recovery checkpoints" as Job execution restore facility. Checkpoints are taken in anticipation of the potential need to restart a Job execution beyond its starting point.

General concept

Job execution processes can be time-consuming, as are backup and restore operations. If checkpointing is possible, this will minimize the amount of time and effort wasted when the process of Job execution is interrupted by failure.

With Talend Studio, you can set checkpoints in your Job design at specified intervals (On Subjob Ok and On Subjob Error connections) in terms of bulks of the data flow.

With Talend Administration Center, and in case of failure during Job execution, the execution process can be restarted from the latest checkpoint previous to the failure rather than from the beginning.

A two-step procedure

The only prerequisite for this facility offered in Talend Studio, is to have trigger connections of the types On Subjob OK and On Subjob Error in your Job design.

To be able to recover Job execution in case of failure, you need to:

  1. Define checkpoints manually on one or more of the trigger connections you use in the Job you design in Talend Studio.

    For more information on how to initiate recovery checkpoints, see How to set checkpoints on trigger connections.

  2. In case of failure during the execution of the designed Job, recover Job execution from the latest checkpoint previous to the failure through the Error recovery Management page in Talend Administration Center.

    For more information, see Talend Administration Center User Guide.