Scheduling incremental data profiling and sampling - Cloud

Talend Cloud Data Catalog User Guide

Version
Cloud
Language
English
Product
Talend Cloud
Module
Talend Data Catalog
Content
Data Governance
Last publication date
2023-11-13

You can schedule to run the process periodically and limit it either by maximum duration or amount of data.

The bridge saves profiling results in the MIMB cache as soon as possible. When it profiles multiple files, it saves profiling results of each file as soon as they are ready.

If the bridge fails, it can restart at the latest point available in the cache. When the bridge detects that the file did not change, it can update the profile time for the file.

The bridge tries to return as much as possible to the caller when it completes, fails or reaches the time or volume limit.

Before you begin

You have been assigned an object role with the Data Management capability.

Procedure

  1. In the Import Setup tab, select the True value for the Incremental Import parameter.
  2. Specify the following miscellaneous options.
    • -tl 3600s processing time limit in s -seconds m - minutes or h hours;
    • -fl 1000 processing files count limit.

    Refer to the parameter information by expanding the Help panel.