Component-specific settings for tUniqRow - Cloud - 8.0

Talend Job Script Reference Guide

Last publication date
2024-02-22

The following table describes the Job script functions and parameters that you can define in the setSettings {} function of the component.

Function/parameter Description Mandatory?

UNIQUE_KEY {}

In this function, use the following parameters to specify each column on which deduplication will be carried out:

  • SCHEMA_COLUMN: Use this parameter to specify the column name.
  • KEY_ATTRIBUTE: Set this parameter to true to carry out deduplication on the specified column.

    By default, this parameter is set to false.

  • CASE_SENSITIVE: Set this parameter to true to distinguish between uppercase and lowercase when comparing values in the specified column.

    By default, this parameter is set to false.

Yes
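The effect of KEY_ATTRIBUTE and CASE_SENSITIVE can be illustrated with a minimal Java sketch. It is not the component's generated code; the class and method names (UniqKeySketch, keyOf, uniques) are hypothetical, and rows are simplified to string arrays. Only the columns flagged as keys take part in the comparison, and a key column with CASE_SENSITIVE set to false is compared after lowercasing.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not Talend's generated code.
class UniqKeySketch {

    // Builds the comparison key of a row from the key columns.
    // keyIdx: indexes of the columns with KEY_ATTRIBUTE set to true.
    // caseSensitive[i]: the CASE_SENSITIVE setting of column keyIdx[i].
    static List<String> keyOf(String[] row, int[] keyIdx, boolean[] caseSensitive) {
        List<String> key = new ArrayList<>();
        for (int i = 0; i < keyIdx.length; i++) {
            String value = row[keyIdx[i]];
            if (value != null && !caseSensitive[i]) {
                value = value.toLowerCase(Locale.ROOT);
            }
            key.add(value);
        }
        return key;
    }

    // Keeps the first row seen for each distinct key, in input order.
    static List<String[]> uniques(List<String[]> rows, int[] keyIdx, boolean[] caseSensitive) {
        Set<List<String>> seen = new HashSet<>();
        List<String[]> out = new ArrayList<>();
        for (String[] row : rows) {
            if (seen.add(keyOf(row, keyIdx, caseSensitive))) {
                out.add(row);
            }
        }
        return out;
    }
}
```

With the rows ("1", "Alice"), ("2", "alice"), ("3", "Bob") and the second column as the only key, the sketch keeps two rows when the comparison is case-insensitive and all three when it is case-sensitive.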

ONLY_ONCE_EACH_DUPLICATED_KEY

Set this parameter to true to send only the first duplicated entry for each value of the column(s) defined as key(s) to the output flow for duplicates.

By default, this parameter is set to false.

No
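One reading of this parameter can be sketched in Java as follows. The sketch assumes that, when the parameter is false, every repeated occurrence of a key goes to the duplicates flow, and that, when it is true, only the first repeated occurrence does; the description above does not spell out the false case, so treat that part as an assumption. The class and method names are hypothetical, and rows are reduced to their key values.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not Talend's generated code.
class DuplicatesFlowSketch {

    // Returns the keys that would reach the duplicates flow, in input order.
    static List<String> duplicates(List<String> keys, boolean onlyOnceEachDuplicatedKey) {
        Map<String, Integer> occurrences = new HashMap<>();
        List<String> duplicatesFlow = new ArrayList<>();
        for (String key : keys) {
            int count = occurrences.merge(key, 1, Integer::sum);
            boolean firstDuplicate = count == 2;
            boolean laterDuplicate = count > 2;
            // The first duplicate is always emitted; later ones only when the
            // option is off (assumed behavior).
            if (firstDuplicate || (laterDuplicate && !onlyOnceEachDuplicatedKey)) {
                duplicatesFlow.add(key);
            }
        }
        return duplicatesFlow;
    }
}
```

For the key sequence a, b, a, a, b, the sketch returns a, a, b with the option off and a, b with the option on.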

IS_VIRTUAL_COMPONENT

Set this parameter to true to enable generating temporary files on the hard disk when processing a large amount of data. This helps to prevent Job execution failure caused by memory overflow.

With this parameter set to true, you also need to define the BUFFER_SIZE and TEMP_DIRECTORY parameters.

By default, this parameter is set to false.

No

BUFFER_SIZE

Use this parameter to specify the buffer size, namely the number of rows that can be buffered in the memory before a temporary file is generated on the hard disk.

Acceptable values:

  • S: small, 0.5 million rows
  • M: medium, 1 million rows
  • B: big, 2 million rows

Yes when IS_VIRTUAL_COMPONENT is set to true

TEMP_DIRECTORY

Use this parameter to specify the location where the temporary files should be stored.

Warning:

Make sure that you specify an existing directory for temporary files; otherwise your Job execution will fail.

Yes when IS_VIRTUAL_COMPONENT is set to true
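Because a missing temporary directory makes the Job fail at execution time, it can be worth checking the two disk-related values before the Job script is generated or submitted. The following Java sketch is a hypothetical pre-flight helper, not part of Talend: bufferRows maps the documented BUFFER_SIZE codes to their row counts, and tempDirectoryUsable checks that the TEMP_DIRECTORY value points to an existing directory. The writability check is an extra assumption beyond what the table requires.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical pre-flight helper; not part of Talend.
class VirtualComponentCheck {

    // Maps a BUFFER_SIZE code to the number of rows buffered in memory
    // before a temporary file is written, as listed in the table.
    static int bufferRows(String bufferSize) {
        switch (bufferSize) {
            case "S": return 500_000;
            case "M": return 1_000_000;
            case "B": return 2_000_000;
            default:
                throw new IllegalArgumentException(
                        "BUFFER_SIZE must be S, M, or B but was: " + bufferSize);
        }
    }

    // Returns true if the path exists, is a directory, and is writable.
    static boolean tempDirectoryUsable(String tempDirectory) {
        Path path = Paths.get(tempDirectory);
        return Files.isDirectory(path) && Files.isWritable(path);
    }
}
```

A caller would run both checks only when IS_VIRTUAL_COMPONENT is set to true, since the two parameters are mandatory only in that case.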

CHANGE_HASH_AND_EQUALS_FOR_BIGDECIMAL

Set this parameter to true to ignore trailing zeros in BigDecimal data, so that values such as 1.0 and 1.00 are treated as the same key.

By default, this parameter is set to false.

No
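The reason this option exists is visible in standard Java: java.math.BigDecimal.equals compares both value and scale, so 1.0 and 1.00 are not equal and usually hash differently. The sketch below shows one way to neutralize the scale, using stripTrailingZeros; how the component implements the option internally is not stated here, so the class and method names are hypothetical.

```java
import java.math.BigDecimal;

// Hypothetical illustration only; not Talend's generated code.
class BigDecimalKeySketch {

    // Compares two values as deduplication keys. When ignoreTrailingZeros
    // is true, both values are normalized first so that scale differences
    // such as 1.0 versus 1.00 no longer matter.
    static boolean sameKey(BigDecimal a, BigDecimal b, boolean ignoreTrailingZeros) {
        if (ignoreTrailingZeros) {
            a = a.stripTrailingZeros();
            b = b.stripTrailingZeros();
        }
        // equals and hashCode must agree for hash-based deduplication.
        return a.equals(b) && a.hashCode() == b.hashCode();
    }
}
```

With the option off, 1.0 and 1.00 count as two different keys; with it on, they collapse into one, while 1.0 and 1.1 remain distinct in both cases.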

TSTATCATCHER_STATS

Set this parameter to true to gather the processing metadata at the Job level as well as at each component level.

By default, this parameter is set to false.

No

LABEL

Use this parameter to specify a text label for the component.

No