What's new in R2020-05

Big Data

Feature

Description

Available in

Support for EMR 5.29 You can run Talend Jobs with the Amazon EMR distribution in version 5.29.

All Talend products with Big Data

Upsert existing Delta Lake tables with new data When you configure how to save the dataset in tDeltaLakeOutput, select Merge to upsert an existing Delta Lake table with new data from a data flow or from another Delta Lake table. New fields are available to configure which columns to merge and how to perform this merge.
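As a rough illustration of what an upsert (merge) means, the sketch below merges incoming rows into an existing table in plain Python. It is not how tDeltaLakeOutput or Delta Lake implements the operation; the `upsert` function, the `key_columns` parameter, and the sample rows are hypothetical names chosen for this example.

```python
from typing import Dict, Iterable, List, Tuple

Row = Dict[str, object]


def upsert(target: List[Row], source: Iterable[Row], key_columns: Tuple[str, ...]) -> List[Row]:
    """Return a new table: rows matched on key_columns are updated, others are inserted."""
    # Index copies of the existing rows by their merge key.
    merged = {tuple(row[c] for c in key_columns): dict(row) for row in target}
    for row in source:
        key = tuple(row[c] for c in key_columns)
        if key in merged:
            merged[key].update(row)   # matched: update the existing row
        else:
            merged[key] = dict(row)   # not matched: insert the new row
    return list(merged.values())


# Hypothetical sample data.
existing = [{"id": 1, "city": "Paris"}, {"id": 2, "city": "Lyon"}]
incoming = [{"id": 2, "city": "Nantes"}, {"id": 3, "city": "Lille"}]
result = upsert(existing, incoming, ("id",))
```

Here the row with `id` 2 is updated and the row with `id` 3 is appended, while the original `existing` list is left unchanged.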

All Talend products with Big Data

Check data consistency with EMR clusters When using tS3Configuration, enable the Use EMRFS consistent view option to use the EMR File System (EMRFS) consistent view. This option allows EMR clusters to check for list and read-after-write consistency for Amazon S3 objects that are written by or synced with EMRFS.

All Talend products with Big Data

Spark catalog configuration in tHiveConfiguration You must indicate a Spark implementation with the Spark catalog property in the configuration of tHiveConfiguration. The value to select depends on whether the Hive metastore is external to your cluster or not. This configuration prevents errors at runtime. This property is available in Spark Batch Jobs only.

All Talend products with Big Data

Support for Oracle 19c Oracle 19c is now supported by the following Big Data components.
Spark Batch:
  • tOracleConfiguration
  • tOracleInput
  • tOracleOutput
Spark Streaming:
  • tOracleConfiguration
  • tOracleLookupInput
  • tOracleOutput

All Talend products with Big Data

Advanced Assume Role configuration in DynamoDB components When you enable the Assume Role option in the tDynamoDBInput and tDynamoDBOutput components, you can now configure the following properties from the Advanced settings view to fine-tune your configuration:
  • Signing region (mandatory)
  • External Id
  • Serial number
  • Token code
  • Tags
  • IAM Policy ARNs
  • Policy
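The properties listed above appear to correspond to parameters of the AWS STS AssumeRole request. The sketch below, which assumes that correspondence, builds the keyword arguments that could be passed to the `assume_role` call of a boto3 STS client. The helper name, the role ARN, and the session name are hypothetical. The signing region is not part of the request itself; it would typically be set on the client instead.

```python
from typing import Any, Dict, List, Optional


def build_assume_role_request(
    role_arn: str,
    session_name: str,
    external_id: Optional[str] = None,
    serial_number: Optional[str] = None,
    token_code: Optional[str] = None,
    tags: Optional[Dict[str, str]] = None,
    policy_arns: Optional[List[str]] = None,
    policy: Optional[str] = None,
) -> Dict[str, Any]:
    """Build AssumeRole parameters, leaving out optional values that are not set."""
    request: Dict[str, Any] = {"RoleArn": role_arn, "RoleSessionName": session_name}
    optional = {
        "ExternalId": external_id,
        "SerialNumber": serial_number,
        "TokenCode": token_code,
        "Tags": [{"Key": k, "Value": v} for k, v in tags.items()] if tags else None,
        "PolicyArns": [{"arn": arn} for arn in policy_arns] if policy_arns else None,
        "Policy": policy,
    }
    request.update({name: value for name, value in optional.items() if value is not None})
    return request


# Hypothetical values for illustration only.
request = build_assume_role_request(
    role_arn="arn:aws:iam::123456789012:role/example-role",
    session_name="talend-job",
    external_id="example-external-id",
    tags={"project": "demo"},
)
```

With boto3 installed and credentials configured, the dictionary could be used as `sts_client.assume_role(**request)`.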

All Talend products with Big Data

Access data from a secondary index When you retrieve data from a table with the tDynamoDBInput component, you can specify a secondary index in the component configuration to improve the performance of queries and scans.
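For context, at the DynamoDB API level a secondary index is selected with the `IndexName` parameter of a Query or Scan request. The sketch below builds such query parameters in the shape expected by the low-level boto3 DynamoDB client. How tDynamoDBInput issues its requests is not described here; the helper, table, index, and attribute names are hypothetical.

```python
from typing import Any, Dict, Optional


def build_query(table: str, key_name: str, key_value: str,
                index_name: Optional[str] = None) -> Dict[str, Any]:
    """Build Query parameters; IndexName is added only when an index is given."""
    params: Dict[str, Any] = {
        "TableName": table,
        "KeyConditionExpression": "#k = :v",
        "ExpressionAttributeNames": {"#k": key_name},
        "ExpressionAttributeValues": {":v": {"S": key_value}},  # string-typed key
    }
    if index_name is not None:
        params["IndexName"] = index_name
    return params


# Hypothetical table and index names.
params = build_query("Orders", "customer_id", "C-42", index_name="customer_id-index")
```

With boto3 available, these parameters could be passed as `dynamodb_client.query(**params)`.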

All Talend products with Big Data

Data Integration

Feature

Description

Available in

Remote TAC connection improvement A user who authenticates through LDAP is now prompted for new login credentials in Talend Studio if the AD password has been changed.

All Talend on-premises products

Title bar improvement After you install a patch, the Talend Studio title bar is updated to show the patch version information.

All Talend products with Talend Studio

AWS SDK driver upgrade The AWS SDK driver for Redshift SSO connection in Talend Studio metadata has been upgraded.

All Talend products with Talend Studio

Context propagation enhancement Context propagation from reference projects has been enhanced for Data Integration. Any context variable update in the reference project can now be automatically synchronized to the main project.

All Talend products with Talend Studio

Advanced Assume Role configuration When you enable the Assume Role option, you can now configure the following properties from the Advanced settings view to fine-tune your configuration:
  • Signing region (mandatory)
  • External Id
  • Serial number
  • Token code
  • Tags
  • IAM Policy ARNs
  • Policy
This enhancement is available in the following components:
  • tAmazonEMRListInstances, tAmazonEMRManage, tAmazonEMRResize, tAmazonRedshiftManage
  • tRedshiftOutputBulk, tRedshiftOutputBulkExec
  • tS3BucketCreate, tS3BucketDelete, tS3BucketExist, tS3BucketList, tS3Connection, tS3Copy, tS3Delete, tS3Get, tS3List, tS3Put
  • tSQSConnection, tSQSInput, tSQSMessageChangeVisibility, tSQSMessageDelete, tSQSOutput, tSQSQueueAttributes, tSQSQueueCreate, tSQSQueueDelete, tSQSQueueList, tSQSQueuePurge

All Talend products with Talend Studio

tSQLDWH components renamed The tSQLDWH components have been renamed as follows:
  • tSQLDWHBulkExec renamed as tAzureSynapseBulkExec
  • tSQLDWHClose renamed as tAzureSynapseClose
  • tSQLDWHCommit renamed as tAzureSynapseCommit
  • tSQLDWHConnection renamed as tAzureSynapseConnection
  • tSQLDWHInput renamed as tAzureSynapseInput
  • tSQLDWHOutput renamed as tAzureSynapseOutput
  • tSQLDWHRollback renamed as tAzureSynapseRollback
  • tSQLDWHRow renamed as tAzureSynapseRow
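The renaming above follows one pattern: the `tSQLDWH` prefix becomes `tAzureSynapse` and the suffix is kept. As a minimal sketch, assuming you keep component names in your own notes or documentation and want to update them, the hypothetical helper below rewrites the eight listed names in a piece of text. It is not a Talend migration tool and says nothing about how Talend Studio handles existing Jobs.

```python
import re

NEW_PREFIX = "tAzureSynapse"
# Suffixes of the eight renamed components listed above.
SUFFIXES = ("BulkExec", "Close", "Commit", "Connection",
            "Input", "Output", "Rollback", "Row")

# Match only the listed names, not other identifiers that merely start with tSQLDWH.
_PATTERN = re.compile(
    r"(?<![A-Za-z0-9])tSQLDWH(" + "|".join(SUFFIXES) + r")(?![A-Za-z])"
)


def rename_components(text: str) -> str:
    """Replace old tSQLDWH component names with their tAzureSynapse equivalents."""
    return _PATTERN.sub(lambda m: NEW_PREFIX + m.group(1), text)


renamed = rename_components("tSQLDWHInput_1 -> tSQLDWHRow_2")
```

Names that are not in the list, such as a made-up `tSQLDWHUnknown`, are left untouched.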

All Talend products with Talend Studio

Support for Azure Data Lake Storage Gen2 The Azure Synapse components now support Azure Data Lake Storage Gen2. The tAzureSynapseBulkExec component provides the Data Lake Storage Gen2 option in the Azure Storage drop-down list in the Basic settings view and the Secure transfer required option in the Advanced settings view. The existing Data Lake Store option in the Azure Storage drop-down list has been renamed to Data Lake Storage Gen1.

All Talend products with Talend Studio

tELTTeradataMap: relationship operator updated The ELT Teradata Map Editor now uses the operators =, <=, <, >=, >, and <>. The corresponding previous operators (EQ, LE, LT, GE, GT, and NE) are deprecated, as shown in the following figures.
The existing:
Changed to:
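The correspondence between the deprecated and current operators can be written as a simple lookup table. The sketch below, a hypothetical helper rather than anything provided by Talend Studio, rewrites the deprecated operators in a condition string. It is deliberately naive: it matches upper-case tokens only and would also rewrite matching words inside string literals.

```python
import re

# Deprecated operator -> current operator, in the order given above.
OPERATOR_MAP = {"EQ": "=", "LE": "<=", "LT": "<", "GE": ">=", "GT": ">", "NE": "<>"}

_TOKEN = re.compile(r"\b(EQ|LE|LT|GE|GT|NE)\b")


def modernize(condition: str) -> str:
    """Replace deprecated comparison operators with their symbolic equivalents."""
    return _TOKEN.sub(lambda m: OPERATOR_MAP[m.group(1)], condition)


converted = modernize("a.id EQ b.id AND a.qty GE 10")
```

In this example `converted` holds `a.id = b.id AND a.qty >= 10`.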

All Talend products with Talend Studio

Support for Azure Active Directory authentication You can now use Azure Active Directory authentication when establishing connections using the following components.
  • tAzureSynapseBulkExec, tAzureSynapseConnection, tAzureSynapseInput, tAzureSynapseOutput, tAzureSynapseRow
  • tELTMSSqlMap
  • tMSSqlBulkExec, tMSSqlConnection, tMSSqlInput, tMSSqlOutput, tMSSqlOutputBulkExec, tMSSqlRow, tMSSqlSCD, tMSSqlSP
  • tCreateTable

All Talend products with Talend Studio

tAzureSynapseBulkExec: support for COPY statement for loading data

The tAzureSynapseBulkExec component now supports the COPY statement for loading data. The following changes were made to the component.

In the Basic settings view:
  • Load method drop-down list (new);
  • Azure storage drop-down list (updated);
  • Authentication method drop-down list (new);
  • SAS token field (new);
  • Endpoint suffix field (new);
  • External paths option (new).
In the Advanced settings view:
  • File type drop-down list (new);
  • Specify map to source table fields option (new);
  • First row field (new);
  • Field quote field (new);
  • Field terminator field (new);
  • Row terminator field (new);
  • Date format drop-down list (new);
  • Encoding drop-down list (new);
  • Identity insert option (new);
  • Max errors field (new);
  • Compressed by drop-down list (updated).

All Talend products with Talend Studio

Data Quality

Feature

Description

Available in

Components All Data Quality components can run on Databricks on Azure and AWS, except for tMatchIndex and tMatchIndexPredict.

Because these two components do not support Elasticsearch authentication, they cannot run on Databricks.

All Talend Platform and Data Fabric products

Application Integration

Feature

Description

Available in

REST Services Context variables are now fully supported in REST service provider and consumer endpoints in Data Services and Routes.

All Talend products with ESB

Microservices Microservices can now provide metrics to Prometheus.

All Talend products with ESB
