What's new in R2020-05 - Cloud - 7.3

Talend Release Notes

Version: Cloud, 7.3
Language: English
Product:
  • Talend Big Data
  • Talend Big Data Platform
  • Talend Cloud API Services Platform
  • Talend Cloud Big Data
  • Talend Cloud Big Data Platform
  • Talend Cloud Data Fabric
  • Talend Cloud Data Integration
  • Talend Cloud Data Management Platform
  • Talend Data Fabric
  • Talend Data Integration
  • Talend Data Management Platform
  • Talend Data Services Platform
  • Talend ESB
  • Talend MDM Platform
  • Talend Real-Time Big Data Platform
Module:
  • Talend Cloud API Designer
  • Talend Cloud API Tester
  • Talend Cloud Data Inventory
  • Talend Cloud Data Preparation
  • Talend Cloud Data Stewardship
  • Talend Cloud Pipeline Designer
  • Talend Data Preparation
  • Talend Data Stewardship
  • Talend Management Console
  • Talend Studio
Content: Installation and Upgrade, Release Notes
Last publication date: 2024-02-08

Big Data

Feature

Description

Available in

Support for EMR 5.29

You can run Talend Jobs with the Amazon EMR distribution in version 5.29.

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Upsert existing Delta Lake tables with new data

When you configure how to save the dataset in tDeltaLakeOutput, select Merge to upsert an existing Delta Lake table with new data from a data flow or from another Delta Lake table. New fields are available to configure which columns to merge and how to perform this merge.
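The upsert behavior described above can be illustrated with a minimal plain-Python sketch. It does not use Delta Lake or tDeltaLakeOutput; the function name `upsert` and the sample rows are hypothetical, and the sketch only shows the general idea of a merge: rows that match on the merge columns are updated, and rows that do not match are inserted.

```python
from typing import Dict, Iterable, List

Row = Dict[str, object]


def upsert(target: List[Row], source: Iterable[Row], key_columns: List[str]) -> List[Row]:
    """Return a new table where matched rows are updated and unmatched rows are inserted."""

    def key(row: Row) -> tuple:
        # The merge key is the tuple of values in the chosen merge columns.
        return tuple(row[c] for c in key_columns)

    # Index the existing rows by merge key (copies, so the input is not mutated).
    merged = {key(r): dict(r) for r in target}
    for row in source:
        k = key(row)
        if k in merged:
            merged[k].update(row)  # matched: update the existing row
        else:
            merged[k] = dict(row)  # not matched: insert as a new row
    return list(merged.values())


# Hypothetical sample data, for illustration only.
existing = [{"id": 1, "city": "Paris"}, {"id": 2, "city": "Lyon"}]
incoming = [{"id": 2, "city": "Nice"}, {"id": 3, "city": "Lille"}]
result = upsert(existing, incoming, ["id"])
```

In this sketch the row with `id` 2 is updated and the row with `id` 3 is appended, while the row with `id` 1 is left untouched.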

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Check data consistency with EMR clusters

When using tS3Configuration, enable the Use EMRFS consistent view option to use the EMR File System (EMRFS) consistent view. This option allows EMR clusters to check for list and read-after-write consistency for Amazon S3 objects that are written by or synced with EMRFS.

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Spark catalog configuration in tHiveConfiguration

You must now specify a Spark implementation with the Spark catalog property when you configure tHiveConfiguration. The value to select depends on whether the Hive metastore is external to your cluster. Setting this property prevents errors at runtime. This property is available in Spark Batch Jobs only.

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Support for Oracle 19c

Oracle 19c is now supported by the following Big Data components.
Spark Batch:
  • tOracleConfiguration
  • tOracleInput
  • tOracleOutput
Spark Streaming:
  • tOracleConfiguration
  • tOracleLookupInput
  • tOracleOutput

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Advanced Assume Role configuration in DynamoDB components

When you enable the Assume Role option in the tDynamoDBInput and tDynamoDBOutput components, you can now configure the following properties from the Advanced settings view to fine-tune your configuration:
  • Signing region (mandatory)
  • External Id
  • Serial number
  • Token code
  • Tags
  • IAM Policy ARNs
  • Policy
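The properties listed above resemble the optional parameters of the AWS STS AssumeRole operation. The following sketch assumes that correspondence, which these release notes do not state. It only builds a request dictionary; the helper name `build_assume_role_params` and all sample values are hypothetical.

```python
from typing import Dict, List, Optional


def build_assume_role_params(
    role_arn: str,
    session_name: str,
    external_id: Optional[str] = None,
    serial_number: Optional[str] = None,
    token_code: Optional[str] = None,
    tags: Optional[Dict[str, str]] = None,
    policy_arns: Optional[List[str]] = None,
    policy: Optional[str] = None,
) -> dict:
    """Build keyword arguments shaped like an STS AssumeRole request."""
    params: dict = {"RoleArn": role_arn, "RoleSessionName": session_name}
    if external_id:
        params["ExternalId"] = external_id
    if serial_number:
        params["SerialNumber"] = serial_number  # MFA device identifier
    if token_code:
        params["TokenCode"] = token_code  # one-time MFA code
    if tags:
        params["Tags"] = [{"Key": k, "Value": v} for k, v in tags.items()]
    if policy_arns:
        params["PolicyArns"] = [{"arn": arn} for arn in policy_arns]
    if policy:
        params["Policy"] = policy  # inline session policy as a JSON string
    return params


# Hypothetical values, for illustration only.
params = build_assume_role_params(
    "arn:aws:iam::123456789012:role/example-role",
    "example-session",
    external_id="example-external-id",
    tags={"project": "demo"},
)
```

With boto3, such a dictionary could be passed as `client.assume_role(**params)` on an STS client. The signing region is not a request parameter; in this sketch it would correspond to the region chosen when the STS client is created.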

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Access data from a secondary index

When you retrieve data from a table with the tDynamoDBInput component, you can specify a secondary index in the component configuration to improve the performance of queries and scans.
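As a rough illustration of what targeting a secondary index means at the DynamoDB API level, the sketch below builds the parameters of a Query request that names an index. It does not show how tDynamoDBInput is implemented; the helper name, table, index, and attribute names are all hypothetical.

```python
def build_index_query(table_name: str, index_name: str, key_name: str, key_value: str) -> dict:
    """Build parameters shaped like a DynamoDB Query request against a secondary index."""
    return {
        "TableName": table_name,
        "IndexName": index_name,  # query the index instead of the base table
        "KeyConditionExpression": "#k = :v",
        "ExpressionAttributeNames": {"#k": key_name},
        "ExpressionAttributeValues": {":v": {"S": key_value}},  # string-typed key value
    }


# Hypothetical names, for illustration only.
query = build_index_query("Orders", "CustomerIndex", "customer_id", "42")
```

With boto3, such a dictionary could be passed as `client.query(**query)` on a DynamoDB client.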

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All Talend products with Big Data

Data Integration

Feature

Description

Available in

Remote TAC connection improvement

When the Active Directory (AD) password of an LDAP user has been changed, Talend Studio now prompts that user for new login credentials.

Available in:

Big Data

Big Data Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend on-premises products

Title bar improvement

After you install a patch, the Talend Studio title bar is updated to show the patch version information.

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

AWS SDK driver upgrade

The AWS SDK driver used for Redshift SSO connections in the Talend Studio metadata has been upgraded.

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

Context propagation enhancement

Context propagation across reference projects has been enhanced for Data Integration. Any update to a context variable in the reference project can now be synchronized automatically to the main project.

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

Advanced Assume Role configuration

When you enable the Assume Role option, you can now configure the following properties from the Advanced settings view to fine-tune your configuration:
  • Signing region (mandatory)
  • External Id
  • Serial number
  • Token code
  • Tags
  • IAM Policy ARNs
  • Policy
This enhancement is available in the following components:
  • tAmazonEMRListInstances, tAmazonEMRManage, tAmazonEMRResize, tAmazonRedshiftManage
  • tRedshiftOutputBulk, tRedshiftOutputBulkExec
  • tS3BucketCreate, tS3BucketDelete, tS3BucketExist, tS3BucketList, tS3Connection, tS3Copy, tS3Delete, tS3Get, tS3List, tS3Put
  • tSQSConnection, tSQSInput, tSQSMessageChangeVisibility, tSQSMessageDelete, tSQSOutput, tSQSQueueAttributes, tSQSQueueCreate, tSQSQueueDelete, tSQSQueueList, tSQSQueuePurge

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

tSQLDWH components renamed

The tSQLDWH components have been renamed as follows:
  • tSQLDWHBulkExec renamed as tAzureSynapseBulkExec
  • tSQLDWHClose renamed as tAzureSynapseClose
  • tSQLDWHCommit renamed as tAzureSynapseCommit
  • tSQLDWHConnection renamed as tAzureSynapseConnection
  • tSQLDWHInput renamed as tAzureSynapseInput
  • tSQLDWHOutput renamed as tAzureSynapseOutput
  • tSQLDWHRollback renamed as tAzureSynapseRollback
  • tSQLDWHRow renamed as tAzureSynapseRow
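The renaming listed above can be captured as a simple lookup table, for example to check an inventory of component names against the new ones. The mapping is taken from the list; the helper name `new_component_name` is hypothetical and this is only a sketch, not a Talend migration tool.

```python
# Old component name -> new component name, as listed in these release notes.
RENAMED_COMPONENTS = {
    "tSQLDWHBulkExec": "tAzureSynapseBulkExec",
    "tSQLDWHClose": "tAzureSynapseClose",
    "tSQLDWHCommit": "tAzureSynapseCommit",
    "tSQLDWHConnection": "tAzureSynapseConnection",
    "tSQLDWHInput": "tAzureSynapseInput",
    "tSQLDWHOutput": "tAzureSynapseOutput",
    "tSQLDWHRollback": "tAzureSynapseRollback",
    "tSQLDWHRow": "tAzureSynapseRow",
}


def new_component_name(name: str) -> str:
    """Return the new name of a renamed component, or the name unchanged."""
    return RENAMED_COMPONENTS.get(name, name)


print(new_component_name("tSQLDWHInput"))
print(new_component_name("tS3Put"))
```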

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

Support for Azure Data Lake Storage Gen2

The Azure Synapse components support Azure Data Lake Storage Gen2. The tAzureSynapseBulkExec component provides the Data Lake Storage Gen2 option in the Azure Storage drop-down list in the Basic settings view and the Secure transfer required option in the Advanced settings view. The existing Data Lake Store option in the Azure Storage drop-down list has been renamed to Data Lake Storage Gen1.

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

tELTTeradataMap: relationship operator updated

The ELT Teradata Map Editor now uses these operators: =, <=, <, >=, >, and <>. The corresponding previous operators (EQ, LE, LT, GE, GT, and NE) are deprecated.
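The correspondence between the deprecated and the current operators can be written as a lookup table. The sketch below uses it to rewrite a condition string; the function name `modernize` and the sample expression are hypothetical, and the rewrite is deliberately naive (it is case-sensitive and would also touch matching words inside string literals or identifiers named like an operator).

```python
import re

# Deprecated operator -> current operator, in the order given in the release note.
OPERATORS = {"EQ": "=", "LE": "<=", "LT": "<", "GE": ">=", "GT": ">", "NE": "<>"}

# Match the deprecated operators only as whole words.
_PATTERN = re.compile(r"\b(EQ|LE|LT|GE|GT|NE)\b")


def modernize(expression: str) -> str:
    """Replace deprecated word operators with their symbolic equivalents."""
    return _PATTERN.sub(lambda match: OPERATORS[match.group(1)], expression)


print(modernize("a.id EQ b.id AND a.qty GE 10"))
# prints: a.id = b.id AND a.qty >= 10
```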

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

Support for Azure Active Directory authentication

You can now use Azure Active Directory authentication when establishing connections using the following components:
  • tAzureSynapseBulkExec, tAzureSynapseConnection, tAzureSynapseInput, tAzureSynapseOutput, tAzureSynapseRow
  • tELTMSSqlMap
  • tMSSqlBulkExec, tMSSqlConnection, tMSSqlInput, tMSSqlOutput, tMSSqlOutputBulkExec, tMSSqlRow, tMSSqlSCD, tMSSqlSP
  • tCreateTable

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

tAzureSynapseBulkExec: support for COPY statement for loading data

The tAzureSynapseBulkExec component now supports the COPY statement for loading data. The following changes were made to the component.

In the Basic settings view:
  • Load method drop-down list (new);
  • Azure storage drop-down list (updated);
  • Authentication method drop-down list (new);
  • SAS token field (new);
  • Endpoint suffix field (new);
  • External paths option (new).
In the Advanced settings view:
  • File type drop-down list (new);
  • Specify map to source table fields option (new);
  • First row field (new);
  • Field quote field (new);
  • Field terminator field (new);
  • Row terminator field (new);
  • Date format drop-down list (new);
  • Encoding drop-down list (new);
  • Identity insert option (new);
  • Max errors field (new);
  • Compressed by drop-down list (updated).
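Several of the new fields above have names close to options of the T-SQL COPY INTO statement in Azure Synapse. The sketch below assembles such a statement to show its general shape. The mapping between component fields and statement options is an assumption, the statement the component actually generates is not documented here, and the helper name, table, URL, and token are placeholders. No escaping or validation is done.

```python
from typing import Optional


def build_copy_statement(
    table: str,
    source_url: str,
    sas_token: str,
    *,
    file_type: str = "CSV",
    first_row: int = 1,
    field_quote: str = '"',
    field_terminator: str = ",",
    row_terminator: str = "0x0A",
    max_errors: int = 0,
    compression: Optional[str] = None,
) -> str:
    """Assemble a COPY INTO statement as text (illustrative only)."""
    options = [
        f"FILE_TYPE = '{file_type}'",
        f"CREDENTIAL = (IDENTITY = 'Shared Access Signature', SECRET = '{sas_token}')",
        f"FIELDQUOTE = '{field_quote}'",
        f"FIELDTERMINATOR = '{field_terminator}'",
        f"ROWTERMINATOR = '{row_terminator}'",
        f"FIRSTROW = {first_row}",
        f"MAXERRORS = {max_errors}",
    ]
    if compression:
        options.append(f"COMPRESSION = '{compression}'")
    body = ",\n    ".join(options)
    return f"COPY INTO {table}\nFROM '{source_url}'\nWITH (\n    {body}\n)"


# Placeholder values, for illustration only.
statement = build_copy_statement(
    "dbo.sales",
    "https://example.blob.core.windows.net/container/sales.csv",
    "<sas-token>",
    first_row=2,
    compression="Gzip",
)
print(statement)
```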

Available in:

Big Data

Big Data Platform

Cloud API Services Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Integration

Cloud Data Management Platform

Data Fabric

Data Integration

Data Management Platform

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with Talend Studio

Data Quality

Feature

Description

Available in

Components

All Data Quality components can run on Databricks on Azure and AWS, except for tMatchIndex and tMatchIndexPredict. Because these two components do not support Elasticsearch authentication, they cannot run on Databricks.

Available in:

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

All Talend Platform and Data Fabric products

Application Integration

Feature

Description

Available in

REST Services

Context variables are now fully supported in REST service provider and consumer endpoints in Data Services and Routes.

Available in:

Cloud API Services Platform

Cloud Data Fabric

Data Fabric

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with ESB

Microservices

Microservices can now provide metrics to Prometheus.

Available in:

Cloud API Services Platform

Cloud Data Fabric

Data Fabric

Data Services Platform

ESB

MDM Platform

Real-Time Big Data Platform

All Talend products with ESB