What's new in R2020-07 - 7.3

Talend Big Data products Release Notes

Author: Talend Documentation Team
Version: 7.3
Products: Talend Big Data, Talend Big Data Platform, Talend Open Studio for Big Data, Talend Real-Time Big Data Platform
Task: Installation and Upgrade, Release Notes

The R2020-07 Studio monthly release contains the following new features.

Platform support

Feature

Description

Product

Internet Explorer 11

Internet Explorer 11 support is deprecated.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

Big Data: new features

Feature

Description

Product

Support for Cloudera Data Platform (CDP)

When you configure a connection to a Hadoop cluster, you can select Cloudera CDP 7.1.1. You can also add and use the dynamic distributions of CDP Private Cloud Base 7.x.

The CDP integration in Talend Studio includes a new dependency management system that improves the performance of your Jobs at runtime.

CDP supports the following elements:
  • Data Integration components:
    • Sqoop
    • Impala

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tAzureFSConfiguration: new properties provided in the Basic settings view for Azure Data Lake Storage (ADLS) Gen2

When you configure a connection with tAzureFSConfiguration in a Spark Streaming or Spark Batch Job, you can now authenticate with your Azure AD credentials.
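For orientation, the sketch below shows the Hadoop ABFS properties that are commonly used for Azure AD (OAuth client credentials) authentication against ADLS Gen2. Whether tAzureFSConfiguration sets exactly these properties is an assumption; the class name, method name, and sample values are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch only: builds the hadoop-azure (ABFS) OAuth properties for one storage account.
// That the Studio component maps to these properties is an assumption.
final class AbfsOAuthSettings {

    static Map<String, String> clientCredentials(
            String account, String tenantId, String clientId, String clientSecret) {
        // ABFS properties can be scoped to one account by appending its DFS host name.
        String host = account + ".dfs.core.windows.net";
        Map<String, String> conf = new LinkedHashMap<>();
        conf.put("fs.azure.account.auth.type." + host, "OAuth");
        conf.put("fs.azure.account.oauth.provider.type." + host,
                "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider");
        conf.put("fs.azure.account.oauth2.client.id." + host, clientId);
        conf.put("fs.azure.account.oauth2.client.secret." + host, clientSecret);
        conf.put("fs.azure.account.oauth2.client.endpoint." + host,
                "https://login.microsoftonline.com/" + tenantId + "/oauth2/token");
        return conf;
    }
}
```

In a real Job the client secret would come from a secured context variable rather than a literal.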

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

Using a Hadoop configuration file with Spark Streaming Jobs

You can connect Spark Streaming Jobs to a Hadoop cluster using a configuration JAR file. You specify the path to this file either in the Spark Configuration of the Job or in the Hadoop cluster configuration. This option is available only in Yarn cluster mode on non-Cloud distributions. Optionally, you can contextualize this connection parameter so that the Job automatically connects to the right cluster for the environment in which it runs.
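The idea of contextualizing the path can be pictured as a lookup from environment to configuration JAR. The sketch below is a plain Java illustration, not Studio-generated code; in Studio you would normally define a context variable with one value per context. The environment names and file paths are hypothetical.

```java
import java.util.Map;

// Sketch only: picks the Hadoop configuration JAR for the environment a Job runs in.
final class HadoopConfJarResolver {

    // Hypothetical environments and paths, for illustration only.
    private static final Map<String, String> CONF_JAR_BY_ENV = Map.of(
            "dev", "/opt/talend/conf/hadoop-conf-dev.jar",
            "prod", "/opt/talend/conf/hadoop-conf-prod.jar");

    static String resolve(String environment) {
        String path = CONF_JAR_BY_ENV.get(environment);
        if (path == null) {
            // Fail early instead of connecting to an unintended cluster.
            throw new IllegalArgumentException(
                    "No Hadoop configuration JAR for environment: " + environment);
        }
        return path;
    }
}
```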

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tBigQueryBulkExec: new property provided

The tBigQueryBulkExec component provides the Credential type property, which allows you to authenticate to your project using either the Service account option or the HMAC key option.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tS3Configuration: new property provided in the Basic settings view for Spark Batch Jobs

When you use the SSE-KMS encryption service enabled on AWS, you can now specify the KMS key ID of the customer managed CMK you want to use for the encryption.
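As a rough illustration, SSE-KMS with a specific key is usually expressed in Spark through the Hadoop S3A encryption properties shown below. That tS3Configuration translates the new field into these properties is an assumption; the class and method names are hypothetical, and newer Hadoop releases also accept the shorter fs.s3a.encryption.* names.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch only: Spark properties that ask the S3A connector to encrypt with SSE-KMS
// using a given customer managed key.
final class S3aKmsSettings {

    static Map<String, String> sseKms(String kmsKeyId) {
        if (kmsKeyId == null || kmsKeyId.trim().isEmpty()) {
            throw new IllegalArgumentException("A KMS key ID is required");
        }
        Map<String, String> conf = new LinkedHashMap<>();
        // The spark.hadoop. prefix forwards the setting to the Hadoop configuration.
        conf.put("spark.hadoop.fs.s3a.server-side-encryption-algorithm", "SSE-KMS");
        conf.put("spark.hadoop.fs.s3a.server-side-encryption.key", kmsKeyId);
        return conf;
    }
}
```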

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

Data Integration: new features

Feature

Description

Product

Title bar improvement

The release name for the patch on the title bar of Talend Studio has been updated.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tBigQueryBulkExec: new option provided

The tBigQueryBulkExec component provides the Use custom null marker option, which prevents errors caused by fields with null values.
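To show what a null marker does, the minimal sketch below writes one CSV line and replaces null fields with a marker string, so that a loader can tell a null value from an empty string. It does not reproduce the component's behavior; the class name and the marker value are hypothetical, and CSV quoting is deliberately left out.

```java
import java.util.List;
import java.util.stream.Collectors;

// Sketch only: renders null fields as an explicit marker in a CSV line.
final class NullMarkerCsv {

    static String toCsvLine(List<String> fields, String nullMarker) {
        return fields.stream()
                // A null field becomes the marker; every other value is kept as is.
                .map(field -> field == null ? nullMarker : field)
                .collect(Collectors.joining(","));
    }
}
```

The same marker string must then be declared on the load side so that it is read back as a null value.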

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

Option removed

The Move to the current directory option was unnecessary and has been removed. Components involved: tFTPGet, tFTPPut, tFTPFileList, tFTPDelete, tFTPFileExist, tFTPRename, and tFTPTruncate.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tJDBCSCDELT: support for SCD type 0

tJDBCSCDELT now supports SCD type 0 for Exasol, Mysql, MSsql, Oracle, Postgresql, and Snowflake.
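In slowly changing dimension terms, a type 0 attribute keeps its original value and ignores later changes. The in-memory sketch below illustrates that rule only; it is not the SQL that tJDBCSCDELT generates. The class and field names are hypothetical, and all other fields are simply overwritten to keep the example short.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

// Sketch only: applies an incoming record to a stored dimension row,
// leaving SCD type 0 fields untouched.
final class ScdType0 {

    static Map<String, String> merge(
            Map<String, String> stored, Map<String, String> incoming, Set<String> type0Fields) {
        Map<String, String> result = new HashMap<>(stored);
        for (Map.Entry<String, String> entry : incoming.entrySet()) {
            // Type 0: the value loaded first is retained, so incoming changes are skipped.
            if (!type0Fields.contains(entry.getKey())) {
                result.put(entry.getKey(), entry.getValue());
            }
        }
        return result;
    }
}
```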

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tVerticaBulkExec: using an existing dynamic schema

The tVerticaBulkExec component can now use the dynamic schema generated by a tSetDynamicSchema component.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tRedshiftUnload: support for Apache Parquet files

The tRedshiftUnload component can now unload data to Apache Parquet files.
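For reference, Amazon Redshift unloads to Parquet with the FORMAT AS PARQUET clause of the UNLOAD command. The sketch below assembles such a statement as a string; whether the component issues exactly this statement is an assumption, and the class name, bucket, and role ARN are hypothetical.

```java
// Sketch only: builds a Redshift UNLOAD statement that writes Parquet files to S3.
final class RedshiftUnloadSql {

    static String unloadToParquet(String query, String s3Prefix, String iamRoleArn) {
        // The query is passed as a quoted literal, so inner single quotes are doubled.
        String escapedQuery = query.replace("'", "''");
        return "UNLOAD ('" + escapedQuery + "')"
                + " TO '" + s3Prefix + "'"
                + " IAM_ROLE '" + iamRoleArn + "'"
                + " FORMAT AS PARQUET";
    }
}
```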

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tAmazonEMRManage: new Amazon EMR cluster version supported

The tAmazonEMRManage component supports Amazon EMR cluster version 5.29.0.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

EXA components renamed

The EXA components were renamed as follows:

  • tEXABulkExec was renamed as tExasolBulkExec;
  • tEXAClose was renamed as tExasolClose;
  • tEXACommit was renamed as tExasolCommit;
  • tEXAConnection was renamed as tExasolConnection;
  • tEXAInput was renamed as tExasolInput;
  • tEXAOutput was renamed as tExasolOutput;
  • tEXARollback was renamed as tExasolRollback;
  • tEXARow was renamed as tExasolRow.
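If you maintain scripts or documentation that refer to components by name, the renames above can be captured in a small lookup table. The sketch below is one possible helper, not part of Talend Studio; the class and method names are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch only: maps the former EXA component names to their new Exasol names.
final class ExasolComponentRenames {

    static final Map<String, String> RENAMES = new LinkedHashMap<>();

    static {
        RENAMES.put("tEXABulkExec", "tExasolBulkExec");
        RENAMES.put("tEXAClose", "tExasolClose");
        RENAMES.put("tEXACommit", "tExasolCommit");
        RENAMES.put("tEXAConnection", "tExasolConnection");
        RENAMES.put("tEXAInput", "tExasolInput");
        RENAMES.put("tEXAOutput", "tExasolOutput");
        RENAMES.put("tEXARollback", "tExasolRollback");
        RENAMES.put("tEXARow", "tExasolRow");
    }

    // Returns the new name for a renamed component, or the name unchanged otherwise.
    static String currentName(String componentName) {
        return RENAMES.getOrDefault(componentName, componentName);
    }
}
```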

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tGoogleDataprocManage: new option provided

The tGoogleDataprocManage component provides the Internal IP only option, which allows you to configure all instances in the cluster to have only internal IP addresses.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tGSConnection: new authentication type

The tGSConnection component provides the Credential type property, which allows you to authenticate to your project using either the Service account option or the HMAC key option.
This property is also available for all other Google Storage components such as tGSDelete, tGSGet, tGSList, tGSCopy, tGSPut, tGSBucketCreate, tGSBucketList, tGSBucketDelete, and tGSBucketExist.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

tGSBucketCreate: new region for bucket creation

The tGSBucketCreate component provides the ASIA region for bucket creation when you select Service account as the credential type.

Talend Big Data

Talend Big Data Platform

Talend Real-Time Big Data Platform

Data Mapper: new features

Feature

Description

Product

Merge option in tHConvertFile

The tHConvertFile component has a new option that allows you to merge the part files created when using a large input file.

Talend Big Data Platform

Talend Real-Time Big Data Platform

Data Quality: new features

Feature

Description

Product

Referenced project

The main project now detects changes that you make in the referenced project.

Talend Big Data Platform

Talend Real-Time Big Data Platform