What's new in R2022-07 - Cloud

Talend Cloud Release Notes

Version
Cloud
Language
English (United States)
Product
Talend Cloud
Module
Talend API Designer
Talend API Tester
Talend Data Inventory
Talend Data Preparation
Talend Data Stewardship
Talend Management Console
Talend Pipeline Designer
Content
Installation and Upgrade
Release Notes

Big Data: new features

Feature

Description

Support of Databricks runtime 10.x and onwards on Spark Universal 3.2.x You can now run your Spark Batch and Streaming Jobs on transient and interactive Databricks clusters, both on AWS and Azure, using Spark Universal with Spark 3.2.x. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select this mode, Talend Studio is compatible with Databricks 10.x version.

Support of simple mapping with Dataset when using tMap in Spark Batch Jobs Following the migration of component APIs from Resilient Distributed Dataset (RDD) to Dataset (DS), you can now perform simple mapping with DS as well as RDD when you use tMap in your Spark Batch Jobs.
For example, the following screenshot shows a simple mapping without any transformation from the input table to the output table:
Support of IDBroker with Cloudera CDP Public Cloud Talend Studio now supports IDBroker as part of Knox authentication service with Cloudera CDP Public Cloud. When you use S3 in your Spark Batch Jobs, IDBroker allows to read and write any type of files using S3 as an external storage and provides the token generation.
Support of Azure Active Directory (AD) Service Principal for Azure Synapse Spark pools in Spark Jobs

When you use Azure Synapse Analytics with Apache Spark pools as a distribution for your Spark Batch and Spark Streaming Jobs, you can now authenticate with Azure AD Service Principal. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select the Azure Active Directory authentication mode, you can also authenticate using a certificate now.

Data Integration: new features

Feature

Description

Enhancement of data type mapping to modify, restore and migrate the default mapping files The data type mapping in Talend Studio has been improved.
  • The default data type mapping files can now only be modified. After modification, you can restore them to the Talend default data type mapping files corresponding to your current Talend Studio version.
  • The default data type mapping files are now migrated automatically after applying a Talend Studio monthly update.

For more information, see Type mapping.

Support of deleting the .m2 folder after updating Talend Studio You can now delete the .m2 folder under <Talend-Studio>\configuration\ after applying a Talend Studio monthly update. You must close Talend Studio before deleting the .m2 folder. This helps you to easily remove the obsolete JAR files affected by Common Vulnerabilities and Exposures (CVEs).

For more information, see Updating Talend Studio.

MIMB OEM cumulative patch upgrade for Talend Metadata Bridge The MIMB OEM cumulative patch for Talend Metadata Bridge has been upgraded from MIMB-OEM-CumulativePatch-1010-20220208-1632.zip to MIMB-OEM-CumulativePatch-1010-20220622-1543.zip.
Improvements of FTP components: JAR upgrade, new JSch client options, and enhanced logging JSch JAR is upgraded to version 0.2.1. A new option, Config client, is provided by FTP components, allowing you to set JSch client options.
Enhancement of tMongoDBOutput and tCosmosDBOutput to support dynamic schema

The tMongoDBOutput and tCosmosDBOutput components support dynamic schema when inserting documents to collections.

Enhancement of the Snowflake metadata wizard This feature enables the Snowflake metadata wizard to retrieve new schemas without affecting existing ones.

Data Quality: new features

Feature

Description

tDataQualityRules for Spark The component is now available for Apache Spark Batch and Streaming.
It supports:
  • Apache Spark 3.0 and greater in local mode.
  • Cloudera Data Engineering service with Apache Spark 3.1 and 3.2.
Viewing rows of match analysis The View rows feature is now available for match analyses from the Duplicate Record Statistics or Group Statistics tabs.

It lets you view all the analyzed data corresponding to the row you selected. You can export those data into a CSV file.

Continuous Integration: new features

Feature

Description

Talend CI Builder upgraded to version 8.0.7 Talend CI Builder is upgraded from version 8.0.5 to version 8.0.7.

Use Talend CI Builder 8.0.7 in your CI commands or pipeline scripts from this monthly version onwards until a new version of Talend CI Builder is released.

Support of displaying project details at build time

Talend CI Builder now supports displaying project details at build time.

Compatibility check between Talend CommandLine and project versions at build time

Talend CI Builder now supports checking the compatibility between Talend CommandLine and project versions at build time. If the Talend CommandLine version is older than your project version, the build fails and an error message is displayed.