What's new in R2021-10 - Cloud

Talend Cloud Release Notes

Version
Cloud
Language
English (United States)
Product
Talend Cloud
Module
Talend API Designer
Talend API Tester
Talend Data Inventory
Talend Data Preparation
Talend Data Stewardship
Talend Management Console
Talend Pipeline Designer
Content
Installation and Upgrade
Release Notes

Big Data: new features

Feature

Description

Support of Azure Synapse distribution with Spark 3.0

You can now use Azure Synapse Analytics with Apache Spark pools as a new distribution for your Spark Batch and Spark Streaming Jobs with Spark 3.0 in YARN cluster mode only. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard. For more information, see Defining the Azure Synapse Analytics connection parameters.

Azure Synapse Analytics allows you to process your data thanks to various analytics engines. With Apache Spark pools you can have various compute capabilities (such as speed and efficiency) and a compatibility with ADLS Gen2 storage.

This feature is not in technical preview anymore.
Support of Hive Warehouse Connector with Cloudera CDP 7.1.x
You can now use the Hive Warehouse Connector to get data from Hive and write data to Hive table in Spark Batch Jobs with the following new components:
  • tHiveWarehouseConfiguration: enables the reuse of the Hive Warehouse Connector connection configuration to Hive in the same Job.
  • tHiveWarehouseInput: extracts data from Hive and sends the data to the component that follows using Hive Warehouse Connector.
  • tHiveWarehouseOutput: connects to a given Hive database and writes the received data into a given Hive table or a directory in HDFS using Hive Warehouse Connector.

With Hive Warehouse Connector, Talend Studio support Hive transactional managed tables which allows you to have a more optimal transaction control over your data.

Data Integration: new features

Feature

Description

tWorkdayInput: support for X.509 certificate The tWorkdayInput component provides a new WS Security type option named WS Security with certificate authentication, allowing you to secure your messages using an X.509 certificate.
tSnowflakeOutputBulk, tSnowflakeBulkExec, and tSnowflakeOutputBulkExec: gaining access permissions associated to an AWS IAM role

The tSnowflakeOutputBulk, tSnowflakeBulkExec, and tSnowflakeOutputBulkExec components provide the S3 assume role option, which allow you gaining the access permissions associated to an AWS IAM role.

tSnowflakeBulkExec and tSnowflakeOutputBulkExec: a new option for COPY command authentication

The tSnowflakeBulkExec and tSnowflakeOutputBulkExec components provide the Snowflake access to storage drop-down list, which is used for COPY command authentication.

tFileOutputExcel: appending data in streaming mode

The tFileOutputExcel component provides the Streaming append option, which, when selected, allows you to write data to an existing file in the streaming mode. This option helps improve performance when you write large volume of data with a limited memory size.

tSalesforceInput and tSalesforceOutput: using UTC time for datetime data type

The tSalesforceInput and tSalesforceOutput components provide the Use UTC time for datetime option, which, when selected, uses UTC time for columns that are of datetime type. If this option is not selected, the local time is used.

New components available

This release provides the following new components:

  • tFileInputORC and tFileOutputORC, which provide access to files that are of ORC format.
  • tCyberarkInput, which retrieves CyberArk-managed login information of the specified application and passes the information to the next component that requires it. By preventing exposure of credential information, this component helps improve the security of managed applications.
Google Drive components: new support for shared files, folders and drives

The Include shared folders/files and Include shared drives options have been added to the Advanced settings view and allow you to access shared files, folders and drives. The Corpora type option has been added to narrow the scope of file and folder searches.

These options are available for tGoogleDriveCopy, tGoogleDriveCreate, tGoogleDriveDelete, tGoogleDriveGet, tGoogleDriveList and tGoogleDrivePut.

tGoogleDriveList: new Use query option to customize searches

The Use query option has been added to the Advanced settings view and allows you to search for specific files and folders.

Data Quality: new features

Feature

Description

Information about blocking and matching keys in reports Two tables have been added to the reports to get details on the blocking and matching keys.