Big Data: new features
Feature |
Description |
Available in |
---|---|---|
Support of Databricks runtime 10.x and onwards on Spark Universal 3.2.x | You can now run your Spark Batch and Streaming Jobs on transient and
interactive Databricks clusters, both on AWS and Azure, using Spark Universal with
Spark 3.2.x. You can configure it either in the Spark
Configuration view of your Spark Jobs or in the Hadoop
Cluster Connection metadata wizard. When you select this mode, Talend Studio is compatible with Databricks 10.x version. |
ⓘ Available in: Big Data Big Data Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Data Fabric Real-Time Big Data Platform All subscription-based Talend products with Big Data |
Support of simple mapping with Dataset when using tMap in Spark Batch Jobs | Following the migration of component APIs from Resilient Distributed Dataset
(RDD) to Dataset (DS), you can now perform simple mapping with DS as well as RDD
when you use tMap in your Spark Batch Jobs. For example, the following
screenshot shows a simple mapping without any transformation from the input
table to the output table:
|
ⓘ Available in: Big Data Big Data Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Data Fabric Real-Time Big Data Platform All subscription-based Talend products with Big Data |
Support of IDBroker with Cloudera CDP Public Cloud | Talend Studio now supports IDBroker as part of Knox authentication service with Cloudera CDP Public Cloud. When you use S3 in your Spark Batch Jobs, IDBroker allows to read and write any type of files using S3 as an external storage and provides the token generation. |
ⓘ Available in: Big Data Big Data Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Data Fabric Real-Time Big Data Platform All subscription-based Talend products with Big Data |
Support of Azure Active Directory (AD) Service Principal for Azure Synapse Spark pools in Spark Jobs |
When you use Azure Synapse Analytics with Apache Spark pools as a distribution for your Spark Batch and Spark Streaming Jobs, you can now authenticate with Azure AD Service Principal. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard. When you select the Azure Active Directory authentication mode, you can also authenticate using a certificate now. |
ⓘ Available in: Big Data Big Data Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Data Fabric Real-Time Big Data Platform All subscription-based Talend products with Big Data |
Data Integration: new features
Feature |
Description |
Available in |
---|---|---|
Enhancement of data type mapping to modify, restore and migrate the default mapping files | The data type mapping in Talend Studio
has been improved.
For more information, see Type mapping. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
Support of deleting the .m2 folder after updating Talend Studio | You can now delete the .m2 folder under
<Talend-Studio>\configuration\ after applying a
Talend Studio
monthly update. You must close Talend Studio
before deleting the .m2 folder. This helps you to easily
remove the obsolete JAR files affected by Common Vulnerabilities and Exposures
(CVEs). For more information, see Updating Talend Studio. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
MIMB OEM cumulative patch upgrade for Talend Metadata Bridge | The MIMB OEM cumulative patch for Talend Metadata Bridge has been upgraded from MIMB-OEM-CumulativePatch-1010-20220208-1632.zip to MIMB-OEM-CumulativePatch-1010-20220622-1543.zip. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
Improvements of FTP components: JAR upgrade, new JSch client options, and enhanced logging | JSch JAR is upgraded to version 0.2.1. A new option, Config
client, is provided by FTP components, allowing you to set JSch
client options. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
Enhancement of tMongoDBOutput and tCosmosDBOutput to support dynamic schema |
The tMongoDBOutput and tCosmosDBOutput components support dynamic schema when inserting documents to collections. |
ⓘ Available in: Big Data Big Data Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Data Fabric Real-Time Big Data Platform All subscription-based Talend products with Big Data |
Enhancement of the Snowflake metadata wizard | This feature enables the Snowflake metadata wizard to retrieve new schemas without affecting existing ones. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
Data Quality: new features
Feature |
Description |
Available in |
---|---|---|
tDataQualityRules for Spark | The component is now available for Apache Spark Batch and Streaming. It
supports:
|
ⓘ Available in: Big Data Platform Cloud API Services Platform Cloud Big Data Platform Cloud Data Fabric Cloud Data Management Platform Data Fabric Data Management Platform Data Services Platform MDM Platform Real-Time Big Data Platform All Talend Platform and Data Fabric products |
Viewing rows of match analysis | The View rows feature is now available for match
analyses from the Duplicate Record Statistics or
Group Statistics tabs. It lets you view all the analyzed data corresponding to the row you selected. You can export those data into a CSV file. |
ⓘ Available in: Big Data Platform Cloud API Services Platform Cloud Big Data Platform Cloud Data Fabric Cloud Data Management Platform Data Fabric Data Management Platform Data Services Platform MDM Platform Real-Time Big Data Platform All Talend Platform and Data Fabric products |
Continuous Integration: new features
Feature |
Description |
Available in |
---|---|---|
Talend CI Builder upgraded to version 8.0.7 |
Talend
CI Builder is upgraded from version 8.0.5 to version 8.0.7. Use Talend CI Builder 8.0.7 in your CI commands or pipeline scripts from this monthly version onwards until a new version of Talend CI Builder is released. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
Support of displaying project details at build time |
Talend
CI Builder now supports displaying project details at build time. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |
Compatibility check between Talend CommandLine and project versions at build time |
Talend
CI Builder now supports checking the compatibility between Talend CommandLine
and project versions at build time. If the Talend CommandLine
version is older than your project version, the build fails and an error message
is displayed. |
ⓘ Available in: Big Data Big Data Platform Cloud API Services Platform Cloud Big Data Cloud Big Data Platform Cloud Data Fabric Cloud Data Integration Cloud Data Management Platform Data Fabric Data Integration Data Management Platform Data Services Platform ESB MDM Platform Real-Time Big Data Platform All subscription-based Talend products with Talend Studio |