Skip to main content

Big Data

Feature

Description

Available in

Support for Iceberg table format in tHiveCreateTable in Standard Jobs You can now create an Iceberg table with tHiveCreateTable in Standard Jobs using either Cloudera or Amazon EMR distributions.

Using Iceberg tables allows you to work with different file formats such as Parquet, ORC, and Avro with Cloudera distributions, and Parquet only with Amazon EMR distributions.

All subscription-based Talend products with Big Data

New tHBaseNamespace component to create namespaces for HBase tables in Standard Jobs A new component, tHBaseNamespace, is available in Talend Studio in your Standard Jobs. This component allows you to create a namespace for HBase tables.

All subscription-based Talend products with Big Data

Support for HDInsight 5.0 with Spark Universal 3.1.x
Availability-noteBeta contentBeta
You can now run your Spark Batch and Spark Streaming Jobs on HDInsight with Spark Universal 3.1.x. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard, with either ADLS Gen2 storage or Azure storage.

When you select this mode, Talend Studio is compatible with HDInsight 5.0 version.

You must deactivate Log4j in components to be able to run your Spark Jobs on HDInsight. To do so, go to File > Edit Project properties > Log4j and clear the Activate log4j in components check box.

All subscription-based Talend products with Big Data

Support for AWS EMR Serverless 6.6.0 with Spark Universal 3.2.x and 3.3.x
Availability-noteBeta contentBeta
You can now run your Spark Batch Jobs on AWS EMR Serverless with Spark Universal 3.2.x and 3.3.x. You can configure it in the Spark Configuration view of your Spark Batch Jobs.

When you select this mode, Talend Studio is compatible with AWS EMR Serverless 6.6.0 version.

All subscription-based Talend products with Big Data

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!