Skip to main content

Big Data

Feature

Description

Available in

Support for Databricks 12.x runtime with Spark Universal 3.3.x You can now run your Spark Batch and Streaming Jobs on all-purpose and job clusters on Google Cloud Platform (GCP), AWS, and Azure using Spark Universal with Spark 3.3.x. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select this mode, Talend Studio is compatible with Databricks 12.x version.

All subscription-based Talend products with Big Data

Support for Amazon EMR 6.8.0, 6.9.0, and 6.10.0 with Spark Universal 3.3.x You can now run your Spark Jobs on an Amazon EMR cluster using Spark Universal with Spark 3.3.x in Yarn cluster mode. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select this mode, Talend Studio is compatible with Amazon EMR 6.8.0, 6.9.0, and 6.10.0 versions.

All subscription-based Talend products with Big Data

Support for Azure Synapse Analytics with Spark Universal 3.2.x and 3.3.x
Availability-noteBeta contentBeta
You can now run your Spark Batch Jobs on Azure Synapse Analytics with Spark Universal 3.2.x and 3.3.x, in Synapse mode. You can configure it in the Spark Configuration view of your Spark Batch Jobs.

As it is a beta feature only, it is not suitable for production environment.

All subscription-based Talend products with Big Data

New component tHBaseDeleteRows to delete rows from an HBase table in Spark Batch Jobs Talend Studio now provides the tHBaseDeleteRows component, which allows you to delete rows from an HBase table in Spark Batch Jobs.

All subscription-based Talend products with Big Data

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!