Big Data - Cloud - 8.0

Talend Release Notes

Feature

Description

Available in

Support for HPE Ezmeral Runtime Enterprise 5.4 on Kubernetes with Spark 3.1.x

You can now run your Spark Batch and Streaming Jobs on Kubernetes with Livy and Datatap using Spark Universal with Spark 3.1.x.

Available in:
  • Big Data
  • Big Data Platform
  • Cloud Big Data
  • Cloud Big Data Platform
  • Cloud Data Fabric
  • Data Fabric
  • Real-Time Big Data Platform

All subscription-based Talend products with Big Data

Support for Databricks 12.x runtime with Spark Universal 3.3.x

You can now run your Spark Batch and Streaming Jobs on all-purpose and job clusters on Google Cloud Platform (GCP), AWS, and Azure using Spark Universal with Spark 3.3.x. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select this mode, Talend Studio is compatible with Databricks 12.x.

Available in:
  • Big Data
  • Big Data Platform
  • Cloud Big Data
  • Cloud Big Data Platform
  • Cloud Data Fabric
  • Data Fabric
  • Real-Time Big Data Platform

All subscription-based Talend products with Big Data

Support for Amazon EMR 6.8.0 and 6.9.0 with Spark Universal 3.3.x

You can now run your Spark Jobs on an Amazon EMR cluster using Spark Universal with Spark 3.3.x in Yarn cluster mode. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select this mode, Talend Studio is compatible with Amazon EMR versions 6.8.0 and 6.9.0.

In the Beta version of this feature, the following known issues exist, each with a workaround:
  • Spark Batch Jobs with HBase never end. As a workaround, make sure to use htrace-core4-4.2.0-incubating.jar in the /usr/lib/hbase/lib directory.
  • Spark Jobs with Redshift components throw a runtime exception. As a workaround, make sure to use Hadoop version 3.3.1.
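
For the HBase workaround, a quick way to confirm that the jar is in place is to check the directory on the cluster node. The following is a minimal sketch, assuming it runs on a node where HBase is installed under the path named in the note; the helper name has_jar is hypothetical and is not part of Talend Studio or Amazon EMR.

```python
from pathlib import Path

# Directory and jar name are taken from the known-issue note above.
HBASE_LIB_DIR = Path("/usr/lib/hbase/lib")
HTRACE_JAR = "htrace-core4-4.2.0-incubating.jar"


def has_jar(lib_dir: Path, jar_name: str) -> bool:
    """Return True if jar_name exists as a regular file directly inside lib_dir.

    Hypothetical helper for illustration only.
    """
    return (lib_dir / jar_name).is_file()


if __name__ == "__main__":
    # Report whether the workaround jar is present on this node.
    if has_jar(HBASE_LIB_DIR, HTRACE_JAR):
        print(f"found {HTRACE_JAR} in {HBASE_LIB_DIR}")
    else:
        print(f"missing {HTRACE_JAR} in {HBASE_LIB_DIR}")
```

The check only confirms that the file exists. It does not verify that the running HBase or Spark process has loaded it.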

Available in:
  • Big Data
  • Big Data Platform
  • Cloud Big Data
  • Cloud Big Data Platform
  • Cloud Data Fabric
  • Data Fabric
  • Real-Time Big Data Platform

All subscription-based Talend products with Big Data

Support for MongoDB v4+ for Spark Streaming 3.1 and onwards

Talend Studio now supports MongoDB v4+ with Spark 3.1 and later for the following components in your Spark Streaming Jobs using Dataset:
  • tMongoDBConfiguration
  • tMongoDBInput
  • tMongoDBLookupInput
  • tMongoDBOutput

In the Beta version of this feature, select MongoDB 3.2+ from the DB Version drop-down list.

Available in:
  • Big Data
  • Big Data Platform
  • Cloud Big Data
  • Cloud Big Data Platform
  • Cloud Data Fabric
  • Data Fabric
  • Real-Time Big Data Platform

All subscription-based Talend products with Big Data
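
Taken together, the Spark Universal entries in these notes pair each Spark version with the platforms it newly supports. The lookup below is a small sketch that restates only those pairings; the names SPARK_UNIVERSAL_TARGETS and targets_for are hypothetical, and the table is not a complete Talend support matrix.

```python
# Pairings stated in these release notes only; not an exhaustive support matrix.
SPARK_UNIVERSAL_TARGETS = {
    "3.1.x": ["HPE Ezmeral Runtime Enterprise 5.4 (Kubernetes)"],
    "3.3.x": ["Databricks 12.x", "Amazon EMR 6.8.0", "Amazon EMR 6.9.0"],
}


def targets_for(spark_version: str) -> list:
    """Return the platforms these notes list for a Spark Universal version.

    Returns an empty list for versions the notes do not mention.
    """
    return list(SPARK_UNIVERSAL_TARGETS.get(spark_version, []))


if __name__ == "__main__":
    for version in sorted(SPARK_UNIVERSAL_TARGETS):
        print(version, "->", ", ".join(targets_for(version)))
```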