Big Data: new features - Cloud - 8.0

Talend Release Notes

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud API Services Platform
Talend Cloud Big Data
Talend Cloud Big Data Platform
Talend Cloud Data Fabric
Talend Cloud Data Integration
Talend Cloud Data Management Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Cloud API Designer
Talend Cloud API Tester
Talend Cloud Data Inventory
Talend Cloud Data Preparation
Talend Cloud Data Stewardship
Talend Cloud Pipeline Designer
Talend Data Preparation
Talend Data Stewardship
Talend Management Console
Talend Studio
Content
Installation and Upgrade
Release Notes
Last publication date
2024-04-16

Feature

Description

Available in

Support of Spark Universal You can now run your Spark Jobs using Spark Universal with Spark 2.4.x or Spark 3.0.x, either in Local or Yarn cluster mode.

Spark Universal is a mechanism that allows Talend Studio to be compatible with every big data distribution available for a given Spark version, using only a Hadoop configuration JAR file that contains all the necessary information to establish a connection to the cluster in Yarn cluster.

Spark Universal gives you more agility by enabling a switch between the different Spark modes, distributions, or environments.

You can configure your Spark Universal connection either in the Spark configuration view of your Job or in the Hadoop Cluster Connection metadata wizard from the Repository tree view:

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All subscription-based Talend products with Big Data

Support of Kubernetes with Spark Universal 3.1.x You can now run your Spark Jobs using Spark Universal with Spark 3.1.x in Kubernetes mode.
You can configure your Spark Universal connection with Kubernetes either in the Spark configuration view of your Job or in the Hadoop Cluster Connection metadata wizard from the Repository tree view:

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All subscription-based Talend products with Big Data

Support of Dynamic Schema in Spark Batch components You can now use the Dynamic Schema in your Spark Jobs with the following components:
  • tDeltaLakeInput
  • tDeltaLakeOutput
  • tFileInputParquet
  • tFileOutputParquet
  • tJDBCInput
  • tJDBCOutput
  • tLogRow
  • tSqlRow

Available in:

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

All subscription-based Talend products with Big Data