Skip to main content

Big Data

Feature Description Available in
Support for branching and tagging with Iceberg components in Standard Jobs You can now perform actions on branches and tags in your Iceberg table with the tIcebergTable in Standard Jobs. New parameters are now available in the Alter table action drop-down list, allowing you to either create or delete branches and tags.

All subscription-based Talend products with Big Data

Support for parallelization during output files writing in Spark Jobs A new option, Parallelize output files writing, is available in the Spark Configuration view of your Spark Batch Jobs. When you select this option, it allows the Spark Batch Jobs to run multiple threads in parallel when writing output files rather than writing output files sequentially in one thread.

This option improves the performance of the execution time.

This feature is available for all distributions, but is only available for Spark Batch Jobs containing the following output components:
  • tAvroOutput
  • tFileOutputDelimited
  • tFileOutputParquet

All subscription-based Talend products with Big Data

Support for HDInsight connection mode with Hive components in Standard Jobs HDInsight 5.0 and 5.1 versions are now supported in Hive components with ADLS Gen1 in Standard Jobs.

All subscription-based Talend products with Big Data

Support for HDInsight 5.1 with Spark Universal 3.3.x
Availability-noteBeta contentBeta
You can now run your Spark Batch and Spark Streaming Jobs on HDInsight with Spark Universal 3.3.x. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard, with either ADLS Gen2 storage or Azure storage.

When you select this mode, Talend Studio is compatible with HDInsight 5.1 version.

All subscription-based Talend products with Big Data

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!