Big Data: new features - 6.5

Talend Data Fabric Release Notes

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Installation and Upgrade

Enhancements of Spark Job designer

Feature

Description

tAzureFSConfiguration in Spark Batch and Spark Streaming Jobs

Provides authentication information for Spark to connect to Azure Blob Storage and Azure Data Lake Store.

The support of Azure Data Lake Store is available only when Hortonworks Data Platform V2.6.0 or Cloudera CDH 5.12 is used with this component.

Enhancements of tDataPrepRun in Spark Batch and Spark Streaming Jobs

Dynamic dataset preparation has been added.

Enhancements of Hadoop support

Feature

Description

Upgraded support for Hadoop distributions

  • Cloudera CDH V5.12

  • MapR 6.0

Hive application ID

The Hive components now capture the Application_ID values and write them in the Job logs.

MapR OJAI

A new component, tMapROjaiOutput, has been added to write data to a MapR Ojai database.

Hbase

Users can now read and write custom timestamps columns using the HBase components.

New NoSQL components

Feature

Description

Neo4j

  • Neo4j Batch components have been created.

  • Neo4j V3.2 along with the Bolt protocol is now supported.

Component enhancements

Feature

Description

Upgraded support for Couchbase

New Couchbase components have been created to replace the old ones to support Couchbase V4.X and V5.X.

Enhanced Marklogic support

The Marklogic components now support Marklogic V9.