Supported Hadoop distribution versions for Talend Jobs - 7.0

Talend Data Fabric Installation Guide for Linux

EnrichVersion
7.0
EnrichProdName
Talend Data Fabric
task
Installation and Upgrade
EnrichPlatform
Talend Activity Monitoring Console
Talend Administration Center
Talend Artifact Repository
Talend CommandLine
Talend Data Preparation
Talend Data Stewardship
Talend DQ Portal
Talend ESB
Talend Identity and Access Management
Talend Installer
Talend JobServer
Talend Log Server
Talend MDM Server
Talend MDM Web UI
Talend Repository Manager
Talend Runtime
Talend SAP RFC Server
Talend Studio

In general, Talend certifies a specific release version for a given Big Data (Hadoop) Distribution vendor. These are typically what is recommended for use for that vendor. For incremental upgrades and service packs by a given vendor, Talend relies on the vendors' compatibility statements to ensure the proper running and execution of the Talend software. Where compatibility is stated, Talend also supports that version under our Support SLA. If an incompatibility should be verified by the Hadoop vendor, then Talend considers that a re-test and upgrade may be necessary.

If support for the Hadoop distribution you want to use is not yet available in your Talend Studio, it may be available via an update. For related information, see Talend Help Center.

If you are using a Cloudera distribution, you can also add your distribution via some dynamic distribution settings (technical preview) in the Studio. For further information, see Adding the latest Hadoop distribution dynamically (technical preview). The dynamic distribution added this way is not officially supported by Talend.

Hadoop distributions

  • : officially supported.
  • : not officially supported.
  • : the Kerberos kinit option is supported by the Studio.
  • : the Kerberos kinit option and the Kerberos keytab option are both supported by the Studio.

1. This enables the connection between the Studio and a custom Hadoop distribution not yet officially supported in the Studio. For further information, see the sections describing how to connect to a custom Hadoop distribution of the Talend Big Data Getting Started Guide or the documentation of any related component that creates the connection to a Hadoop distribution, such as tHDFSConnection.

2. For this MapR version, the MapR security ticket mechanism is supported by the Studio.

3. In MapR 6, HBase support has been removed in favor of MapR-DB. For more information, see https://maprdocs.mapr.com/home/ReleaseNotes/install_upgrade_notes.html.

4. The supported Amazon EMR versions for the tAmazonEMRManage component are 4.0.0, 4.1.0, 4.2.0, 4.5.0, 4.6.0, 4.9.2 and 5.11.0.

5. The Profiling perspective does not support the Embedded connection mode on Hive distributions. This mode is available mainly for test purposes done by Hadoop developers. The studio may not be able to run correctly with the embedded mode.

6. Officially supported Cloudera version. Other Cloudera versions could be compatible as part of Cloudera backward compatibility on Cloudera V5.X branches.

  HBase HCatalog HDFS Hive 5 Oozie Pig Spark Sqoop Talend MapReduce
Standalone YARN
Google Dataproc V1.1 (Apache 2.7.3)

V2.0

HDP V2.4.0

V1.6

V2.5.0

V1.6

V2.6.0

V1.6 and V2.1

Cloudera CDH 5.5 (YARN mode)

V1.5 V1.5

CDH 5.6 (YARN mode)

V1.5 V1.5

CDH 5.7 (YARN mode)

V1.6 V1.6

CDH 5.8 (YARN mode)

V1.6 V1.6 and V2.0

CDH 5.10 (YARN mode)

V1.6 and V2.1 V1.6 and V2.1

CDH 5.12.1 (YARN mode)

V2.2 V2.2

CDH 5.13.0

V2.2 V2.2

Cloudera Altus CDH5.11 6 Azure (technical preview) V2.1
AWS V2.1
MapR 5.0.0 (YARN mode) 2

V1.3 V1.3

5.1.0 (YARN mode) 2

V1.5 V1.5

5.2.0 (YARN mode) 2

V1.6 V1.6

6.0.0 (YARN mode) 2 3

V2.1 V2.1

Amazon EMR 4 EMR 4.5.0 (Apache 2.7.2)

V1.6
EMR 4.6.0 (Apache 2.7.2)

V1.6
EMR 5.0.0 (Apache 2.7.2)

V2.0
EMR 5.5.0 (Apache 2.7.3)

V2.1
EMR 5.8.0 (Apache 2.7.3)