Big Data: known issues and known limitations - 8.0

Talend Big Data products Release Notes


We encourage you to consult the JIRA bug tracking tool for a full list of open issues:

https://jira.talendforge.org/issues/?filter=35345

Limitation: Hive
Description: Hive is not supported in Spark Local mode.
Product: Talend Big Data, Talend Big Data Platform, Talend Real-Time Big Data Platform

Limitation: Java 11
Description:
  • Java 11 is not supported in Standard Jobs or the Metadata Repository when they involve big data distributions.
  • Java 11 is not supported in Spark Jobs.

This limitation stems from the big data distributions' own constraints on Java 11 support.

To run Spark Jobs, Standard Jobs, or the Metadata Repository with big data distributions, install Java 8 on your computer. Then, in Talend Studio, set the path in Preferences > Talend > Java interpreter and browse to the JDK 8 location in Preferences > Java > Installed JREs.

Product: Talend Open Studio for Big Data, Talend Big Data, Talend Big Data Platform, Talend Real-Time Big Data Platform
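
Before pointing Talend Studio at a JDK, it can help to confirm that the installation really is Java 8. The following is a minimal sketch, not part of Talend: the function names are hypothetical, and it only assumes the usual format of the text that `java -version` prints, where Java 8 and earlier report `1.x` and Java 9 and later report the major number first.

```python
import re


def java_major_version(version_output: str) -> int:
    """Return the major Java version found in `java -version` output.

    Hypothetical helper for illustration; it is not a Talend API.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        raise ValueError("no Java version string found")
    first, second = match.group(1), match.group(2)
    # Java 8 and earlier use the "1.x" scheme, so the major version is x.
    if first == "1" and second is not None:
        return int(second)
    # Java 9 and later put the major version first, e.g. "11.0.2".
    return int(first)


def is_java_8(version_output: str) -> bool:
    """True when the reported JDK is Java 8, the version required above."""
    return java_major_version(version_output) == 8
```

Note that `java -version` writes its report to standard error rather than standard output, so a wrapper script would need to capture that stream before passing the text to `is_java_8`.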

Issue: When you run a Spark Batch Job with MapRDB components that have Date type columns in the schema, the following compile error appears:

"The method toBytes(ByteBuffer) in the type Bytes is not applicable for the arguments (Date)".

Workaround: Date type columns cannot be used in the schema when you run a Spark Batch Job with MapRDB components.

Issue: HBase does not work with a CDP 7.1.x cluster that uses Kerberos in YARN Client mode and returns the following error:

hbase.pb.AuthenticationService.GetAuthenticationTokenorg.apache.hadoop.hbase.HBaseIOException: com.google.protobuf.ServiceException: Error calling method hbase.pb.AuthenticationService.GetAuthenticationToken.

Workaround: To use Kerberos with HBase on a CDP 7.1.x cluster, use YARN Cluster mode instead of YARN Client mode.
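
In Talend Studio the YARN mode is chosen in the Job's Spark configuration. For readers who think in terms of plain Spark, the sketch below shows what YARN Cluster mode corresponds to on a `spark-submit` command line. It is an illustration under assumptions, not how Talend submits Jobs: the function name, jar path, principal, and keytab are all hypothetical placeholders, while `--master`, `--deploy-mode`, `--principal`, and `--keytab` are standard `spark-submit` options.

```python
from typing import List


def yarn_cluster_submit_args(app_jar: str, principal: str, keytab: str) -> List[str]:
    """Build spark-submit arguments for a Kerberized run in YARN cluster mode.

    Hypothetical helper for illustration; all argument values are placeholders.
    """
    return [
        "spark-submit",
        "--master", "yarn",
        # "cluster" runs the driver inside YARN; "client" would keep it local.
        "--deploy-mode", "cluster",
        "--principal", principal,
        "--keytab", keytab,
        app_jar,
    ]
```

For example, `yarn_cluster_submit_args("job.jar", "user@EXAMPLE.COM", "/path/to/user.keytab")` yields an argument list whose deploy mode is `cluster` rather than `client`.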