Teradata error: "Invalid Input: only Latin letters allowed" - Cloud - 8.0

Talend Studio User Guide

Version: Cloud, 8.0
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Cloud, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
Module: Talend Studio
Content: Design and Development
Last publication date: 2024-02-22
Available in: Big Data Platform, Cloud API Services Platform, Cloud Big Data Platform, Cloud Data Fabric, Cloud Data Management Platform, Data Fabric, Data Management Platform, Data Services Platform, MDM Platform, Real-Time Big Data Platform

When you use Talend Studio to profile a Teradata database with the Soundex Frequency Table indicator, drilling down into the analysis results can fail with an error if the profiled values contain characters that are not simple Latin letters.

In the Profiling perspective, profile a Teradata column, for example first_name, using the Soundex Frequency Table indicator, and run the column analysis with the SQL engine. The analysis runs successfully.

Then try to drill down into the data on the result page: in the Frequency Statistics table in the Analysis Results view, right-click a row and select View Rows. The SQL Editor reports the following error for the generated SQL query:

[Error 5583] Invalid Input: only Latin letters allowed.

This limitation comes from the Teradata SOUNDEX implementation. Teradata requires that the character string or expression passed to SOUNDEX, typically a surname, contain only simple Latin characters.

A simple Latin character is one that does not have diacritical marks such as a tilde (~) or an acute accent (´). There are 26 uppercase simple Latin characters and 26 lowercase simple Latin characters. Even a simple call to SOUNDEX('Sébastien') cannot be executed on Teradata. Therefore, it is not possible to drill down into the rows that sound like 'Sébastien'.
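
To see in advance which values would trigger this error, you can check them against the definition of a simple Latin character given above. The following Python sketch is an illustration only and is not part of Talend Studio or Teradata. The function names are hypothetical. It assumes that only letters matter: spaces, digits, and punctuation are ignored, because how Teradata treats them is not covered here.

```python
import string
import unicodedata

# The 26 uppercase and 26 lowercase simple Latin letters.
SIMPLE_LATIN = set(string.ascii_letters)


def non_simple_latin_letters(value):
    """Return the letters in value that are not simple Latin letters.

    The value is normalized to NFC first, so that a base letter followed
    by a combining mark is treated as one accented letter.
    Assumption: non-letter characters are ignored by this check.
    """
    normalized = unicodedata.normalize("NFC", value)
    return [ch for ch in normalized if ch.isalpha() and ch not in SIMPLE_LATIN]


def is_soundex_safe(value):
    """Return True if every letter in value is a simple Latin letter."""
    return not non_simple_latin_letters(value)


if __name__ == "__main__":
    for name in ["Sebastien", "Sébastien"]:
        print(name, is_soundex_safe(name), non_simple_latin_letters(name))
```

Running a check like this on an export of the profiled column gives a rough idea of which frequency rows cannot be drilled down into with the View Rows option.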