Skip to main content

Character-based patterns

Talend Data Preparation allows you to analyze the character-based patterns repartition in your data.
Patterns representing Latin characters, as well as Asian characters, split between Hiragana, Katakana, Kanji and Hangul
Character Pattern
Latin numbers 9 replaces all ASCII digits
Latin lowercase letters a replaces all ASCII Latin characters
Latin uppercase letters A replaces all uppercase Latin characters
Hiragana H replaces all Hiragana characters
Katakana K replaces all Katakana characters
Kanji C replaces Chinese characters
Hangul G replaces Hangul characters

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!