Profiling Hive - Cloud

Talend Cloud API Services Platform Studio User Guide

author
Talend Documentation Team
EnrichVersion
Cloud
EnrichProdName
Talend Cloud
task
Design and Development
EnrichPlatform
Talend Management Console
Talend Studio
Once you create the Hive connection via the connection to the Hadoop distribution as outlined in Creating a connection to Hive, you can analyze the data present in all Hive tables.

Procedure

  1. Under the Metadata node in the DQ Repository tree view browse to the Hive connection.
  2. Right-click the Hive connection and select Overview Analysis.

    This analysis profiles database content to have an overview of the number of tables and rows per table. For further information, see Analyzing databases.

  3. Right-click a Hive table and select any of the analyses listed in the menu.

    A wizard guides you through the steps to create the selected analysis. You can then assign indicators to the analyzed columns according to your need.