Profiling Hive - Cloud

Talend Cloud Data Management Platform Studio User Guide

EnrichVersion
Cloud
EnrichProdName
Talend Cloud
EnrichPlatform
Talend Management Console
Talend Studio
task
Design and Development
Once you create the Hive connection via the connection to the Hadoop distribution as outlined in Creating a connection to Hive, you can analyze the data present in all Hive tables.

Procedure

  1. Under the Metadata node in the DQ Repository tree view browse to the Hive connection.
  2. Right-click the Hive connection and select Overview Analysis.

    This analysis profiles database content to have an overview of the number of tables and rows per table. For further information, see Analyzing databases.

  3. Right-click a Hive table and select any of the analyses listed in the menu.

    A wizard guides you through the steps to create the selected analysis. You can then assign indicators to the analyzed columns according to your need.