Profiling an ADLS Databricks file - Cloud

Talend Cloud Data Management Platform Studio User Guide

Version: Cloud
Language: English (United States)
Product: Talend Cloud
Module: Talend Management Console, Talend Studio
Content: Design and Development

From the Profiling perspective of Talend Studio, you can generate a column analysis on an ADLS Databricks file through Hive.

A JDBC connection is required to connect to Hive on Databricks.
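As a rough illustration of what such a connection string can look like, the sketch below assembles a JDBC URL in Java. It assumes the URL layout used by the legacy Simba Spark driver that is commonly paired with Databricks (`jdbc:spark://` prefix, port 443, HTTP transport). Newer Databricks drivers use a different prefix, so check the documentation of the driver you actually download. The class name, the host name, and the HTTP path are hypothetical placeholders and do not come from this guide.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class DatabricksJdbcUrl {

    // Builds a JDBC URL from a workspace host and a cluster HTTP path.
    // Assumption: legacy Simba Spark driver URL layout; adapt to your driver.
    static String buildUrl(String host, String httpPath) {
        Map<String, String> props = new LinkedHashMap<>();
        props.put("transportMode", "http"); // connect over HTTP(S)
        props.put("ssl", "1");              // enable TLS
        props.put("httpPath", httpPath);    // cluster-specific path
        props.put("AuthMech", "3");         // user name / password authentication
        props.put("UID", "token");          // literal user name for token-based login

        StringBuilder url = new StringBuilder("jdbc:spark://")
                .append(host)
                .append(":443/default");
        for (Map.Entry<String, String> e : props.entrySet()) {
            url.append(';').append(e.getKey()).append('=').append(e.getValue());
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // Hypothetical placeholder values; replace with your own workspace details.
        String url = buildUrl(
                "adb-0000000000000000.0.azuredatabricks.net",
                "sql/protocolv1/o/0/0000-000000-example");
        System.out.println(url);
    }
}
```

The access token is deliberately left out of the URL in this sketch. It is usually safer to enter it in the password field of the connection than to embed it in a string that may be logged or shared.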

Procedure

To create a profiling analysis on an ADLS file, you must:

  1. Download the JDBC driver and add it to the Studio.
  2. Create a JDBC connection to the Databricks cluster.
  3. Create a column analysis with simple indicators on the table and columns.
    Note: These steps are described in the following procedures.
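To give an idea of what simple indicators measure on a column, the following Java sketch computes a few counts over an in-memory list of values. It is not how Talend Studio evaluates indicators (Talend Studio runs the analysis against the Hive table through the JDBC connection). The class name, the indicator names, and their exact definitions are assumptions made for illustration. For example, null values are excluded here from the distinct, unique, and duplicate counts.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SimpleIndicators {

    // Computes basic counts for one column of string values.
    static Map<String, Long> compute(List<String> column) {
        Map<String, Integer> frequency = new HashMap<>();
        long nullCount = 0;
        long blankCount = 0;

        for (String value : column) {
            if (value == null) {
                nullCount++;
                continue; // assumption: nulls are not counted as values
            }
            if (value.trim().isEmpty()) {
                blankCount++; // empty or whitespace-only value
            }
            frequency.merge(value, 1, Integer::sum);
        }

        long uniqueCount = 0;    // values that occur exactly once
        long duplicateCount = 0; // values that occur more than once
        for (int occurrences : frequency.values()) {
            if (occurrences == 1) {
                uniqueCount++;
            } else {
                duplicateCount++;
            }
        }

        Map<String, Long> result = new LinkedHashMap<>();
        result.put("rowCount", (long) column.size());
        result.put("nullCount", nullCount);
        result.put("blankCount", blankCount);
        result.put("distinctCount", (long) frequency.size());
        result.put("uniqueCount", uniqueCount);
        result.put("duplicateCount", duplicateCount);
        return result;
    }

    public static void main(String[] args) {
        // Small made-up sample column, including a null and an empty string.
        List<String> sample = java.util.Arrays.asList("a", "b", "a", null, "", "c");
        compute(sample).forEach((name, count) -> System.out.println(name + " = " + count));
    }
}
```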

What to do next

You can modify the analysis settings and add other indicators as needed. You can also create other analyses on this ADLS file later by using the same Hive table.