Skip to main content Skip to complementary content

Profiling an ADLS Databricks file

From the Profiling perspective of Talend Studio, you can generate a column analysis on an ADLS Databricks file through Hive.

A JDBC connection is required to connect to Hive on Databricks.

Procedure

To create a profiling analysis on an ADLS file, you must:

  1. Download the JDBC driver and add it to the Studio.
  2. Create a JDBC connection to the ADLS cluster.
  3. Create a column analysis with simple indicators on the table and columns.
    Those steps are described in the following procedures.

What to do next

You can modify the analysis settings and add other indicators as needed. You can also create other analyses later on this ADLS file by using the same Hive table.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!