
Data Profiling Attributes

Data profiling attributes can be used in the SELECT list, FROM clause, WHERE clause, and ORDER BY clause of an MQL query.

The following Data Profiling attributes are supported on file fields or table/view columns.

| Data Profiling attribute name | Description | Value examples | Value data type | Supported operators in WHERE |
| --- | --- | --- | --- | --- |
| "Data Profiling"."Distinct" | Number of distinct rows | 1001 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Duplicate | Number of duplicate rows | 5 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Valid | Number of valid rows | 5 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Empty | Number of empty rows: null in database or empty in files | 5 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Invalid | Number of invalid rows | 5 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Min | Lowest value | 1.0 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Max | Highest value | 3.123456E30 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Mean | Mean value | 3.123456E30 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Variance | Variance value | 3.123456E30 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling".Median | Median value | 3.123456E30 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Lower Quantile" | Lower quantile | 3.123456E30 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Upper Quantile" | Upper quantile | 3.123456E30 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Avg Length" | Average length of strings | 6.5 | Double | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Min Length" | Minimum length of strings | 3 | Integer | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Max Length" | Maximum length of strings | 20 | Integer | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Inferred Data Types" | List of data types detected by the profiler | ["integer", "string"] | Array of String | =, !=, = ANY (...), != ANY (...), = ALL (...), != ALL (...), [NOT] EXISTS |

The following Data Profiling attributes are supported on files, tables and views.

| Data Profiling attribute name | Description | Value examples | Value data type | Supported operators in WHERE |
| --- | --- | --- | --- | --- |
| "Data Profiling"."Profiled Date" | Last profiled date | 2022-09-24 13:25:17 | String | =, BEFORE, AFTER, BETWEEN ... AND ... |
| "Data Profiling"."Processed Records" | Processed row count | 10 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
| "Data Profiling"."Total Records" | Total row count | 100 | Long | =, !=, = ANY (...), != ANY (...), [NOT] EXISTS, <, <=, >, >= and BETWEEN |
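
The operator lists above can be turned into a small client-side check before a filter is sent to the server. The following Python sketch is illustrative only: the function names (`attribute_ref`, `compare`, `between`) are hypothetical and not part of any MIMM API, and the code only assembles the WHERE predicate text. It assumes that every attribute name may be written in double quotes, as "Data Profiling"."Distinct" is above, and it covers only the numeric attributes and their comparison operators.

```python
# Hypothetical helpers: they build WHERE predicate text only and call no MIMM API.
# The attribute names and operators are copied from the listings above.

COMPARISON_OPS = {"=", "!=", "<", "<=", ">", ">="}

# Attributes whose value data type is Long, Double, or Integer.
NUMERIC_ATTRIBUTES = {
    "Distinct", "Duplicate", "Valid", "Empty", "Invalid",
    "Min", "Max", "Mean", "Variance", "Median",
    "Lower Quantile", "Upper Quantile",
    "Avg Length", "Min Length", "Max Length",
    "Processed Records", "Total Records",
}


def attribute_ref(name):
    """Return the qualified reference, e.g. "Data Profiling"."Min Length"."""
    return f'"Data Profiling"."{name}"'


def _require_numeric(name):
    if name not in NUMERIC_ATTRIBUTES:
        raise ValueError(f"not a numeric data profiling attribute: {name!r}")


def compare(name, op, value):
    """Build a predicate such as "Data Profiling"."Distinct" > 1000."""
    _require_numeric(name)
    if op not in COMPARISON_OPS:
        raise ValueError(f"unsupported comparison operator: {op!r}")
    return f"{attribute_ref(name)} {op} {value}"


def between(name, low, high):
    """Build a BETWEEN ... AND ... predicate for a numeric attribute."""
    _require_numeric(name)
    return f"{attribute_ref(name)} BETWEEN {low} AND {high}"


if __name__ == "__main__":
    print(compare("Distinct", ">", 1000))
    print(between("Min Length", 3, 20))
```

The helpers reject "Inferred Data Types" and "Profiled Date" on purpose, since those attributes support a different operator set. The rest of the query (SELECT list, FROM clause) is left out because its syntax is not described in this section.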

When a user filters on a data profiling attribute, MIMM removes from the results every object whose data is marked as hidden for that user. Whether data is hidden depends on several factors: whether the user has the Data Viewer capability through an object role assignment, whether a harvested object has a proposed or assigned data class that results in a Sensitivity Label with the HIDE DATA flag, or whether the object itself has a Sensitivity Label with the HIDE DATA flag.
