Presentation of the feature importance report - Cloud - 8.0

Data matching with Talend tools

Version
Cloud
8.0
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Data Quality components > Matching components > Continuous matching components
Data Governance > Third-party systems > Data Quality components > Matching components > Data matching components
Data Governance > Third-party systems > Data Quality components > Matching components > Fuzzy matching components
Data Governance > Third-party systems > Data Quality components > Matching components > Matching with machine learning components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Continuous matching components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Data matching components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Fuzzy matching components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Matching with machine learning components
Design and Development > Third-party systems > Data Quality components > Matching components > Continuous matching components
Design and Development > Third-party systems > Data Quality components > Matching components > Data matching components
Design and Development > Third-party systems > Data Quality components > Matching components > Fuzzy matching components
Design and Development > Third-party systems > Data Quality components > Matching components > Matching with machine learning components
Last publication date
2024-02-06

The first page

This page contains:
  • The Job name and the date/time (UTC) in the top left and right respectively
  • The heat map: each cell represents a feature. The color shade indicates if a feature is important in the model. The more important is the feature, the darker is the color.
The possible values in the heat map are:
  • A number between 0 and 1.000 rounded off to three decimal digits
  • 0.000: the value, for example 0.0001, is rounded to the nearest value.
  • 0: the feature is not used
  • N/A: the feature is not computed

For more information on the heat map, see Analyzing the heat map.

The second page

This page contains:
  • The Job name and the date/time (UTC) in the top left and right respectively
  • The parameters set in the Advanced settings tab of the tMatchModel component, including the hyper-parameters (the number of trees and tree-depth ranges)
  • The number of trees for the best model
  • The maximum tree depth for the best model
  • The model quality