About this task
This report analyzes the data in the datamart for a given database to check the storage requirements of specific columns. It highlights the columns with the largest difference between the defined column size and the actual maximum length of the stored data. This helps the administrator tune the database server for better performance by making sure that physical storage space is not wasted in any of the analyzed columns.
You have accessed Talend DQ Portal as a user.
At least one report has been generated on a column analysis in the Profiling perspective of Talend Studio. The column analysis must use the text statistics indicators, mainly Minimal Length, Maximal Length and Average Length.
To launch a report in order to analyze column size in a specific database, do the following:
From the user interface, click the icon, point to Reports > Integrity reports and then click Column size analysis.
The corresponding page opens.
Click in the Header field and select YES if you want to insert a logo in the report to launch.
The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.
- Click the CONNECTION explore icon to display a dialog box that lists the database connections created in the Profiling perspective of Talend Studio.
- From the CONNECTION list, select the database connection used for the column analyses carried out in the Profiling perspective of Talend Studio.
Click Confirm at the bottom right corner of the dialog box.
The name of the selected connection is displayed in the CONNECTION field.
Click Execute at the top of the Parameters panel.
A loading indicator is displayed, and then a report opens in the page. If more than one column analysis uses the selected database connection, the report covers all of them.
In this example, three reports were initially generated on three column analyses in the Profiling perspective of Talend Studio. The report generated from Talend DQ Portal on the selected database connection gives the following information about all the analyzed columns in the three analyses:
names of the analyzed columns.
result of subtracting the Max Length from the column size. This shows how much of the defined column size is not used by the data actually stored in the column.
data length defined for the column in the database.
computes the minimal length of the text in the column.
computes the average length of the text in the column.
computes the maximum length of the text in the column.
The results shown in the report help the administrator to reduce the physical storage requirements (column size) of certain columns and thus save space in these columns. Tuning column storage space in this way reduces the physical storage used by the table and the overall database size.
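The arithmetic behind the report can be illustrated with a minimal Python sketch. It is not the portal's implementation: the function `column_size_report`, the dictionary keys and the sample data are hypothetical, and lengths are counted in characters, whereas a database may measure column size in bytes.

```python
from statistics import mean


def column_size_report(columns):
    """Build one row per column, sorted by unused size (largest first).

    `columns` maps a column name to (declared_size, list_of_text_values).
    All names here are illustrative assumptions, not Talend identifiers.
    """
    rows = []
    for name, (declared_size, values) in columns.items():
        # Ignore NULL values when measuring text length.
        lengths = [len(v) for v in values if v is not None]
        if not lengths:
            continue  # nothing to measure for an empty column
        max_len = max(lengths)
        rows.append({
            "column": name,
            # Column size minus Max Length: the unused part of the column.
            "difference": declared_size - max_len,
            "column_size": declared_size,
            "min_length": min(lengths),
            "avg_length": mean(lengths),
            "max_length": max_len,
        })
    # Columns with the largest difference come first.
    rows.sort(key=lambda r: r["difference"], reverse=True)
    return rows


# Hypothetical sample data: declared size and a few stored values.
sample = {
    "zip": (10, ["75001", "69002"]),
    "city": (100, ["Paris", "Lyon", "Bordeaux"]),
}
report = column_size_report(sample)
for row in report:
    print(row)
```

In this sample, `city` is listed first because its declared size of 100 exceeds its longest value (8 characters) by 92, which makes it the strongest candidate for a smaller column size.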
In the top right corner of the page, click to save the report parameters.
You can run a saved report without redefining its parameters. For further information, see Accessing the list of defined reports.