tHDFSRowCount - 7.3

HDFS

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > File components (Integration) > HDFS components
Data Quality and Preparation > Third-party systems > File components (Integration) > HDFS components
Design and Development > Third-party systems > File components (Integration) > HDFS components
Last publication date
2024-02-21

Reads a file in HDFS row by row in order to determine the number of rows this file contains.

tHDFSRowCount counts the number of rows in a file in HDFS. If the file to be processed is a Hadoop sequence file type or a large dataset, it is recommended to use a tAggregateRow to count the records.

For more technologies supported by Talend, see Talend components.