tHDFSRowCount

HDFS

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Data Fabric
Talend Big Data Platform
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
Talend Big Data
task
Data Quality and Preparation > Third-party systems > File components (Integration) > HDFS components
Data Governance > Third-party systems > File components (Integration) > HDFS components
Design and Development > Third-party systems > File components (Integration) > HDFS components
EnrichPlatform
Talend Studio

Reads a file in HDFS row by row in order to determine the number of rows this file contains.

tHDFSRowCount counts the number of rows in a file in HDFS. If the file to be processed is a Hadoop sequence file type or a large dataset, it is recommended to use a tAggregateRow to count the records.

For more technologies supported by Talend, see Talend components.