tHDFSRowCount - 7.1

HDFS

EnrichVersion
7.1
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio
task
Data Governance > Third-party systems > File components (Integration) > HDFS components
Data Quality and Preparation > Third-party systems > File components (Integration) > HDFS components
Design and Development > Third-party systems > File components (Integration) > HDFS components

Reads a file in HDFS row by row in order to determine the number of rows this file contains.

tHDFSRowCount counts the number of rows in a file in HDFS. If the file to be processed is a Hadoop sequence file type or a large dataset, it is recommended to use a tAggregateRow to count the records.

For more technologies supported by Talend, see Talend components.