- On the Integration perspective, drop the following components from the Palette onto the design workspace: tFixedFlowInput, tHDFSOutput, tHDFSInput, tLogRow and three tLibraryLoad.
- Connect tFixedFlowInput to tHDFSOutput using a link.
Do the same to connect tHDFSInput to tLogRow.
Double-click one of the three tLibraryLoad components to open its Component view.
Click the [...] button to open the Module wizard and select the library to be loaded.
In this example, load azure-data-lake-store-sdk-2.1.4.jar. This is one of the libraries required by the HDFS components to work with Azure Data Lake Store. You can find this jar in the MVN repository such as Azure Data Lake Store Java Client SDK
Do the same to use the other two tLibraryLoad components to load the other two libraries.
In this example, these libraries are hadoop-azure-datalake-2.6.0-cdh5.12.1.jar and jackson-core-2.8.4.jar.