To handle the access log file to be analyzed on the Hadoop system, you needed to define an appropriate schema in the relevant components.
To simplify the configuration, before we start to configure the Jobs, we can save the read-only schema of the tApacheLogInput component as a generic schema that can be reused across Jobs.
- In the Job B_HCatalog_Read, double-click the tApacheLogInput component to open its Basic settings view.
- Click the [...] button next to the Edit schema to open the Schema dialog box.
- Click the button to open the Select folder dialog box.
- In this example we have not created any folder under the Generic schemas node, so simply click OK to close the dialog box and open the generic schema setup wizard.
Give your generic schema a name, access_log in this example, and click Finish to close the wizard and save the schema.
Click OK to close the Schema dialog box. Now the generic schema
appears under the Generic schemas node of
the Repository view and is ready for use
where it is needed in your Job configurations.