tFileUnarchive - 6.3

Talend Open Studio for Big Data Components Reference Guide



Decompresses the archive file provided as parameter and puts it in the extraction directory.


Decompresses an archive file for further processing. The following formats are supported: *.tar.gz, *.tgz, *.tar, *.gz and *.zip.

tFileUnarchive Properties

Component family



Basic settings

Archive file

File path to the archive.


Extraction Directory

Folder in which the extracted file(s) are placed.


Use archive file name as root directory

Select this check box to create a folder named after the archive under the specified directory, if it does not already exist, and extract the archived file(s) into that folder.
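The effect of this option can be illustrated with a minimal sketch. It is not the component's generated code: the class name RootDirectorySketch, the method names and the rule for stripping the extension from the archive name are assumptions made for illustration only.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Locale;

// Hypothetical helper: resolves the folder that extracted files would go to.
final class RootDirectorySketch {

    // Longer suffixes first, so that ".tar.gz" is matched before ".gz".
    private static final String[] EXTENSIONS = {".tar.gz", ".tgz", ".tar", ".gz", ".zip"};

    // Assumption: the root folder is the archive file name without its extension.
    static String baseName(String archiveFileName) {
        String lower = archiveFileName.toLowerCase(Locale.ROOT);
        for (String ext : EXTENSIONS) {
            if (lower.endsWith(ext)) {
                return archiveFileName.substring(0, archiveFileName.length() - ext.length());
            }
        }
        return archiveFileName;
    }

    static Path resolveTarget(Path extractionDir, Path archive, boolean useArchiveNameAsRoot)
            throws IOException {
        Path target = useArchiveNameAsRoot
                ? extractionDir.resolve(baseName(archive.getFileName().toString()))
                : extractionDir;
        // createDirectories does nothing if the folder already exists.
        Files.createDirectories(target);
        return target;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.out.println("usage: RootDirectorySketch <archive> <extractionDir>");
            return;
        }
        System.out.println(resolveTarget(Paths.get(args[1]), Paths.get(args[0]), true));
    }
}
```

With the option enabled in this sketch, an archive named sales.zip and an extraction directory out resolve to out/sales; with the option disabled, files go directly to out.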


Check the integrity before unzip

Select this check box to run an integrity check before unzipping the archive.
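How the component performs this check internally is not described here. As a rough illustration of what an integrity check on a *.zip file can look like, the following sketch reads every entry with the standard java.util.zip classes and compares the computed CRC-32 with the value recorded in the archive. The class name IntegrityCheckSketch and the method isIntact are hypothetical.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Enumeration;
import java.util.zip.CRC32;
import java.util.zip.CheckedInputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical helper: verifies a zip archive before it is extracted.
final class IntegrityCheckSketch {

    static boolean isIntact(Path zipPath) {
        byte[] buffer = new byte[8192];
        try (ZipFile zip = new ZipFile(zipPath.toFile())) {
            Enumeration<? extends ZipEntry> entries = zip.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                if (entry.isDirectory()) {
                    continue;
                }
                CRC32 crc = new CRC32();
                try (InputStream in = new CheckedInputStream(zip.getInputStream(entry), crc)) {
                    // Drain the entry so that the checksum covers all of its bytes.
                    while (in.read(buffer) != -1) {
                        // nothing to do, the CheckedInputStream updates the CRC
                    }
                }
                if (crc.getValue() != entry.getCrc()) {
                    return false;
                }
            }
            return true;
        } catch (IOException e) {
            // Unreadable, missing or malformed archives are reported as not intact.
            return false;
        }
    }

    public static void main(String[] args) {
        if (args.length < 1) {
            System.out.println("usage: IntegrityCheckSketch <archive.zip>");
            return;
        }
        System.out.println(isIntact(Paths.get(args[0])));
    }
}
```

Running such a check first means a damaged archive is detected before any file is written to the extraction directory.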


Extract file paths

Select this check box to reproduce the directory structure stored in the archive.
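The difference this option makes can be sketched for a *.zip archive as follows. This is an illustration under stated assumptions, not the component's implementation: the class name ExtractPathsSketch is hypothetical, and the behavior shown when the option is cleared (all files written directly into the target folder) is an assumption. The sketch also rejects entries that would resolve outside the target folder.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper: extracts a zip archive with or without its folder structure.
final class ExtractPathsSketch {

    static List<Path> unzip(Path zip, Path targetDir, boolean extractPaths) throws IOException {
        List<Path> written = new ArrayList<>();
        Path root = targetDir.toAbsolutePath().normalize();
        try (ZipInputStream in = new ZipInputStream(Files.newInputStream(zip))) {
            ZipEntry entry;
            while ((entry = in.getNextEntry()) != null) {
                if (entry.isDirectory()) {
                    continue; // parent folders are created on demand for files
                }
                Path relative = Paths.get(entry.getName());
                // Keep "a/b/c.txt" as is, or reduce it to "c.txt" (assumed behavior).
                Path out = (extractPaths ? root.resolve(relative)
                                         : root.resolve(relative.getFileName())).normalize();
                if (!out.startsWith(root) || out.equals(root)) {
                    throw new IOException("Entry escapes target directory: " + entry.getName());
                }
                Files.createDirectories(out.getParent());
                // Copies the bytes of the current entry only; the stream stays open.
                Files.copy(in, out, StandardCopyOption.REPLACE_EXISTING);
                written.add(out);
            }
        }
        return written;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.out.println("usage: ExtractPathsSketch <archive.zip> <targetDir>");
            return;
        }
        unzip(Paths.get(args[0]), Paths.get(args[1]), true).forEach(System.out::println);
    }
}
```

In this sketch, an entry stored as a/b/c.txt is written to a/b/c.txt under the target folder when extractPaths is true, and to c.txt when it is false. Note that flattening can make files with the same name overwrite each other.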


Need a password

Select this check box and provide the correct decryption method and password if the archive to be unzipped is password protected. Note that the encrypted archive must have been created by the tFileArchive component; otherwise, extraction fails with error messages, or nothing is extracted even though no error message is displayed.

Decrypt method: select the decryption method from the list, either Java Decrypt or Zip4j Decrypt.

Enter the password: enter the decryption password.

To enter the password, click the [...] button next to the password field, and then in the pop-up dialog box enter the password between double quotes and click OK to save the settings.

Advanced settings

tStatCatcher Statistics

Select this check box to gather the processing metadata at the Job level as well as at each component level.


This component can be used as a standalone component but it can also be used within a Job as a Start component using an Iterate link.

Global Variables

CURRENT_FILE: the current file name. This is a Flow variable and it returns a string.

CURRENT_FILEPATH: the current file path. This is a Flow variable and it returns a string.

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable is set only if the Die on error check box is cleared, when the component has this check box.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio User Guide.
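As an illustration of how these variables are typically read in Java code within a Job, the sketch below looks them up in a map by a key made of the component name followed by the variable name. To keep the example self-contained, globalMap is simulated with a plain HashMap, and the component name tFileUnarchive_1, the class name GlobalVariablesSketch and the sample values are assumptions.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for code that reads the component's global variables.
final class GlobalVariablesSketch {

    static String describe(Map<String, Object> globalMap, String componentName) {
        // Flow variables: expected to be set while the component iterates over files.
        String file = (String) globalMap.get(componentName + "_CURRENT_FILE");
        String path = (String) globalMap.get(componentName + "_CURRENT_FILEPATH");
        // After variable: expected to be set once the component has finished.
        String error = (String) globalMap.get(componentName + "_ERROR_MESSAGE");

        if (error != null) {
            return "Extraction failed: " + error;
        }
        return "Extracted " + file + " to " + path;
    }

    public static void main(String[] args) {
        // Simulated map; in a real Job the Studio provides and fills globalMap.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tFileUnarchive_1_CURRENT_FILE", "report.csv");
        globalMap.put("tFileUnarchive_1_CURRENT_FILEPATH", "/tmp/out/report.csv");
        System.out.println(describe(globalMap, "tFileUnarchive_1"));
    }
}
```

The cast to String mirrors the fact that each of these variables returns a string.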


Outgoing links (from this component to another):

Row: Iterate.

Trigger: On Subjob Ok; On Subjob Error; Run if; On Component Ok; On Component Error.

Incoming links (from one component to this one):

Row: Iterate.

Trigger: Run if; On Subjob Ok; On Subjob Error; On Component Ok; On Component Error; Synchronize; Parallelize.

For further information regarding connections, see Talend Studio User Guide.


If you are using a subscription-based version of the Studio, the activity of this component can be logged using the log4j feature. For more information on this feature, see Talend Studio User Guide.

For more information on the log4j logging levels, see the Apache documentation at



The following file types can be decompressed: *.tar.gz, *.tgz, *.tar, *.gz and *.zip.
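A Job can check this limitation before passing a file to the component. The following minimal sketch classifies a file name by its extension; the class name ArchiveFormatSketch and the case-insensitive comparison are assumptions, and the sketch does not inspect the file content.

```java
import java.util.Locale;

// Hypothetical helper: maps a file name to one of the supported archive formats.
final class ArchiveFormatSketch {

    enum Format { TAR_GZ, TAR, GZ, ZIP, UNSUPPORTED }

    static Format detect(String fileName) {
        // Assumption: extensions are compared without regard to case.
        String name = fileName.toLowerCase(Locale.ROOT);
        // ".tar.gz" must be tested before ".gz", otherwise it would be seen as plain gzip.
        if (name.endsWith(".tar.gz") || name.endsWith(".tgz")) {
            return Format.TAR_GZ;
        }
        if (name.endsWith(".tar")) {
            return Format.TAR;
        }
        if (name.endsWith(".gz")) {
            return Format.GZ;
        }
        if (name.endsWith(".zip")) {
            return Format.ZIP;
        }
        return Format.UNSUPPORTED;
    }

    public static void main(String[] args) {
        for (String name : new String[] {"logs.tar.gz", "backup.tgz", "data.zip", "notes.rar"}) {
            System.out.println(name + " -> " + detect(name));
        }
    }
}
```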

Related scenario

For a scenario that uses tFileUnarchive, see tFileCompare.