tSPSSOutput - 6.3

Talend Open Studio for Big Data Components Reference Guide



tSPSSOutput writes data entries in an .sav file.


tSPSSOutput writes or appends data to an SPSS .sav file. It creates SPSS files on the fly and overwrites existing ones.

tSPSSOutput properties

Component family

Business Intelligence


Basic settings

Sync schema

Click this button to synchronize with the columns of the SPSS .sav file.


Schema and Edit Schema

A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. The schema is either Built-In or stored remotely in the Repository.

Since version 5.6, both the Built-In mode and the Repository mode are available in any of the Talend solutions.

Click Edit schema to make changes to the schema. If the current schema is of the Repository type, three options are available:

  • View schema: choose this option to view the schema only.

  • Change to built-in property: choose this option to change the schema to Built-in for local changes.

  • Update repository connection: choose this option to change the schema stored in the repository and decide whether to propagate the changes to all the Jobs upon completion. If you just want to propagate the changes to the current Job, you can select No upon completion and choose this schema metadata again in the [Repository Content] window.



Built-in: The schema is created and stored locally for this component only. Related topic: see Talend Studio User Guide.



Repository: The schema already exists and is stored in the Repository, hence can be reused. Related topic: see Talend Studio User Guide.



Filename

Name or path of the SPSS .sav file to be written.


Write Type

Select an operation from the list:

Write: simply writes the new data.

Append: writes the new data at the end of the existing data.
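The difference between the two write types can be sketched with a plain text file. This is only an analogy under stated assumptions: it does not produce the SPSS .sav format, and the WriteType enum, the writeRows method, and the sample rows are hypothetical names made up for illustration, not part of the component.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.Arrays;
import java.util.List;

public class WriteTypeSketch {

    // Hypothetical stand-in for the Write Type list; not a Talend class.
    enum WriteType { WRITE, APPEND }

    // Writes rows to a plain text file.
    // WRITE replaces any existing content, APPEND adds rows after it.
    static void writeRows(Path file, List<String> rows, WriteType type) throws IOException {
        if (type == WriteType.APPEND) {
            Files.write(file, rows, StandardCharsets.UTF_8,
                    StandardOpenOption.CREATE, StandardOpenOption.APPEND);
        } else {
            Files.write(file, rows, StandardCharsets.UTF_8,
                    StandardOpenOption.CREATE, StandardOpenOption.WRITE,
                    StandardOpenOption.TRUNCATE_EXISTING);
        }
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("write-type-sketch", ".txt");
        // Sample rows are invented for this illustration.
        writeRows(file, Arrays.asList("1;France", "2;Spain"), WriteType.WRITE);
        writeRows(file, Arrays.asList("3;Italy"), WriteType.APPEND);
        // prints [1;France, 2;Spain, 3;Italy]
        System.out.println(Files.readAllLines(file, StandardCharsets.UTF_8));
        Files.delete(file);
    }
}
```

A second call with WriteType.WRITE on the same file would leave only the rows of that call, which mirrors the overwrite behavior described for this component.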

Advanced settings

tStatCatcher Statistics

Select this check box to collect log data at the component level.

Global Variables

NB_LINE: the number of rows read by an input component or transferred to an output component. This is an After variable and it returns an integer.

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. If the component has a Die on error check box, this variable functions only when that check box is cleared.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio User Guide.
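As a minimal sketch of how these After variables are usually read: in Job expressions they are typically retrieved from a map named globalMap with a key made of the component label and the variable name, for example ((Integer)globalMap.get("tSPSSOutput_1_NB_LINE")). The standalone class below imitates that lookup with an ordinary HashMap. The component label tSPSSOutput_1, the helper methods, and the value 10 are assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

public class GlobalVariablesSketch {

    // Reads the NB_LINE After variable of a component from a globalMap-style map.
    // Returns 0 when the variable has not been set yet.
    static int nbLine(Map<String, Object> globalMap, String componentName) {
        Object value = globalMap.get(componentName + "_NB_LINE");
        return value == null ? 0 : (Integer) value;
    }

    // Reads the ERROR_MESSAGE After variable; returns null when no error was recorded.
    static String errorMessage(Map<String, Object> globalMap, String componentName) {
        return (String) globalMap.get(componentName + "_ERROR_MESSAGE");
    }

    public static void main(String[] args) {
        // Stand-in for the map maintained by a running Job; the value is made up.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tSPSSOutput_1_NB_LINE", 10);

        System.out.println("Rows written: " + nbLine(globalMap, "tSPSSOutput_1")); // Rows written: 10
        System.out.println("Error: " + errorMessage(globalMap, "tSPSSOutput_1"));  // Error: null
    }
}
```

Because both variables are After variables, a lookup of this kind only returns meaningful values in a component that runs after tSPSSOutput has finished.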


This component cannot be used as a start component. It requires an input flow.


To use the full functionality of the SPSS components, make sure to do one of the following:

  • If you have already installed SPSS, add the path to the SPSS directory to the PATH variable as follows: SET PATH=%PATH%;<DR>:\program\SPSS

  • If you have not installed SPSS, copy the SPSS IO library "spssio32.dll" from the SPSS installation CD and paste it in the Talend root directory.
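To verify the first option before running a Job, a small check such as the following could be used. It is a sketch, not part of Talend or SPSS: the class name and the findOnPath helper are hypothetical, and it only looks for spssio32.dll in the directories listed in PATH, so it does not cover the case where the library was copied to the Talend root directory.

```java
import java.io.File;
import java.nio.file.Files;
import java.nio.file.InvalidPathException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.regex.Pattern;

public class SpssLibraryCheck {

    // Returns the first file named fileName found in the directories of pathValue,
    // or null when no directory contains it.
    static Path findOnPath(String pathValue, String fileName) {
        if (pathValue == null || pathValue.isEmpty()) {
            return null;
        }
        for (String dir : pathValue.split(Pattern.quote(File.pathSeparator))) {
            if (dir.isEmpty()) {
                continue;
            }
            try {
                Path candidate = Paths.get(dir, fileName);
                if (Files.isRegularFile(candidate)) {
                    return candidate;
                }
            } catch (InvalidPathException e) {
                // Skip PATH entries that are not valid paths on this platform.
            }
        }
        return null;
    }

    public static void main(String[] args) {
        Path found = findOnPath(System.getenv("PATH"), "spssio32.dll");
        System.out.println(found != null
                ? "Found " + found
                : "spssio32.dll not found on PATH");
    }
}
```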


If you are using a subscription-based version of the Studio, the activity of this component can be logged using the log4j feature. For more information on this feature, see Talend Studio User Guide.

For more information on the log4j logging levels, see the Apache documentation at http://logging.apache.org/log4j/1.2/apidocs/org/apache/log4j/Level.html.


Oracle provides two kinds of JVM platforms (32-bit and 64-bit). By default, the JVM used on a 64-bit operating system is the 64-bit version. Because the JSPSS.dll file used by this component is compiled for the 32-bit JVM, you need to configure the 32-bit JVM for Job execution when Talend Studio is installed on a 64-bit operating system.

To do so, perform the following:

  1. Open the relevant Job in the workspace.

  2. Click the Run tab and enter the Advanced settings view.

  3. Select the Use specific JVM arguments check box and click the New... button.

  4. In the Set the VM argument box, enter -d32 and click OK to validate.
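To confirm that the Job really runs on a 32-bit JVM after the steps above, the bitness can be checked from Java code. The sketch below is a heuristic under stated assumptions: it relies on the sun.arch.data.model system property, which is available on Oracle/HotSpot JVMs but is not guaranteed elsewhere, and falls back to a rough test on os.arch. The class and method names are hypothetical.

```java
public class JvmBitnessCheck {

    // Decides whether the JVM is 32-bit from the "sun.arch.data.model" value.
    // When that value is missing or "unknown", falls back to a heuristic:
    // an os.arch value without "64" in it is treated as 32-bit.
    static boolean is32Bit(String dataModel, String osArch) {
        if ("32".equals(dataModel)) {
            return true;
        }
        if ("64".equals(dataModel)) {
            return false;
        }
        return osArch != null && !osArch.contains("64");
    }

    public static void main(String[] args) {
        boolean jvm32 = is32Bit(System.getProperty("sun.arch.data.model"),
                                System.getProperty("os.arch"));
        System.out.println(jvm32
                ? "Running on a 32-bit JVM"
                : "Not running on a 32-bit JVM (64-bit or undetermined)");
    }
}
```

The same check could be pasted into a Java code component placed before tSPSSOutput, so that a misconfigured JVM is reported before the native library is loaded.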

Scenario: Writing data in an .sav file

This Java scenario describes a very simple Job that writes data entries in an .sav file.

Setting up the Job

  1. Drop a tRowGenerator component and a tSPSSOutput component from the Palette onto the design workspace.

  2. Right-click on tRowGenerator and connect it to tSPSSOutput using a Main Row link.

Configuring the input component

  1. In the design workspace, double-click tRowGenerator to display its Basic settings view and open its editor. Here you can define your schema.

  2. Click the plus button to add the columns you want to write in the .sav file.

  3. Define the schema and set the parameters of the columns.

    Make sure to define the length of your columns. Otherwise, an error message is displayed when you build your Job.

  4. Click OK to validate your schema and close the editor.
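The requirement to define a length for every column can be illustrated with a small check. This is a simplified sketch, not Talend's schema model: the Column class, its fields, and the sample lengths are hypothetical, and the column names id and country are taken from the example schema used in this scenario.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SchemaLengthCheck {

    // Hypothetical, simplified column description; not Talend's metadata class.
    static final class Column {
        final String name;
        final String type;
        final Integer length; // null means that no length was defined

        Column(String name, String type, Integer length) {
            this.name = name;
            this.type = type;
            this.length = length;
        }
    }

    // Returns the names of the columns whose length is missing or not positive.
    static List<String> columnsWithoutLength(List<Column> schema) {
        List<String> missing = new ArrayList<>();
        for (Column column : schema) {
            if (column.length == null || column.length <= 0) {
                missing.add(column.name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // The country column deliberately has no length to show the failing case.
        List<Column> schema = Arrays.asList(
                new Column("id", "Integer", 5),
                new Column("country", "String", null));
        // prints Columns missing a length: [country]
        System.out.println("Columns missing a length: " + columnsWithoutLength(schema));
    }
}
```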

Configuring the output component

  1. Click tSPSSOutput to display its Basic settings view and define the component properties.

  2. Click the three-dot button next to the Filename field and browse to the SPSS .sav file in which you want to write data.

  3. Click the three-dot button next to Sync columns to synchronize columns with the previous component. In this example, the schema to be inserted in the .sav file consists of two columns: id and country.

  4. If required, click Edit schema to view/edit the defined schema.

  5. From the Write Type list, select Write to simply write the input data in the .sav file, or select Append to add the input data to the end of the existing data in the .sav file.

Job execution

Save the Job and press F6 to execute it.

The data generated by the tRowGenerator component is written in the defined .sav file.