tUniservRTMailBulk - 6.3

Talend Open Studio for Big Data Components Reference Guide

Function

tUniservRTMailBulk creates an index pool for mailRetrieval with predefined input data.

Purpose

tUniservRTMailBulk prepares the index pool for duplicate search.

tUniservRTMailBulk properties

Component family

Data quality

 

Basic settings

Schema and Edit schema

A schema is a row description: it defines the number of fields to be processed and passed on to the next component. The schema is either Built-in or stored remotely in the Repository.

The schema of this component is read-only. You can click Edit schema to view the schema.

 

Host name

Enter the host name of the server between double quotation marks.

 

Port

Enter the listening port number of the server between double quotation marks.

 

Service

The service name is "mail" by default. If necessary (for example, because of a service suffix), enter a different name between double quotation marks.
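
The three connection fields are entered as quoted strings, including the port number. The following Java sketch only illustrates the form of these values: the host name and port shown are hypothetical placeholders, not Uniserv defaults, and the class and method names are made up for this example. Only the default service name "mail" comes from this guide.

```java
public class UniservConnectionSketch {
    // Hypothetical host name; replace it with the name of your Uniserv server.
    static final String HOST_NAME = "localhost";
    // Hypothetical port; note that it is entered as a quoted string, not a number.
    static final String PORT = "7777";
    // Default service name stated in this guide.
    static final String SERVICE = "mail";

    // Builds a readable summary of the three settings (illustration only).
    static String describe() {
        return HOST_NAME + ":" + PORT + "/" + SERVICE;
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```

In the Basic settings view, each field would contain only the quoted literal, for example "localhost" in the Host name field.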

Advanced settings

Uniserv Parameters

For detailed information, refer to the Uniserv mailRetrieval user manual.

 

tStatCatcher Statistics

Select this check box to collect log data at the Job and the component levels.

Global Variables

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. It is available only if the component has a Die on error check box and that check box is cleared.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and select the variable to use.

For further information about variables, see Talend Studio User Guide.
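
As a rough illustration of how an After variable is typically read in Job code, the sketch below looks up ERROR_MESSAGE by a key built from the component label and the variable name. It assumes the component is labeled tUniservRTMailBulk_1 and uses a plain HashMap as a stand-in for the globalMap object that the Studio provides inside a Job. The helper name errorMessageOf and the sample error text are made up for this example.

```java
import java.util.HashMap;
import java.util.Map;

public class ErrorMessageSketch {
    // Looks up <componentName>_ERROR_MESSAGE and returns a fallback text
    // when the component has not recorded an error.
    static String errorMessageOf(Map<String, Object> globalMap, String componentName) {
        Object value = globalMap.get(componentName + "_ERROR_MESSAGE");
        return value == null ? "(no error recorded)" : (String) value;
    }

    public static void main(String[] args) {
        // Stand-in for the globalMap available inside a Job (assumption).
        Map<String, Object> globalMap = new HashMap<>();
        // Made-up error text, set here only so the lookup has something to return.
        globalMap.put("tUniservRTMailBulk_1_ERROR_MESSAGE", "sample error text");

        System.out.println(errorMessageOf(globalMap, "tUniservRTMailBulk_1"));
        System.out.println(errorMessageOf(globalMap, "tMap_1"));
    }
}
```

Inside a Job, the equivalent expression would be ((String)globalMap.get("tUniservRTMailBulk_1_ERROR_MESSAGE")), evaluated in a component that runs after tUniservRTMailBulk has finished.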

Usage

tUniservRTMailBulk prepares the index pool for duplicate search.

Limitation

To use tUniservRTMailBulk, the Uniserv software mailRetrieval must be installed.

An input component and a mapping component (such as tMap) are needed to read the addresses from a database or a file. The component does not have an output connection.

 

Scenario: Creating an index pool

This scenario describes a batch job that loads the address list of an SQL database into the index pool.

The database for this scenario is already saved in the Repository, so all of its schema metadata is available.

  1. In the Repository view, expand the Metadata node and the directory in which the database is saved. Then drag this database into the design workspace.

    A dialog box appears.

  2. Select tMysqlInput and then click OK to close the dialog box.

    The component is then displayed in the workspace.

  3. Drag the following components from the Palette into the design workspace: tMap and tUniservRTMailBulk.

  4. Connect tMap with tUniservRTMailBulk first.

    Accept the schema from tUniservRTMailBulk by clicking Yes on the prompt window.

  5. Connect the other components via Row > Main.

  6. Double-click tMap_1 to open the schema mapping window. The schema of the database is on the left and the schema of tUniservRTMailBulk is on the right. The Schema Editor is displayed at the bottom, where you can view and edit the attributes of the individual columns.

  7. Assign the columns of the input source to the corresponding columns of tUniservRTMailBulk. To do so, select a column of the input source and drag it onto the appropriate column on the right side. The meaning of the individual arguments is described in the Uniserv mailRetrieval user manual.

  8. Click OK to close the window.

  9. Double-click tUniservRTMailBulk to open its Basic settings view.

  10. Fill in the server information and specify the service.

  11. Select Advanced settings to adapt the server parameters.
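
Conceptually, the column assignment made in tMap in the steps above amounts to copying each input column into a column of the target schema. The Java sketch below illustrates that idea only. Every column name in it is hypothetical: the real column names of the tUniservRTMailBulk schema are defined by the component and described in the Uniserv mailRetrieval user manual, and the code that the Studio generates for a Job will look different.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class ColumnMappingSketch {
    // Hypothetical mapping: input column name -> target schema column name.
    static final Map<String, String> COLUMN_MAP = new LinkedHashMap<>();
    static {
        COLUMN_MAP.put("customer_name", "TARGET_NAME");
        COLUMN_MAP.put("customer_street", "TARGET_STREET");
        COLUMN_MAP.put("customer_city", "TARGET_CITY");
    }

    // Copies the mapped columns of one input row into a row of the target
    // schema. Input columns that are not in COLUMN_MAP are dropped.
    static Map<String, String> mapRow(Map<String, String> inputRow) {
        Map<String, String> outputRow = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : COLUMN_MAP.entrySet()) {
            outputRow.put(entry.getValue(), inputRow.get(entry.getKey()));
        }
        return outputRow;
    }

    public static void main(String[] args) {
        // Made-up sample row standing in for one record read from the database.
        Map<String, String> row = new LinkedHashMap<>();
        row.put("customer_name", "Ada Example");
        row.put("customer_street", "1 Sample Road");
        row.put("customer_city", "Springfield");
        row.put("internal_id", "42"); // not mapped, so it is not passed on
        System.out.println(mapRow(row));
    }
}
```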