These properties are used to configure tSnowflakeBulkExec running in the Standard Job framework.
The Standard tSnowflakeBulkExec component belongs to the Cloud family.
The component in this framework is available in all subscription-based Talend products.
Basic settings
Database |
Select a type of database from the list and click Apply. |
Property Type |
Select the way the connection details will be set.
This property is available when Use this Component is selected from the Connection Component drop-down list. |
Connection Component |
Select the component that opens the database connection to be reused by this component. |
Account |
In the Account field, enter, in double quotation marks, the account name that has been assigned to you by Snowflake. This field is available only when Use this Component is selected from the Connection Component drop-down list. |
Snowflake Region |
Select an AWS region or an Azure region from the Snowflake Region drop-down list. This field is available only when Use this Component is selected from the Connection Component drop-down list and the Use Custom Snowflake Region option is not selected in the Advanced settings view. |
User Id and Password |
Enter, in double quotation marks, your authentication information to log in Snowflake.
This field is available only when Use this Component is selected from the Connection Component drop-down list. |
Warehouse |
Enter, in double quotation marks, the name of the Snowflake warehouse to be used. This name is case-sensitive and is normally upper case in Snowflake. This field is available only when Use this Component is selected from the Connection Component drop-down list. |
Schema |
Enter, within double quotation marks, the name of the database schema to be used. This name is case-sensitive and is normally upper case in Snowflake. This field is available only when Use this Component is selected from the Connection Component drop-down list. |
Database |
Enter, in double quotation marks, the name of the Snowflake database to be used. This name is case-sensitive and is normally upper case in Snowflake. This field is available only when Use this Component is selected from the Connection Component drop-down list. |
Table |
Click the [...] button and in the displayed wizard, select the Snowflake table to be used. To load the data into a new table, select Use custom object in the wizard and enter the name of the new table in Object Name field. |
Schema and Edit Schema |
A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. When you create a Spark Job, avoid the reserved word line when naming the fields. Built-In: You create and store the schema locally for this component only. Repository: You have already created the schema and stored it in the Repository. You can reuse it in various projects and Job designs. If the Snowflake data type to be handled is VARIANT, OBJECT or ARRAY, while defining the schema in the component, select String for the corresponding data in the Type column of the schema editor wizard. Click Edit schema to make changes to the schema. If the current schema is of the Repository type, three options are available:
Note that if the input value of any non-nullable primitive field is null, the row of data including that field will be rejected. This component offers the advantage of the dynamic schema feature. This allows you to retrieve unknown columns from source files or to copy batches of columns from a source without mapping each column individually. For further information about dynamic schemas, see Talend Studio User Guide. This dynamic schema feature is designed for the purpose of retrieving unknown columns of a table and is recommended to be used for this purpose only; it is not recommended for the use of creating tables. |
Table Action |
Select the action to be carried out to the table.
|
Output Action |
Select the operation you want to perform to the incoming data and data records in the Snowflake database table. You can insert, delete, update or merge data in the Snowflake table. This option assumes that the Snowflake table specified in Table field already exists.
|
Storage | Select the type of storage from which the data will be
loaded to the table.
|
Stage Folder | Specify the Snowflake stage folder to load data from. This field is available when you select Internal from the Storage drop-down list in the Basic settings view. |
Region | Specify the region where the S3 bucket locates. This field is available when you select S3 from the Storage drop-down list in the Basic settings view. |
Access Key and Secret Key | Enter the authentication information required to connect to
the Amazon S3 bucket to be used. To enter the password, click the [...] button next to the password field, and then in the pop-up dialog box enter the password between double quotes and click OK to save the settings. This field is available when you select S3 from the Storage drop-down list in the Basic settings view. |
Bucket | Enter the name of the bucket to be used to load data. This
bucket must already exist. This field is available when you select S3 from the Storage drop-down list in the Basic settings view. |
Folder | Enter the folder (in double quotation marks) from which you
want to load data. This field is available when S3 or Azure is selected from the Storage drop-down list. |
Protocol | Select the protocol used to create Azure connection. This field is available when you select Azure from the Storage drop-down list in the Basic settings view. |
Account Name | Enter the name (in double quotation marks) of the Azure
storage account you need to access. This field is available when you select Azure from the Storage drop-down list in the Basic settings view. |
Container | Specify the Azure container (in double quotation marks) used for storing and managing
data. This field is available when you select Azure from the Storage drop-down list in the Basic settings view. |
SAS Token | Specify the SAS token to grant limited access to objects in
your storage account. To enter the SAS token, click the [...] button next to the SAS token field, and then in the pop-up dialog box enter the password between double quotes and click OK to save the settings. This field is available when you select Azure from the Storage drop-down list in the Basic settings view. |
Advanced settings
Additional JDBC Parameters |
Specify additional connection properties for the database connection you are creating. The properties are separated by semicolon and each property is a key-value pair, for example, encryption=1;clientname=Talend. This field is available only when you select Use this Component from the Connection Component drop-down list and select Internal from the Storage drop-down list in the Basic settings view. |
Use Custom Snowflake Region |
Select this check box to specify a custom
Snowflake region. This option is available only when you select Use this Component from the Connection Component drop-down list in the
Basic settings view.
For more information on Snowflake Region ID, see Supported Regions. |
Login Timeout |
Specify the timeout period (in minutes) of Snowflake login attempts. An error will be generated if no response is received in this period. |
Role |
Enter, in double quotation marks, the default access control role to use to initiate the Snowflake session. This role must already exist and has been granted to the user ID you are using to connect to Snowflake. If this field is left empty, the PUBLIC role is automatically granted. For information about Snowflake access control model, see Understanding the Access Control Model. |
Allow Snowflake to convert columns and tables to uppercase |
Select this check box to convert lowercase in the defined table name and schema column names to uppercase. Note that unquoted identifiers should match the Snowflake Identifier Syntax. If you deselect the check box, all identifiers are automatically quoted. This property is not available when you select the Manual Query check box. For more information on the Snowflake Identifier Syntax, see Identifier Syntax. |
Temporary Table Schema | Specifies a schema for the temporary table. The schema must exist. |
Custom DB Type | Select this check box to specify the DB type for each
column in the schema. This property is available only when you select an action with Create Table from the Table Action drop down list in the Basic settings. |
Delete Storage Files On Success | Delete all the files in your storage folder once the
Job is running successfully. This field is not available when you select Use Custom Storage Location. |
S3 Max Error Retry |
Specify the maximum data loading retries when an error occurs during loading data to or from the S3 folder. This parameter defaults to 3. A value of -1 specifies the maximum possible retries. Only -1 or positive integers are accepted. This field is available when you select S3 from the Storage drop-down list in the Basic settings view. |
Azure Max Error Retry |
Specify the maximum data loading retries when an error occurs during loading data to or from the Azure folder. This parameter defaults to 3. A value of -1 specifies the maximum possible retries. Only -1 or positive integers are accepted. This field is available when you select Azure from the Storage drop-down list in the Basic settings view. |
Use Custom S3 Connection Configuration | Select this check box if you wish to use your custom
S3 configuration. Option: select the parameter from the list. Value: enter the parameter value. This field is available when you select S3 from the Storage drop-down list in the Basic settings view. |
Use Custom Stage Prefix |
Select this check box to specify the path to the folder (with the current stage as the root) from which the data is loaded. You need also to enter the path to the folder in the field provided. For example, to load data stored in the files that are located in myfolder1/myfolder2 under the stage, you need to type "@~/myfolder1/myfolder2" in the field. This field is available when you select Internal from the Storage drop-down list in the Basic settings view. Once selected, the Stage Folder in Basic settings view becomes unavailable. |
Use Custom Storage Location | Select this check box to connect to a custom external storage, for example, S3. |
Copy Command Options | Set parameters for the COPY INTO command by selecting
the following options from the drop-down list. The COPY INTO command is
provided by Snowflake. It loads data to a Snowflake database table.
|
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the Job level as well as at each component level. |
Global Variables
NB_LINE |
The number of rows processed. This is an After variable and it returns an integer. |
NB_SUCCESS |
The number of rows successfully processed. This is an After variable and it returns an integer. |
NB_REJECT |
The number of rows rejected. This is an After variable and it returns an integer. |
ERROR_MESSAGE |
The error message generated by the component when an error occurs. This is an After variable and it returns a string. |
Usage
Usage rule |
This component can be used as a standalone component in a Job or a subJob. |