OpenStack Swift Object Store - Technical Preview - Import - 7.1

Talend Data Catalog Bridges

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Data Catalog

Bridge Specifications

Vendor OpenStack
Tool Name Swift Object Store
Tool Version 1.0
Tool Web Site https://www.openstack.org/software/releases/queens/components/swift
Supported Methodology [File System] Multi-Model, Data Store (NoSQL / Hierarchical) via REST API
Incremental Harvesting
Multi-Model Harvesting
Remote Repository Browsing for Model Selection
Data Profiling

BRIDGE INFORMATION
Import tool: OpenStack Swift Object Store 1.0 (https://www.openstack.org/software/releases/queens/components/swift)
Import interface: [File System] Multi-Model, Data Store (NoSQL / Hierarchical) via REST API from OpenStack Swift Object Store - New Beta Bridge
Import bridge: 'OpenStackSwift' 10.1.0

BRIDGE DOCUMENTATION
IMPORTING FROM OpenStackSwift Service.

This bridge supports the following file formats:
- Flat File (CSV)
- Open Office Excel (XSLX)
- COBOL Copybook
- JSON (JavaScript Object Notation)
- Apache Avro
- Apache Parquet
- Apache ORC
- W3C XML

as well as the compressed versions of the above formats:
- ZIP (as a compression format, not as archive format)
- BZIP
- GZIP
- LZ4
- Snappy (as standard Snappy format, not as Hadoop native Snappy format)

Please refer to the individual parameter's tool tips for more detailed examples.


Bridge Parameters

Parameter Name Description Type Values Default Scope
REST Endpoint: Your REST Endpoint to sign programmatic requests to the service. STRING      
Auth V1 Endpoint: Your Auth V1 Endpoint to authenticate the bridge. STRING      
User User name STRING      
Password Password PASSWORD      
Container CONTAINER name STRING      
Miscellaneous Specify miscellaneous options identified with a -letter and value.

For example, -m 4G -f 100 -j -Dname=value -Xms1G

-m the maximum Java memory size whole number (e.g. -m 4G or -m 2500M ).
-v set environment variable(s) (e.g. -v var1=value -v var2="value with spaces").
-j the last option that is followed by Java command line options (e.g. -j -Dname=value -Xms1G).
-hadoop key1=val1;key2=val2 to manualy set hadoop configuration options
-tps 10 maximum threads pool size
-tl 3600s processing time limit in s -seconds m - minutes or h hours;
-fl 1000 processing files count limit;
-delimited.top_rows_skip 1 number of rows to skip while processing csv files
-delimited.extra_separators ~,||,|~ comma separated extra delimiters each of which will be used while processing csv files
-delimited.no_header by default, bridge automatically tries to detect headers while processing csv files(basing on header columns types), use this option to disable headers import(f.e. to hide sensitive data)
-fresh.partition.models - use to import latest modified files when processing partitions defined in Partitioned directories parameter
-subst K: C:/test - use to associate a root path part with a drive or another path.
-skip.download - use to disable dependencies downloading and use only download cache
-prescript [cmd] - runs a script command before bridge execution. Example: -prescript \"script.bat\"
The script must be located in the bin directory, and have .bat or .sh extension.
The script path must not include any parent directory symbol (..)
The script should return exit code 0 to indicate success, or another value to indicate failure.
STRING      

 

Bridge Mapping

Meta Integration Repository (MIR)
Metamodel
(based on the OMG CWM standard)
"OpenStack Swift Object Store - New Beta Bridge"
Metamodel
File System (File)
Mapping Comments
     
Attribute Array Elementary Item, Field, Attribute, Array Field, Elementary Item, Partition Field  
Name Name  
Position Position  
Class Array Element, Group Item, Array Group Item, Array Object, Element, Object, Sheet  
Name Name  
PropertyElementTypeScope UDPs  
Name Name  
Scope Scope  
PropertyType UDP  
DataType Data Type  
DesignLevel Design Level  
Name Name  
Position Position  
StoreModel Cobol File, Parquet File, Delimited File, Avro File, Positional File, Json File, Collection, Orc File, Xml File, Excel File, File  
Name Name