Microsoft Azure Power BI Service (Repository) - Import - 7.1

Talend Data Catalog Bridges

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Data Catalog

Bridge Requirements

This bridge:
  • requires Internet access to https://repo.maven.apache.org/maven2/ and/or other tool sites to download drivers into <TDC_HOME>/data/download/MIMB/. For more information on how to retrieve third-party drivers when the TDC server cannot access the Internet, see this article.

Bridge Specifications

Vendor Microsoft
Tool Name Azure Power BI
Tool Version 2.x
Tool Web Site https://powerbi.microsoft.com/
Supported Methodology [Business Intelligence] Multi-Model, Metadata Repository, Data Store (Physical Data Model, OLAP Dimensional Model, Stored Procedure Expression Parsing), BI Report (Relational Source, Dimensional Source, Expression Parsing, Report Structure) via Java REST API
Data Profiling
Multi-Model Harvesting
Remote Repository Browsing for Model Selection
Incremental Harvesting

SPECIFICATIONS
Tool: Microsoft / Azure Power BI version 2.x via Java REST API
See https://powerbi.microsoft.com/
Metadata: [Business Intelligence] Multi-Model, Metadata Repository, Data Store (Physical Data Model, OLAP Dimensional Model, Stored Procedure Expression Parsing), BI Report (Relational Source, Dimensional Source, Expression Parsing, Report Structure)
Component: MicrosoftAzurePowerBI version 11.0.0

DISCLAIMER
This bridge requires internet access to https://repo.maven.apache.org/maven2/ (and exceptionally a few other tool sites)
in order to download the necessary third party software libraries into $HOME/data/download/MIMB/
- If https fails, the bridge then tries with http.
- If a proxy is used to access internet, you must configure that proxy in the JRE (see the -j option in the Miscellaneous parameter).
- If the bridge does not have access to internet, that directory can be copied from another server with internet access.

By running this bridge, you hereby acknowledge responsibility for the license terms and any potential security vulnerabilities from these downloaded third party software libraries.

OVERVIEW
This bridge imports Business Intelligence (BI) reporting metadata from Microsoft Power BI service hosted on the Microsoft Azure cloud.

REQUIREMENTS

When connecting to the Power BI service hosted in Microsoft Azure cloud, the bridge uses Azure ActiveDirectory authentication.

The following Azure configuration steps are required:
- Connect to the Azure management console: https://manage.windowsazure.com
- Create or open the Azure Active Directory which corresponds to your organization.
- On the Applications page, create an application named 'MIMB' of type 'Native Client', and write down its Client ID (Application ID).
- Make sure to add permission to the Power BI Service application, and grant necessary permissions.

The bridge will use the Client ID (Application ID) information to connect to Azure, and obtain the authentication token.
For more details:
https://docs.microsoft.com/en-us/azure/active-directory/develop/active-directory-integrating-applications

Alternatively, it is possible to register the application with Azure Active Directory using this page:
https://dev.powerbi.com/apps
Note that it may be necessary to grant consent for the specified permissions using the Azure management console.

There are two ways to authenticate against Azure ActiveDirectory:
- As a regular user: the login user is usually in the form of an email address. E.g.
UserName@DirectoryName.onmicrosoft.com
- As a Service Principal: the user name parameter should be left empty, and the Tenant ID must be provided.

FREQUENTLY ASKED QUESTIONS

LIMITATIONS
Refer to the current general known limitations at http://metaintegration.com/Products/MIMB/MIMBKnownLimitations.html or bundled in Documentation/ReadMe/MIMBKnownLimitations.html
PowerBI supports 6 types of datasets: 'Push', 'Streaming', 'PushStreaming', 'AzureAS', 'AsOnPrem' and dataset from uploaded PBIX files. The structure of datasets (in term of tables and columns) is exposed only for 'Push' and 'PushStreaming' dataset types, due to Api limitations.

The PowerBI PBIX file format currently has no public Sdk or Api to allow inspecting the structure of the report, the tables it contains, and detailed lineage to sources. The PowerBI Rest Api only allow retrieving the name(s) of data connection(s) used by the report. Lineage metadata is only available at the model level or connection level, not at the table or column level.

SUPPORT
Provide a trouble shooting package with:
- the debug log (can be set in the UI or in conf/conf.properties with MIR_LOG_LEVEL=6)
- the metadata backup if available (can be set in the Miscellaneous parameter with option -backup)


Bridge Parameters

Parameter Name Description Type Values Default Scope
Login User The username which the bridge will use to log in.
This user name must have the necessary permissions to access the objects you wish to import.

In case of Microsoft Azure PowerBI, there are two ways to authenticate against Azure ActiveDirectory:
- As a regular user: the login user is usually in the form of an email address. E.g.
UserName@DirectoryName.onmicrosoft.com
- As a Service Principal: the user name parameter should be left empty, and the Tenant ID must be provided.
STRING   Administrator  
Login Password Enter the password associated with the username which the bridge will use to log in.

In case of Microsoft Azure PowerBI, there are two ways to authenticate against Azure ActiveDirectory:
- As a regular user: the login user is usually in the form of an email address. E.g.
UserName@DirectoryName.onmicrosoft.com
- As a Service Principal: the password parameter corresponds to the Client Secret.
PASSWORD      
Client ID Enter the Microsoft Azure Client ID for the application. STRING     Mandatory
Tenant ID Enter the Microsoft Azure ActiveDirectory Tenant ID.

In case of Microsoft Azure PowerBI, there are two ways to authenticate against Azure ActiveDirectory:
- As a regular user: the login user is usually in the form of an email address. E.g.
UserName@DirectoryName.onmicrosoft.com
- As a Service Principal: the user name parameter should be left empty, and the Tenant ID must be provided.
STRING      
Offline metadata directory In order to facilitate testing and reproducing the PowerBI metadata environment, when that environment is not installed locally, this parameter allows importing metadata from files previously downloaded from the PowerBI server. Specify in this parameter the directory path where the downloaded files are located. No connection to the PowerBI server is needed in this case, the usual connection parameters are ignored. DIRECTORY      
Miscellaneous Specify miscellaneous options identified with a -option followed by a value if required:

GENERAL OPTIONS
-m <Java Memory's maximum size>
1G by default on 64bits JRE or as set in conf/conf.properties, e.g.
-m 8G
-m 2500M

-j <Java Runtime Environment command line options>
This option must be the last one in the Miscellaneous parameter as all the text after -j is passed "as is" to the JRE, e.g.
-j -Dname=value -Xms1G
The following option must be set when a proxy is used to access internet (this is critical to access https://repo.maven.apache.org/maven2/ (and exceptionally a few other tool sites) in order to download the necessary third party software libraries.
-j -Dhttp.proxyHost=127.0.0.1 -Dhttp.proxyPort=3128 -Dhttps.proxyHost=127.0.0.1 -Dhttps.proxyPort=3128 -Dhttp.proxyUser=user -Dhttp.proxyPassword=pass -Dhttps.proxyUser=user -Dhttps.proxyPassword=pass

-jre <Java Runtime Environment full path name>
It can be an absolute path to javaw.exe on Windows or a link/script path on Linux, e.g.
-jre "c:\Program Files\Java\jre1.8.0_211\bin\javaw.exe"

-v <Environment variable value>
None by default, e.g.
-v var1=value1 -v var2="value2 with spaces"

-model.name <model name>
Override the model name, e.g.
-model.name "My Model Name"

-prescript <script name>
The script must be located in the bin directory, and have .bat or .sh extension.
The script path must not include any parent directory symbol (..).
The script should return exit code 0 to indicate success, or another value to indicate failure.
For example:
-prescript \"script.bat\"

-cache.clear
Clears the cache before the import, and therefore will run a full import without incremental harvesting.
Warning: this is a system option managed by the application calling the bridge and should not be set by users.

-backup <directory>
Full path of an empty directory to save the metadata input files for further troubleshooting.

MICROSOFT POWERBI OPTIONS
-c <list of Connection database types>
Sets a data store database type to any generic data connections (e.g. based on ODBC, OLEDB), in order to recognize the SQL syntax of that database to calculate lineage.
The list of connection database types must be specified as semi-column (;) separated value pairs: <Connection name>=<Database type> e.g.
-c Connection1=ORACLE;Connection2=TERADATA;
The list of database type supporting SQL parsing is:
ACCESS
DB2/UDB
HIVE
MICROSOFT SQL SERVER
MYSQL
NETEZZA
ORACLE
POSTGRESQL
SYBASE
TERADATA

-columns.notpropagated
Do not propagate the columns discovered while parsing PowerQuery M script steps back to the source tables/files.
STRING      

 

Bridge Mapping

Mapping information is not available