Creating Dataflows by Using the Dremio Software Connector
Follow these steps if you do not want to enable SSO for reports that will be based on datasets that are created from your dataflows.
- In a workspace in Power BI Service, click New > Dataflow. NOTE: Ensure that you are not using "My workspace". You cannot create dataflows in your personal workspace.
- Under Define new tables, click Add new tables.
- On the Choose data source page, search for "Dremio" and select Dremio Software.
- Under Connection Settings, follow these steps:
- In the Server field, specify a hostname or IP address of a Dremio Software cluster. If your cluster uses a port other than 31010 for connections from client applications, specify the non-default port number, like this:
<hostname-or-IP-address>:<port-number>
- In the Encryption field, select Enabled if your cluster has TLS enabled or Disabled if your cluster does not have TLS enabled.
- (Optional) In the Engine field, specify the name of the engine on which to run queries that use this connection. If you do not specify an engine, one is selected automatically according to engine routing rules set up for the cluster.
- (Optional) In the Tag field, specify a tag that you can use together with rules for routing queries. See Workload Management for more information.
- (Optional) In the Queue field, specify the name of a queue that you want to route queries to while this connection is open. See Workload Management for more information.
- Under Connection credentials, follow these steps:
- In the Connection field, follow either of these steps:
- If you are creating the first connection in this workspace to your cluster, select Create new connection.
- If you or someone else has already created a connection to your cluster, select the connection that you want the dataflow to use or select Create new connection. If you select an existing connection, skip to step 6 (Click Next in the bottom-right corner).
- In the Data gateway field, follow either of these steps:
- If you want the dataflow to connect to your cluster through a Power BI gateway, select the gateway that is already configured to support this.
NOTE: You must connect to your Dremio cluster through a gateway if the cluster is behind a firewall.
- If you want the dataflow to connect to your cluster directly, select (none).
- In the Authentication kind field, follow either of these steps:
- Select Username/Password, and then specify the username and password to use for authenticating to your cluster.
- Select Microsoft Account and then click Sign in.
- Click Next in the bottom-right corner.
- Select the data from your cluster that you want to use for the dataflow, and then click Transform data.
- Apply any necessary transformations to the data, and then click Save & close.
- Name the dataflow and save it, and optionally set a refresh schedule for it.
- Return to the main page of the workspace in which you created the dataflow.
- Specify the credentials for datasets based on this dataflow to use when connecting to your cluster:
- Open the settings for the dataflow.
- Expand the Data source credentials section and click Edit credentials.
- In the configure dialog, follow either of these steps:
- If you want reports to authenticate to your cluster by using a username and password, follow these steps:
- In the Authentication method field, select Basic.
- In the Username and Password fields, specify the username and password to use.
- Click Sign in.
- If you want reports to authenticate to your cluster by using a Microsoft account, follow these steps:
- Select OAuth2 in the Authentication method field.
- Click Sign in.
- Ensure that your workspace is shared with everyone who needs to consume the dataflow.
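
The Server field rule in the Connection Settings step (enter the hostname or IP address alone when the cluster uses port 31010, otherwise append the port) can be sketched as a small helper. This is only an illustration for preparing the value you type into the field. The function name `server_field_value` and the example hostnames are hypothetical and are not part of Dremio or Power BI.

```python
# Default port for client connections, as stated in the Server field step.
DEFAULT_DREMIO_PORT = 31010


def server_field_value(host: str, port: int = DEFAULT_DREMIO_PORT) -> str:
    """Return the text to enter in the Server field (hypothetical helper).

    The port is appended as <hostname-or-IP-address>:<port-number> only
    when it differs from the default.
    """
    host = host.strip()
    if not host:
        raise ValueError("host must not be empty")
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    if port == DEFAULT_DREMIO_PORT:
        return host
    return f"{host}:{port}"


if __name__ == "__main__":
    # Example hostnames are placeholders, not real clusters.
    print(server_field_value("dremio.example.com"))
    print(server_field_value("dremio.example.com", 32010))
```

With the default port, the helper returns the hostname unchanged. With any other port, it returns the `<hostname-or-IP-address>:<port-number>` form shown in the steps.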