Version: 24.3.x

Creating Dataflows by Using the Dremio Cloud Connector

Follow these steps to enable single sign-on (SSO) for reports that are based on datasets created from your dataflows.

Ensure that the Dremio Software cluster that you connect to by using the Dremio Cloud connector meets these requirements:

  • TLS must be enabled.
  • Azure Active Directory (Azure AD) authentication must be enabled.
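
One way to spot-check the TLS requirement before you start is to attempt a TLS handshake against the cluster. The following is a minimal sketch, not an official Dremio tool. The function name `negotiated_tls_version` and the example hostname are hypothetical, and the sketch assumes that the cluster presents a certificate that the local trust store accepts. A cluster with a self-signed certificate fails the check even though TLS is enabled. The sketch does not verify that Azure AD authentication is enabled.

```python
import socket
import ssl
from typing import Optional


def negotiated_tls_version(host: str, port: int = 443, timeout: float = 5.0) -> Optional[str]:
    """Return the negotiated TLS version (for example "TLSv1.3"), or None.

    None means that the TCP connection or the TLS handshake failed, which
    includes certificate verification failures.
    """
    # Default context: verifies the server certificate and the hostname.
    context = ssl.create_default_context()
    try:
        with socket.create_connection((host, port), timeout=timeout) as raw_sock:
            with context.wrap_socket(raw_sock, server_hostname=host) as tls_sock:
                return tls_sock.version()
    except OSError:
        # ssl.SSLError and socket timeouts are subclasses of OSError.
        return None


if __name__ == "__main__":
    # Hypothetical hostname; replace it with the address of your cluster.
    print(negotiated_tls_version("dremio.example.com", 443))
```

A return value of `None` only indicates that the handshake did not complete from the machine that ran the check. It does not distinguish between a cluster without TLS, a blocked port, and an untrusted certificate.
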
To create the dataflow, follow these steps:

  1. In a workspace in Power BI Service, click New > Dataflow.

    Note: Ensure that you are not using "My workspace". You cannot create dataflows in your personal workspace.

  2. Under Define new tables, click Add new tables.

  3. On the Choose data source page, search for "Dremio" and select Dremio Cloud.

  4. Under Connection Settings, follow these steps:

    1. In the Server field, specify the hostname or IP address of a Dremio Software cluster that has TLS enabled. If your cluster uses a port other than 443 for connections from client applications, append the port number to the address, like this: <hostname-or-IP-address>:<port-number>.
    2. Leave the Project field empty.
    3. (Optional) In the Engine field, specify the name of the engine on which to run queries that use this connection. If you do not specify an engine, one is selected automatically according to engine routing rules set up for the cluster.
    4. (Optional) In the Tag field, specify a tag that you can use together with rules for routing queries. See Workload Management for more information.
    5. (Optional) In the Queue field, specify the name of a queue that you want to route queries to while this connection is open. See Workload Management for more information.
  5. Under Connection credentials, follow these steps:

    1. In the Connection field, follow either of these steps:
      • If you are creating the first connection in this workspace to your cluster, select Create new connection.
      • If you or someone else has created a connection to your cluster already, select the connection that you want the dataflow to use or select Create new connection. If you select an existing connection, skip to step 6.
    2. In the Data gateway field, follow either of these steps:
      • If you want the dataflow to connect to your Dremio cluster through a Power BI gateway, select the gateway that is already configured to support this.
      Note: You must connect to your Dremio cluster through a gateway if the cluster is behind a firewall.

      • If you want the dataflow to connect to your cluster directly, select (none).
    3. In the Authentication kind field, select Microsoft Account and then click Sign in.
  6. Click Next in the bottom-right corner.

  7. Select the data from your cluster that you want to use for the dataflow, and then click Transform data.

  8. Apply any necessary transformations to the data, and then click Save & close.

  9. Name the dataflow and save it, and optionally set a refresh schedule for it.

  10. Return to the main page of the workspace in which you created the dataflow.

  11. Specify the credentials for datasets based on this dataflow to use when connecting to your cluster:

    1. Open the settings for the dataflow.
    2. Expand the Data source credentials section and click Edit credentials.
    3. In the configure dialog, follow these steps:
      1. In the Authentication method field, select OAuth2.
      2. If the dataflow will be used for the creation of reports that, when published, will allow single sign-on (SSO), select the check box Report viewers can only access this data source with their own Power BI identities using DirectQuery.
      3. Click Sign in.
  12. Ensure that your workspace is shared with everyone who needs to consume the dataflow.
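
The rule for the Server field in step 4 (hostname or IP address alone when the cluster uses port 443, otherwise `<hostname-or-IP-address>:<port-number>`) can be sketched as a small helper. This is an illustrative sketch only. The function name `server_field_value` and the example hostname and port are hypothetical and are not part of Dremio or Power BI.

```python
DEFAULT_PORT = 443  # Port assumed when the Server field contains no port number.


def server_field_value(host: str, port: int = DEFAULT_PORT) -> str:
    """Build the text to enter in the Server field of the connector."""
    host = host.strip()
    if not host:
        raise ValueError("host must not be empty")
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    # The port is appended only when it differs from the default.
    if port == DEFAULT_PORT:
        return host
    return f"{host}:{port}"


if __name__ == "__main__":
    print(server_field_value("dremio.example.com"))        # default port
    print(server_field_value("dremio.example.com", 8443))  # non-default port
```

With the hypothetical values above, the first call yields `dremio.example.com` and the second yields `dremio.example.com:8443`.
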