Version: 24.3.x

Getting Started with Kubernetes

You can follow these instructions to install Dremio in a Kubernetes cluster running in an on-premises environment.

If you are upgrading from the previous Helm chart for Dremio, please see the Migrating Helm Chart Versions documentation.

Prerequisites

  • Ensure that you have an existing Kubernetes cluster.
  • Ensure that Helm 3 is set up on a local machine.
  • Ensure that a local kubectl is configured to access your Kubernetes cluster.
  • Ensure that you have cloud storage available (required for 21.0.0+).

Procedure

  1. Download the dremio-cloud-tools repository.

  2. In a terminal window, change to the dremio-cloud-tools/charts/dremio_v2/ directory.

  3. Review the default values in the file values.yaml, which configures the Dremio installation. To override any of these values, create a file with the .yaml extension in this directory (later steps on this page refer to it as values.local.yaml), copy into it the keys for which you want non-default values, and set those values there. Keeping your changes in a separate file lets you move to a newer version of the chart quickly, because you can copy the file across Helm chart updates. Refer to values.yaml Reference for details about the settings.

  4. Review Important Setup Considerations and make any of the listed changes to the values in your values.local.yaml file that you think are necessary for your environment.

  5. Install the Helm chart by running one of these commands from the charts directory (the parent of the dremio_v2 directory):

    • If you are overriding any of the default values that are in the values.yaml file, run this command:

      $ helm install <release-name> dremio_v2 -f <file>

      where <file> is the path to the file that you are using to override values.

    • If you are not overriding any of the values in the values.yaml file, run this command:

      $ helm install <release-name> dremio_v2

    If the installation takes longer than a few minutes to complete, you can check the status of the installation by using the following command:

    $ kubectl get pods

    If a pod remains in the Pending state for more than a few minutes, run the following command and check its output for issues, such as insufficient resources for scheduling:

    $ kubectl describe pods <pod-name>

    If the events at the bottom of the output mention insufficient CPU or memory, either adjust the values in your values.local.yaml and restart the process or add more resources to your Kubernetes cluster.

    When all of the pods are in the Ready state, the installation is complete.
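
The override file and the pod check from the procedure above can be sketched as a short script. This is an illustration under stated assumptions, not the chart's own tooling: the release name "dremio" is made up, pending_pods is a hypothetical helper, and the override file contains only the three port settings that this page names, shown with the port numbers this page uses. The script prints the helm command instead of running it so that you can review it first.

```shell
#!/bin/sh
# Sketch only. The file name values.local.yaml matches the name used on this
# page; the release name "dremio" is an assumption.
release="${RELEASE:-dremio}"
override="${OVERRIDE:-values.local.yaml}"

# Override file holding only the keys to change. These are the three port
# settings named on this page, with the port numbers this page uses.
cat > "$override" <<'EOF'
coordinator:
  web:
    port: 9047
  client:
    port: 31010
  flight:
    port: 32010
EOF

# Print the install command for review instead of running it.
echo "helm install $release dremio_v2 -f $override"

# pending_pods (hypothetical helper) reads `kubectl get pods` output on stdin
# and prints the name of every pod whose STATUS column is Pending.
pending_pods() {
  awk 'NR > 1 && $3 == "Pending" { print $1 }'
}
```

After installing, `kubectl get pods | pending_pods` lists the pods to inspect with kubectl describe pods <pod-name>.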

Next Steps

Now that you've installed the Dremio Helm chart, you can get the addresses for connecting to the Dremio UI, for connecting to Dremio from BI tools via JDBC/ODBC, and for connecting to Dremio from BI tools via Apache Arrow Flight.

Getting the HTTP address for connecting to the Dremio UI

Run the following command to use the service dremio-client in Kubernetes to find the host for the Dremio UI:

$ kubectl get services dremio-client

  • If the value in the TYPE column of the output is LoadBalancer, access the Dremio UI through the address in the EXTERNAL-IP column and port 9047. For example, in the output below, the value in the EXTERNAL-IP column is 8.8.8.8, so the Dremio UI is available on port 9047 at that address: http://8.8.8.8:9047

    $ kubectl get services dremio-client
    NAME            TYPE           CLUSTER-IP      EXTERNAL-IP   PORT(S)                          AGE
    dremio-client   LoadBalancer   10.99.227.180   8.8.8.8       31010:32260/TCP,9047:30620/TCP   2d

    If you want to change the exposed port on the load balancer, change the value of the setting coordinator.web.port in the file values.local.yaml.

  • If the value in the TYPE column of the output is NodePort, access the Dremio UI through the node port that the PORT(S) column maps to port 9047, for example http://localhost:30670.
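
The two cases above can be sketched as a small filter over the kubectl output. ui_url is a hypothetical helper, not part of kubectl or the Dremio chart. It assumes the column order shown in the sample output, and for NodePort services it assumes that a node is reachable as localhost.

```shell
#!/bin/sh
# ui_url (hypothetical helper) reads `kubectl get services dremio-client`
# output on stdin and prints the address of the Dremio UI.
ui_url() {
  awk -v port=9047 '
    $1 == "dremio-client" {
      if ($2 == "LoadBalancer" && $4 !~ /^</) {
        # LoadBalancer: external address plus the service port.
        # Placeholders such as <pending> are skipped.
        print "http://" $4 ":" port
      } else if ($2 == "NodePort") {
        # NodePort: find the node port paired with the service port
        # in the PORT(S) column, which has the form port:nodePort/protocol.
        n = split($5, pairs, ",")
        for (i = 1; i <= n; i++) {
          split(pairs[i], p, /[:\/]/)
          if (p[1] == port) print "http://localhost:" p[2]
        }
      }
    }'
}
```

Usage: `kubectl get services dremio-client | ui_url`. The header row is ignored because its first column is not dremio-client.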

Getting the HTTP address for using ODBC or JDBC to connect from BI tools to Dremio

Run the following command to use the service dremio-client in Kubernetes to find the host for JDBC/ODBC connections:

$ kubectl get services dremio-client

  • If the value in the TYPE column of the output is LoadBalancer, access Dremio using JDBC/ODBC through the address in the EXTERNAL-IP column and port 31010. For example, in the output below, the value in the EXTERNAL-IP column is 8.8.8.8, so JDBC/ODBC clients can connect on port 31010 at that address: 8.8.8.8:31010

    $ kubectl get services dremio-client
    NAME            TYPE           CLUSTER-IP      EXTERNAL-IP   PORT(S)                          AGE
    dremio-client   LoadBalancer   10.99.227.180   8.8.8.8       31010:32260/TCP,9047:30620/TCP   2d

    If you want to change the exposed port on the load balancer, change the value of the setting coordinator.client.port in the file values.local.yaml.

  • If the value in the TYPE column of the output is NodePort, access Dremio using JDBC/ODBC through the node port that the PORT(S) column maps to port 31010, for example localhost:32390.
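
Once you know the host and port, a JDBC client needs them in a connection string. The sketch below assumes the direct-connection URL form jdbc:dremio:direct=<host>:<port> used by Dremio's JDBC driver, which this page does not state, so check it against the driver documentation for your version. jdbc_url is a hypothetical helper.

```shell
#!/bin/sh
# jdbc_url (hypothetical helper) prints a JDBC connection string for HOST and
# an optional PORT. The port defaults to 31010, the client port on this page.
# The URL form jdbc:dremio:direct=<host>:<port> is an assumption.
jdbc_url() {
  host="$1"
  port="${2:-31010}"
  printf 'jdbc:dremio:direct=%s:%s\n' "$host" "$port"
}
```

For the LoadBalancer example above, `jdbc_url 8.8.8.8` prints a string for port 31010. For a NodePort service, pass the node port as the second argument, for example `jdbc_url localhost 32390`.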

Getting the HTTP address for using Apache Arrow Flight to connect from BI tools to Dremio

Run the following command to use the service dremio-client in Kubernetes to find the host for Apache Arrow Flight connections:

$ kubectl get services dremio-client

  • If the value in the TYPE column of the output is LoadBalancer, access Dremio using Flight through the address in the EXTERNAL-IP column and port 32010.

    For example, in the output below, the value in the EXTERNAL-IP column is 8.8.8.8, so Flight clients can connect on port 32010 at that address: 8.8.8.8:32010

    $ kubectl get services dremio-client
    NAME            TYPE           CLUSTER-IP      EXTERNAL-IP   PORT(S)                          AGE
    dremio-client   LoadBalancer   10.99.227.180   8.8.8.8       31010:32260/TCP,9047:30620/TCP   2d

    If you want to change the exposed port on the load balancer, change the value of the setting coordinator.flight.port in the file values.local.yaml.

  • If the value in the TYPE column of the output is NodePort, access Dremio using Flight through the node port that the PORT(S) column maps to port 32010, for example localhost:31357.

Upgrading ZooKeeper

To upgrade to the official ZooKeeper image, set the following tags in the values.local.yaml file.

zookeeper:
  image: zookeeper
  imageTag: 3.8.0
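
This page does not say how to apply the change to a running release. One common approach with Helm is to run helm upgrade with the same override file. The sketch below first checks that the file actually sets the tag. has_zookeeper_tag is a hypothetical helper, and the release name "dremio" in the usage comment is an assumption.

```shell
#!/bin/sh
# has_zookeeper_tag FILE TAG (hypothetical helper) succeeds only if FILE sets
# imageTag to TAG directly under a top-level "zookeeper:" key.
has_zookeeper_tag() {
  awk -v want="$2" '
    /^zookeeper:/ { in_zk = 1; next }   # entering the zookeeper block
    /^[^ #]/      { in_zk = 0 }         # any other top-level key ends it
    in_zk && $1 == "imageTag:" && $2 == want { found = 1 }
    END { exit !found }
  ' "$1"
}

# Usage, from the charts directory (release name "dremio" is an assumption):
#   has_zookeeper_tag values.local.yaml 3.8.0 &&
#     helm upgrade dremio dremio_v2 -f values.local.yaml
```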