Getting Started with Kubernetes
You can follow these instructions to install Dremio in a Kubernetes cluster running in an on-premises environment.
If you are upgrading from the previous Helm chart for Dremio, please see the Migrating Helm Chart Versions documentation.
Prerequisites
- Ensure that you have an existing Kubernetes cluster.
- Ensure that Helm 3 is set up on a local machine.
- Ensure that a local kubectl is configured to access your Kubernetes cluster.
- Ensure that you have cloud storage available (required for 21.0.0+), see here for more details.
Procedure
Download the
dremio-cloud-tools
repository.In a terminal window, change to the
dremio-cloud-tools/charts/dremio_v2/
directory.Review the default values in the file
values.yaml
, which configures the Dremio installation. If you want to override any of these values, create a file with the.yaml
extension in this directory, copy into this file the keys for which you want to set non-default values, and then set the values in the file. Making changes in this file allows you to quickly update to the latest version of the chart by copying the file across Helm chart updates. Refer tovalues.yaml
Reference for details about the settings.Review Important Setup Considerations and make any of the listed changes to the values in your
values.local.yaml
file that you think are necessary for your environment.Install the Helm Chart by running one of these commands from the
charts
directory:If you are overriding any of the default values that are in the
values.yaml
file, run this command:$ helm install <release-name> dremio_v2 -f <file>
where
<file>
is the name of the file that you are using to override values.If you are not overriding any of the values in the
values.yaml
file, run this command:$ helm install <release-name> dremio_v2
If the installation takes longer than a few minutes to complete, you can check the status of the installation by using the following command:
$ kubectl get pods
If a pod remains in Pending state for more than a few minutes, run the following command to view its status to check for issues, such as insufficient resources for scheduling:
$ kubectl describe pods <pod-name>
If the events at the bottom of the output mention insufficient CPU or memory, either adjust the values in your
values.local.yaml
and restart the process or add more resources to your Kubernetes cluster.When all of the pods are in the Ready state, the installation is complete.
Next Steps
Now that you've installed the Dremio Helm chart, you can get the HTTP addresses for connecting to Dremio's UI, connecting to Dremio from BI tools via JDBC/ODBC, and for connecting to Dremio from BI tools via Apache Arrow Flight.
Getting the HTTP address for connecting to the Dremio UI
Run the following command to use the service dremio-client
in Kubernetes to find the host for the Dremio UI:
$ kubectl get services dremio-client
If the value in the
TYPE
column of the output isLoadBalancer
, access the Dremio UI through the address in theEXTERNAL_IP
column and port 9047. For example, in the output below, the value under theEXTERNAL-IP
column is 8.8.8.8. Therefore, you can get to the Dremio UI via port 9047 on that address: http://8.8.8.8:9047$ kubectl get services dremio-client
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
dremio-client LoadBalancer 10.99.227.180 8.8.8.8 31010:32260/TCP,9047:30620/TCP 2dIf you want to change the expose port on the load balancer, change the value of the setting
coordinator.web.port
in the filevalues.local.yaml
.If the value in the TYPE column of the output is
NodePort
, access the Dremio UI through http://localhost:30670.
Getting the HTTP address for using ODBC or JDBC to connect from BI tools to Dremio
Run the following command to use the service dremio-client
in Kubernetes to find the host for JDBC/ODBC connections by using the following command:
$ kubectl get services dremio-client
If the value in the TYPE column of the output is
LoadBalancer
, access Dremio using JDBC/ODBC through the address in theEXTERNAL_IP
column and port 31010. For example, in the output below, the value under theEXTERNAL-IP
column is 8.8.8.8. Therefore, you can get to the Dremio UI via port 9047 on that address: http://8.8.8.8:9047$ kubectl get services dremio-client
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
dremio-client LoadBalancer 10.99.227.180 8.8.8.8 31010:32260/TCP,9047:30620/TCP 2dIf you want to change the expose port on the load balancer, change the value of the setting
coordinator.client.port
in the filevalues.local.yaml
.If the value in the TYPE column of the output is
NodePort
, access Dremio using JDBC/ODBC through http://localhost:32390.
Getting the HTTP address for using Apache Arrow Flight to connect from BI tools to Dremio
Run the following command to use the service dremio-client
in Kubernetes to find the host for Apache Arrow Flight connections by using the following command:
$ kubectl get services dremio-client
If the value in the TYPE column of the output is
LoadBalancer
, access Dremio using Flight through the address in theEXTERNAL_IP
column and port 32010.For example, in the output below, the value under the
EXTERNAL-IP
column is 8.8.8.8. Therefore, you can get to the Dremio UI via port 9047 on that address: http://8.8.8.8:9047$ kubectl get services dremio-client
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
dremio-client LoadBalancer 10.99.227.180 8.8.8.8 31010:32260/TCP,9047:30620/TCP 2dIf you want to change the expose port on the load balancer, change the value of the setting
coordinator.flight.port
in the filevalues.local.yaml
.If the value in the TYPE column of the output is
NodePort
, access Dremio using Flight through http://localhost:31357.
Upgrading Zookeeper
To upgrade to the official Zookeeper image set the following tags in values.local.yaml
file.
zookeeper:
image: zookeeper
imageTag: 3.8.0