Google Cloud Platform - GKE
Learn the deployment architecture, requirements, recommendations, and necessary steps for installing Dremio in a Kubernetes cluster on Google Kubernetes Engine (GKE).
Requirements
- GKE version 1.12.7 or later
- Kubernetes cluster on GKE (see Seting up a GKE Cluster for instructions)
- Worker node instance type (minimum): e2-highmem-16 (16 core, 128 GiB memory)
Setting up a GKE Cluster
To set up a Kubernetes cluster on GKE, use the Google console or CLI. To do it via Google console:
- Sign in to the Google cloud console at https://console.cloud.google.com/.
- Go to GKE page https://console.cloud.google.com/kubernetes/ and click on CREATE.
- Choose the standard option and proceed to the next screen.
- In the Node Pool section, click on the default pool and set the number of nodes to 5.
- In the Nodes section, select the machine type
e2-highmem-16
(16 vCPU, 128 GB memory). - Configure additional options as necessary and create the cluster.
- Connect to the cluster. See Using kubectl to interact with GKE for more information.
- Install Helm. See Helm Install for more information.
Deploying Dremio on the GKE Cluster
To deploy Dremio on the GKE Cluster, follow the steps in Deploy Dremio on Kubernetes.
High Availability
High availability is dependent on the Kubernetes infrastructure. For more information, see High Availability in Cluster Deployments.
Load Balancing
Load balancing distributes the workload between Dremio's web (UI and REST) client and ODBC/JDBC clients. All web and ODBC/JDBC clients connect to a single endpoint (load balancer) rather than directly to individual pods. These connections are then distributed across available coordinator (master-coordinator and secondary-coordinator) pods.