Configuring High Availability
This topic describes how to configure for Dremio high availability.
To configure for Dremio HA, ensure the prerequisites are met and then perform the following:
- Configure Dremio including Dremio services, external metadata storage, and external zookeeper quorum.
- Start up the Dremio coordinator nodes.
- Start up the Dremio executor nodes.
Prerequisites
- Network drive (NFS) with locking support for Dremio's metadata store.
- Ensure that the store is high-speed, low latency (for spilling operations purposes).
- Ensure that all Dremio coordinator nodes have read/write access to the shared network drive.
- Ensure that the guidelines of the shared network drive are followed for consistent synchronous writes.
- External Zookeeper
- (Optional) nginx for Linux Web Application HA & Load Balancing
Step 1: Configure Dremio
To configure Dremio, modify the dremio.conf file on all of the coordinator and executor nodes in the Dremio cluster.
The following must be configured:
- Dremio services, with
- Two (2) or more coordinator nodes with the master-coordinator role.
- One (1) or more nodes with the executor role (preferably 3 or more).
- External Metadata store - A network drive (NFS) with locking support specified with the
paths.local
property. This property must be set on all Dremio coordinator nodes. - External Zookeeper(s) - One or more external Zookeeper quorum specified with the
zookeeper
property. This property must be set on all Dremio nodes.