Skip to main content
Version: current [26.x Preview]

Automated Maintenance with Dremio Catalog

Dremio Catalog automates maintenance tasks for data within the catalog to optimize query performance and minimize storage costs. Dremio Catalog currently supports automation for the following Iceberg maintenance tasks:

Enabling Automatic Optimization and Table Cleanup

Automatic Optimization

To enable automated optimization for the Dremio Catalog:

  1. In the Sources section in the bottom-left corner of the Datasets page, right-click on your Dremio Catalog source and click Settings.
  2. In the Source Settings dialog, select Advanced Options.
  3. Toggle Enable auto optimization.
  4. Click Save.

Automatic Table Cleanup

To enable an automated vacuum for Dremio Catalog:

  1. In the Sources section in the bottom-left corner of the Datasets page, right-click on your Dremio Catalog source and click Settings.
  2. In the Source Settings dialog, select Advanced Options.
  3. Toggle Enable table clean up.
  4. Click Save.

Table-Level Configuration

To enable/disable automatic optimization and cleanup at the table level within Dremio Catalog:

  1. Locate the desired table in the Dremio Catalog.
  2. Right-click on the table name and click This is the icon that represents more actions. to open the table settings.
  3. In the Table Settings dialog, select Table Maintenance from the settings sidebar.
  4. Toggle the relevant settings—Enable automatic table maintenance and/or Enable table cleanup.

Customization

The following support keys are used to configure frequency and behavior for automatic maintenance operations:

  • dremio.optimization.auto.optimize.period.hours - controls how often automatic optimization should run. Defaults to 3 hours.
  • dremio.optimization.auto.vacuum.period.hours - controls how often table cleanup should run. Defaults to 24 hours.
  • dremio.optimization.auto.maintenance.rate_limit.batch_size - controls the maximim number of concurrent maintenance queries. Defaults to 10.