Skip to main content

Object Storage

Object storage provides scalable, cost-effective storage for data lakes. Dremio connects directly to your Amazon S3 bucket or Azure Storage to query data in place without moving or copying it.

Amazon S3

Dremio connects to Amazon S3, which stores data in Amazon S3 buckets on AWS. Use an AWS role to enable Dremio to access Amazon S3.

Azure Storage

Dremio connects to Azure Storage, which stores data in Azure Blob Storage and Azure Data Lake Storage Gen2.

Format Tables

Dremio can query data in object storage across multiple formats:

  • Apache Iceberg – Open table format designed for petabyte-scale analytics
  • Delta Lake – Open-source storage framework that brings reliability and performance to data lakes
  • Delimited files (CSV, TSV, etc.) - Text files with configurable delimiters
  • JSON - Structured and semi-structured JSON data
  • Parquet - Columnar format optimized for analytics
  • Excel (XLSX, XLS) - Spreadsheet files

For information about formatting files and folders as tables, see Table Formatting.