Skip to main content

Object Storage

Object storage provides scalable, cost-effective storage for data lakes. Dremio connects directly to your Amazon S3 bucket or Azure Storage to query data in place without moving or copying it.

Amazon S3

Dremio supports Amazon S3, which stores data in Amazon S3 buckets on AWS. Use data source credentials or project data credentials to enable Dremio to access Amazon S3 using the IAM role that is associated with your Dremio project.

Azure Storage

Dremio supports Azure Storage, which stores data in Azure Blob Storage and Azure Data Lake Storage Gen2.

Table Formatting

Dremio can query data in object storage across multiple formats:

  • Apache Iceberg – Open table format designed for petabyte-scale analytics
  • Delta Lake – Open-source storage framework that brings reliability and performance to data lakes
  • Delimited files (CSV, TSV, etc.) - Text files with configurable delimiters
  • JSON - Structured and semi-structured JSON data
  • Parquet - Columnar format optimized for analytics
  • Excel (XLSX, XLS) - Spreadsheet files

For information about formatting files and folders as tables, see Table Formatting.