Version: current [26.x]

Load Data

Dremio provides the following features to load data into Apache Iceberg tables:

  • Autoingest Data into Apache Iceberg - Use autoingest pipes to automatically ingest data into Iceberg tables.

  • Clustering - Cluster your Iceberg tables for a more intuitive data layout that supports efficient reads and writes.

Process Unstructured Data

Dremio allows you to process and combine structured, semi-structured, and unstructured data. Examples of unstructured data include PDFs, images, and videos that are stored in object storage.

Dremio provides native AI functions for extracting and processing unstructured data:

  • AI_GENERATE – Process unstructured data, primarily for complex extraction that pulls multiple fields from source files.
  • AI_CLASSIFY – Categorize documents or analyze sentiment as VARCHAR values, using a provided classification list.
  • AI_COMPLETE – Generate text or create summaries as VARCHAR values.

These functions run on Dremio's query engine and use the AI model provider of your choice. To configure a model provider, see Configure Model Providers.
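
As a rough illustration, the following sketch shows how two of the functions listed above might be called from a query. The table `reviews` and its columns `review_id` and `review_text` are hypothetical. The argument shapes (a prompt string, plus an array of categories for AI_CLASSIFY) are assumptions inferred from the descriptions above, not a confirmed signature, so check the function reference before relying on them. The sketch also assumes a model provider has already been configured.

```sql
-- Hypothetical table: reviews(review_id, review_text)
SELECT
  review_id,
  -- Assumed form: a prompt followed by the classification list;
  -- the result is a VARCHAR value per the description above
  AI_CLASSIFY(
    'Classify the sentiment of this review: ' || review_text,
    ARRAY['positive', 'negative', 'neutral']
  ) AS sentiment,
  -- Assumed form: a single prompt; the generated summary is a VARCHAR value
  AI_COMPLETE(
    'Summarize this review in one sentence: ' || review_text
  ) AS summary
FROM reviews
LIMIT 10;
```

Keeping the LIMIT small while experimenting is a sensible default, since each row likely results in a call to the configured model provider.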