Load Data
Dremio provides the following features to load data into Apache Iceberg tables:
-
Autoingest Data into Apache Iceberg - Use autoingest pipes to automatically ingest data into Iceberg tables.
-
Clustering - Cluster your Iceberg tables for a more intuitive data layout that enables both efficient reads and writes.
Process Unstructured Data
Dremio allows you to process and combine structured, semi-structured, and unstructured data. Examples of unstructured data include PDFs, images, and videos that are stored in object storage.
Dremio natively offers AI functions to enable you to extract and process unstructured data:
AI_GENERATE– Process unstructured data, primarily for complex data extraction requiring multiple fields from source files.AI_CLASSIFY– Categorize documents or analyze sentiment asVARCHARvalues, using a provided classification list.AI_COMPLETE– Generate text or create summaries asVARCHARvalues.
These functions are processed using Dremio's Query Engine and the AI model provider of your choice. For more information on how to configure your model provider, see Configure Model Providers.