
Generating Labels and Wikis (Preview)

Try out Dremio's Generative AI for managing data! This tutorial walks you through generating labels and wikis for your datasets, which minimizes the need for manual profiling and cataloging.

Prerequisites

Before you begin, sign up for Dremio Cloud and create your first project.

After you’ve signed up and created your first project, enable Generative AI. Then you can generate labels and a wiki!

Step 1: Enable Generative AI

Opting in to and enabling Generative AI gives you access to the Text-to-SQL experience, generated labels and wikis, and other Generative AI features as they become available.

To enable Generative AI:

  1. Go to Project Settings > Preferences.
  2. Toggle on the Generative AI preference.

Note

This feature set is in preview. Enabling it allows the entire Sonar project, and every user with access to the project, to use Generative AI features. These features have limited world knowledge and may occasionally generate incorrect information, harmful instructions, or biased content. For more information, see Data Privacy.

Now you can manage data using Generative AI!

Step 2: Generate Labels

To generate labels, Generative AI bases its understanding on your schema, taking into account labels that were previously generated and labels that other users created.

To generate labels:

  1. Navigate to the Datasets page by clicking the Datasets icon in the side navigation bar.

  2. In the Data panel on the left, locate Samples."samples.dremio.com"."NYC-taxi-trips-iceberg".

  3. Hover over the dataset until a metadata card appears and click Details Panel.

  4. In the Dataset Overview on the right, click the Generative AI icon to generate labels.

  5. In the Generating labels dialog, review the labels generated for the dataset.

  6. Click Save.

    The labels for the dataset will appear in the Dataset Overview.
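
The dataset path used in this step, Samples."samples.dremio.com"."NYC-taxi-trips-iceberg", wraps two of its three components in double quotes because they contain dots or hyphens. The sketch below shows one way to build such a path from its components. The helper name quote_dataset_path is hypothetical, and the rules it applies (quote anything that is not a plain identifier, double any embedded quote) are assumptions based on common SQL identifier conventions; it does not handle reserved words.

```python
import re

# Components made only of letters, digits, and underscores (and not starting
# with a digit) are left bare; anything else is wrapped in double quotes.
_BARE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def quote_dataset_path(parts):
    """Join path components into a dotted dataset reference (hypothetical helper)."""
    quoted = []
    for part in parts:
        if _BARE.fullmatch(part):
            quoted.append(part)
        else:
            # Assumption: embedded double quotes are escaped by doubling them.
            quoted.append('"' + part.replace('"', '""') + '"')
    return ".".join(quoted)


print(quote_dataset_path(["Samples", "samples.dremio.com", "NYC-taxi-trips-iceberg"]))
# prints: Samples."samples.dremio.com"."NYC-taxi-trips-iceberg"
```

The same quoted form is what you would type to reference the dataset in a query.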

Step 3: Generate a Wiki

To generate a wiki, Generative AI draws on both your schema and your data. Because it can determine how the columns in a dataset relate to each other and to the dataset as a whole, it can produce descriptions of your datasets.

To generate a wiki:

  1. On the Dataset page for Samples."samples.dremio.com"."NYC-taxi-trips-iceberg", click Generate wiki in the Wiki section. A dialog opens, and a preview of the wiki content is generated on the right side of the dialog.

  2. Click to copy the generated wiki content on the right of the dialog.

  3. Click within the text box on the left and paste the wiki content.

  4. Click Save.

    The wiki for the dataset will appear in the Wiki section.
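
The tutorial saves labels and wikis through the UI. If you later want to apply reviewed labels or wiki text from a script, the sketch below shows how such a request might be assembled. Everything in it is an assumption rather than something this tutorial documents: the base URL, the collaboration/tag and collaboration/wiki paths, the "tags", "text", and "version" fields, and the helper name build_collaboration_request. Check the Dremio Cloud API reference before relying on any of it. The code only builds the URL and JSON body; it sends nothing.

```python
import json

# Assumption: base URL and endpoint layout of the Dremio Cloud catalog API.
BASE_URL = "https://api.dremio.cloud/v0"


def build_collaboration_request(project_id, dataset_id, kind, content, version=None):
    """Return (url, json_body) for a label or wiki update; nothing is sent."""
    if kind == "tag":
        body = {"tags": list(content)}  # labels as a list of strings
    elif kind == "wiki":
        body = {"text": content}  # wiki content as a single string
    else:
        raise ValueError("kind must be 'tag' or 'wiki'")
    if version is not None:
        # Assumption: updates to existing labels or wikis carry a version value.
        body["version"] = version
    url = f"{BASE_URL}/projects/{project_id}/catalog/{dataset_id}/collaboration/{kind}"
    return url, json.dumps(body)


# Placeholder IDs and made-up labels, not real Generative AI output.
url, body = build_collaboration_request("PROJECT_ID", "DATASET_ID", "tag", ["taxi", "trips"])
print(url)
print(body)
```

Keeping the request construction separate from the HTTP call makes it easy to inspect the payload before anything is written to the catalog.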

Wrap-up and Next Steps

You've reached the end of the tutorial! You should feel confident that you have a basic, but solid, knowledge of how to use Generative AI to manage your data and minimize the need for manual profiling and cataloging.

Next, try out Text-to-SQL! This Generative AI feature converts natural language queries into SQL directly within the SQL Runner.

To learn more about Dremio, check out Getting Started with Dremio.

Have Questions?

If you have questions about the concepts in this tutorial or other Dremio features, reach out to us using the chat icon in the bottom-right corner of the Dremio application.