Managing Your Data
The Datasets UI provides several components for managing your data. The largest is the Data panel, which you use to explore the spaces and sources in your data catalog, as shown in this image:
|1||By default, you have a home space that you can further organize by creating a hierarchy of folders, and you can create additional spaces.|
|2||A space is a directory in which views are saved. Spaces provide a way to group datasets by common themes such as a project, purpose, department, or geographic region.|
|3||A source is a data lake or external source (such as a relational database) that you can connect to Dremio.|
|4||The title indicates that the Samples data lake is open and lists the contents of the sample source. A source also consists of layers, so if you expand a data source, you will find datasets and data types within the datasets.|
|5||The datasets in the Samples data lake are listed. A dataset is a collection of data. Datasets stored in files can be in many different formats, and to run SQL queries against data in different formats, you can create tables and views. Tables are files or folders that are in your source, whereas views are virtual datasets derived from tables or other views.|
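The distinction between tables and views can be illustrated with a small runnable sketch. It uses Python's built-in sqlite3 module purely as a stand-in, not Dremio itself, and the names trips, long_trips, and long_trip_ids are hypothetical. The point is only that a view stores a query rather than data, and that a view can be derived from a table or from another view.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in; Dremio is not involved.
conn = sqlite3.connect(":memory:")

# A table physically holds rows (in Dremio, a table maps to files or
# folders in a source).
conn.execute("CREATE TABLE trips (id INTEGER, distance_mi REAL)")
conn.executemany(
    "INSERT INTO trips VALUES (?, ?)",
    [(1, 0.8), (2, 3.5), (3, 12.1)],
)

# A view stores only a query; here it is derived from the table.
conn.execute(
    "CREATE VIEW long_trips AS "
    "SELECT id, distance_mi FROM trips WHERE distance_mi > 3"
)

# A second view derived from another view.
conn.execute("CREATE VIEW long_trip_ids AS SELECT id FROM long_trips")

before = conn.execute("SELECT id FROM long_trip_ids ORDER BY id").fetchall()

# A new row in the table is visible through both views without redefining them.
conn.execute("INSERT INTO trips VALUES (4, 7.2)")
after = conn.execute("SELECT id FROM long_trip_ids ORDER BY id").fetchall()

print(before)  # [(2,), (3,)]
print(after)   # [(2,), (3,), (4,)]
```

Because the views hold no data of their own, the row inserted into the table appears in the second query result immediately.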
Click a dataset in the Data panel to view or edit the DDL (or original SQL) statement that was used to create the dataset. The statement is pre-populated in the SQL editor on the Data page.
The SQL editor pre-populates the DDL statement only if you have the required privileges on the dataset. If you don’t, the default SELECT * FROM <name of your dataset> statement is pre-populated instead, as shown in this image:
|1||The indicator that displays the name of the open dataset. The icon on the left indicates the type of dataset. In this case, the icon is for a table created from a source folder.|
|2||The Data panel is used to explore your data catalog.|
|3||Collapsing the Data panel hides the panel from view.|
|4||The Data page is where the SQL editor is located within the Datasets UI.|
When you open a dataset, a preview of the table data does not render by default. The table data loads only when you explicitly click Run or Preview.
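The fallback SELECT * FROM statement described above can be sketched as a small helper that builds the statement from a dataset path. This is a minimal illustration, not part of any Dremio API: the function names are hypothetical, the example path components are illustrative only, and the sketch assumes path components are written as double-quoted identifiers, as in standard SQL.

```python
def quote_identifier(part: str) -> str:
    """Wrap one path component in double quotes, doubling embedded quotes."""
    return '"' + part.replace('"', '""') + '"'


def default_select(path_parts: list[str]) -> str:
    """Build a fallback SELECT * statement for a dataset path (hypothetical helper)."""
    if not path_parts:
        raise ValueError("dataset path must have at least one component")
    return "SELECT * FROM " + ".".join(quote_identifier(p) for p in path_parts)


# Illustrative path only; substitute the components of your own dataset.
statement = default_select(["Samples", "samples.dremio.com", "zips.json"])
print(statement)
# SELECT * FROM "Samples"."samples.dremio.com"."zips.json"
```

Quoting each component separately keeps names that contain dots, such as file names, from being read as additional path levels.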
Adding Data Objects
With the SQL Runner open, you can add data objects from the Data panel directly into the SQL editor.
To add a data object, locate the object that you want to use in the query in the Data panel, and then click the + button or drag and drop the object into the SQL editor.
Starring Data Objects
When viewing the Data panel in the SQL Runner, you can star spaces, sources, folders, and datasets, which adds the item to your Starred list for easier access. The Starred list can hold up to 25 entities at a time, and each starred item remains on the list unless you unstar it.
To star a data object:
- In the Data panel, locate the data object that you want to star. In this example, a dataset is being starred.
- Click the (Star) icon that appears next to the data object. The data object will appear on your Starred list.
To unstar an item, click the Star icon again.
Starring is different from pinning. You can pin only spaces and sources in the Datasets UI.