CREATE TABLE AS
Create a new table as a select statement from another table.
- To create a table inside a namespace like a folder, the namespace must already exist. Dremio does not support creating the namespace and a table inside it in a single command. Instead, you must explicitly create the namespace before you can create the table.
- Table names cannot include the following special characters:
/
,:
,[
, or]
.
CREATE TABLE [ IF NOT EXISTS ] <table_name>
[ AT { REF[ERENCE] | BRANCH } <reference_name> ]
[ PARTITION BY ( { <column_name> | <partition_transform> } [ , ... ] )
[ CLUSTER BY ( <column_name> [ , ... ] ) ]
[ LOCALSORT BY ( <column_name> [ , ... ] ) ]
[ TBLPROPERTIES ( '<property_name>' = '<property_value>' [ , ... ] ) ]
AS <select_statement>
Parameters
IF NOT EXISTS Optional
Include if Dremio should run the statement only if a table with the same name in the same path does not already exist.
<table_name> String
The name of the table that you want to create. The name of the table must be unique.
AT { REF[ERENCE] | BRANCH } <reference_name> String Optional
Specifies the reference at which you want the new table to be created. When this parameter is omitted, the current reference is used.
REF
: Identifies the specific reference.BRANCH
: Identifies the specific branch.
PARTITION BY ( { <column_name> | <partition_transform> } [ , ... ] ) String Optional
The columns or partition-transformation functions on which to partition the table data.
-
<column_name>
: The unique name of the column. Multiple columns may be specified as long as the column names include their data type. Use commas to separate each column name and data-type pair. If a column is listed as a partition column, it cannot also be listed as a sort column for the same table. Each column specified must also be listed as a dimension column. -
<partition_transform>
: Applies only to Iceberg tables. Use one of these partition-transformation functions:Transform Description identity( <col> ) Explicitly specified identity transform. year( <col> ) Partition by year. The column uses either the TIMESTAMP or DATE data type. month( <ts_col> ) Partition by month. The column uses either the TIMESTAMP or DATE data type. day( <ts_col> ) Partition by day. The column uses either the TIMESTAMP or DATE data type. hour( <ts_col> ) Partition by hour. The column uses either the TIMESTAMP or DATE data type. bucket( <count>, <col> ) Partition by hashed value into <count> buckets. truncate( <length>, <col> ) Partition by truncated value. - Strings are truncated to the specified length.
- Integer and biginteger values are truncated to bins.
Example: truncate(10, i) produces 0, 10, 20, and so on.
note- If you partition a table by a column that has more distinct values than Dremio can partition by, you will receive an error message. If you receive this error message, try partitioning your table by a different column.
- If the table is not partitioned, you will receive an error when you attempt to refresh the metadata using the
PARTITION BY
clause.
CLUSTER BY ( <column_name> [ , ... ] ) String Optional
The column(s) to cluster the table on.
Sets the clustering keys for the specified table. To specify multiple columns, use commas to separate each column name.
Clustering cannot be used in conjunction with PARTITION BY
or LOCAL SORT
.
LOCALSORT BY ( <column_name> [ , ... ] ) String
The name of the column that you want to sort the table by. Sorts each Parquet file fragment by the designated column. If you are doing a range filter on your LOCALSORT
column, your query will be faster because the data is already sorted.
TBLPROPERTIES ( '<property_name>' = '<property_value>' [ , ... ] )
Sets one or more table properties for controlling the behavior of the table. See Properties of Apache Iceberg Tables for a list of the supported properties.
AS <select_statement> String
Use the SELECT
statement to populate the new table using data from an existing table or view.
Examples
Create a table in your project storeCREATE TABLE demo_table
AS SELECT * FROM Samples."samples.dremio.com"."zips.json"
CREATE TABLE demo_table2
PARTITION BY (state)
LOCALSORT BY (city)
AS SELECT * FROM Samples."samples.dremio.com"."zips.json"
CREATE TABLE demo.example_table
AS SELECT * FROM "oracle_tpch".DREMIO.JOBS AT TAG Jan2020
CREATE TABLE demo.example_table
AS SELECT * FROM "oracle_tpch".DREMIO.JOBS AT COMMIT "ff2fe50fef5a030c4fc8e61b252bdc33c72e2b6f929d813833d998b8368302e2"
CREATE TABLE myAmazonS3Source.myFolder.myTable
AS SELECT * FROM myAmazonS3Source.anotherFolder.anotherTable