17.0.0 Release Notes (July 2021)
Google Cloud Storage (GCS) Connector
Google Cloud Storage has been added as an option for file system sources, like S3 and HDFS.
Upgrade Process for Custom ARP Sources
Added process forupgrading to new versions of Dremio when using custom ARP sources in the event of breaking changes/features being introduced, such as access control.
Timestamp Mapping for Oracle
Added a new UI option for Oracle date sources to map the Date column to indexed timestamp data. This helps to reduce the duration of queries that would otherwise have to convert the Date column to non-indexed timestamp data.
Median/Percentile SQL Functions
The ability to execute median/percentile commands via SQL functions has been added in the form of
PERCENTILE_DISC. Using these, you may now compute percentiles against any numeric column. The
MEDIAN function may also be used and is interpreted as
PERCENTILE_CONT, so either function name may be used to achieve the same result.
Elasticsearch 7 Connector Preview
This functionality’s preview status is lifted as of Dremio 18.0+ and is available to all users.
Dremio now supports Elasticsearch 7.0+ in the Elasticsearch connector (in addition to 5.x and 6.x), using the same Elasticsearch connector. You can create a new source and connect Dremio to an Elasticsearch 7.x cluster; however, if you already have an Elasticsearch source established with Dremio and you’re planning to upgrade that cluster to 7.0+, you’ll need to remove the source definition and re-add it.
For more information regarding these breaking changes and how they may affect your experience integrating with Dremio, please review Elastic’s breaking changes in 7.0.
Updated Kubernetes Helm Chart
Dremio 17.0.0 features an updated Helm chart for AWS profile authentication support on Kubernetes-based deployments.
Mixed Type Removal
Dremio 18.0 will remove support for columns with mixed data types. A standard schema will then be enforced. After the upgrade, mixed type columns will be converted on the next metadata refresh or query that reads from an affected PDS.
If files contain different types, Dremio will up-promote the data type to a common type, if possible. For example, it will convert data to BIGINT if files contain INT and BIGINT data types.
To prepare for this deprecation, the support key
store.disable.mixed_types may be used to help you identify any PDS that may contain mixed data types. Using this key will emulate the deprecation so that you may easily identify mixed types before all support is ended.
An additional CLI tool is available, which generates a CVS report with each column listed that is affected by the upcoming deprecation. Please contact Dremio Customer Support for assistance in obtaining and running this tool.
Dremio now supports Delta Lake tables where statistics for data files are added later.
After performing a query in Dremio, the results were being stored while a new executor node was simultaneously added. Because the node was in a “not ready” state, it caused the query to fail, even though the node was not used while performing the query.Rather than causing the query to fail, new nodes added while storing query results instead throw a special exception (which is ignored by Dremio) and results are stored locally rather than shared between nodes.
The log directory was moved to a new location, so events failed to appear in the
/var/log/dremio directory until the Dremio service was restarted or the file was full.When log files are moved to the backup location, a rollover is triggered so that all logged events will still appear in the expected directory.
The metadata refresh for Azure configurations was using the GMT timezone for offset calculations.._The metadata refresh has been updated to use the system’s default timezone.
17.1.0 Release Notes (Enterprise Edition Only, November 2021)
When using the raw local file systems for a source (such as Hadoop’s local file system that does not produce extra checksum files), Dremio encountered a cast exception due to other libraries relying on standard local file systems.This unexpected behavior has been resolved with the introduction of a new method that does not change the default Hadoop configuration.
Dremio made unnecessary privilege checks on users accessing child objects of a root entity as well as fetching a user’s role repeatedly. As a result, this created additional KVStore and external calls.Inheritance checks for child objects are only run from the root entity. When users access non-root objects, Dremio will only check grants for the object being accessed, thus reducing Dremio’s overall resource consumption and potential for privilege errors. With regard to roles, Dremio now only fetches a user’s role once and then caches it locally.
A query encountered an error while outstanding messages for a coordinator node, but the error was not acknowledged. This prevented any future query messages to the same coordinator from being acknowledged, and thus future queries could not progress beyond the queue.
All messages are acknowledged by the screen operator, even in the case of a query failing.