On this page

    Connecting to Your Data

    This section describes the data sources that you can configure and analyze using Dremio Cloud, including data lakes (distributed filesystems) and relational databases (external sources).

    note:

    Dremio Cloud does not support case-sensitive data file names, table names, or column names.

    For example, if you have three file names that have the same name, but with difference cases (such as, MARKET, Market, and market), Dremio Cloud is unable to discern the case differences, resulting in unanticipated data results.

    For column names, if, for example, two columns named Trip_Pickup_DateTime and trip_pickup_datetime exist in the table, one of the columns may disappear when the header is extracted.

    Data-as-code

    You can add an Arctic (Preview) catalog as a source to enable Git-like data management and allow data engineers to manage the data lake with the same best practices Git enables for software development, including commits, tags, and branches.

    Data Lakes

    You can run queries directly on the data in your data lake by formatting directories and files into tables. The following types of data lakes are supported:

    Relational Databases (External Sources)

    You can run queries directly on the data in relational databases, which are referred to as external sources. In addition, you can run external queries:

    • That use the native syntax of the relational database.
    • To process SQL statements that are not supported by Dremio Cloud or are too complex to convert.

    note:

    Decimal-to-decimal mappings are supported for relational database sources.

    The following external sources are supported: