Returns the cumulative distribution of the current row with regard to other values within the same window partition.
CUME_DIST() OVER ( [PARTITION BY partition_expression] [ORDER BY order_expression]) → double
- partition_expression: An optional expression that groups rows into partitions.
- order_expression: An optional expression that specifies the order of the rows within each partition.
SELECT "Category", "Descript", "DayOfWeek", CUME_DIST() OVER ( PARTITION BY "Category" ORDER BY "DayOfWeek") FROM Samples."samples.dremio.com"."SF_incidents2016.json" -- Category, Descript, DayOfWeek, EXPR$3 -- ARSON, ARSON, Friday, 0.13636363636363635 -- EMBEZZLEMENT, EMBEZZLED VEHICLE, Friday, 0.18452380952380953
This function does not support cumulative frame windows or sliding frame windows.The example uses Dremio’s Samples Data Lake, which includes sample datasets that you can practice with. You must manually add this data lake. To add in Dremio Cloud, see Add Dremio’s Samples Data Lake to Your Project. Guidance to add in Dremio Software is in development.
Was this page helpful?
Glad to hear it! Thank you for your feedback. Sorry to hear that. Thank you for your feedback.