ROLLUP, CUBE, and GROUPING SETS

ROLLUP, CUBE, and GROUPING SETS

ROLLUP, CUBE, and GROUPING SETS are extensions to GROUP BY that generate multiple levels of aggregation in a single query. They’re useful for generating reports with subtotals and grand totals.

ROLLUP

ROLLUP creates a hierarchical set of subtotals, rolling up from the most detailed level to a grand total.

Syntax

SELECT columns, aggregate_functions
FROM table
GROUP BY ROLLUP(column1, column2, ...);

Example

CREATE TABLE sales (
    id INTEGER PRIMARY KEY,
    region TEXT,
    category TEXT,
    amount FLOAT
);

INSERT INTO sales VALUES
(1, 'East', 'Electronics', 100),
(2, 'East', 'Electronics', 150),
(3, 'East', 'Clothing', 50),
(4, 'West', 'Electronics', 200),
(5, 'West', 'Clothing', 75),
(6, 'West', 'Clothing', 60);

SELECT region, category, SUM(amount) as total
FROM sales
GROUP BY ROLLUP(region, category);

Result:

region | category    | total
-------+-------------+-------
East   | Clothing    | 50.0
East   | Electronics | 250.0
West   | Electronics | 200.0
West   | Clothing    | 135.0
East   | NULL        | 300.0    -- Subtotal for East
West   | NULL        | 335.0    -- Subtotal for West
NULL   | NULL        | 635.0    -- Grand total

How ROLLUP Works

For ROLLUP(region, category), it produces:

  1. Detail rows: (region, category) - each combination
  2. Region subtotals: (region, NULL) - totals per region
  3. Grand total: (NULL, NULL) - overall total

The NULL values indicate the aggregation level.

Column Order Matters

ROLLUP aggregates from right to left:

-- ROLLUP(a, b, c) produces:
-- (a, b, c) - detail
-- (a, b, NULL) - subtotal by a, b
-- (a, NULL, NULL) - subtotal by a
-- (NULL, NULL, NULL) - grand total

CUBE

CUBE generates all possible combinations of grouping columns, providing a complete multi-dimensional analysis.

Syntax

SELECT columns, aggregate_functions
FROM table
GROUP BY CUBE(column1, column2, ...);

Example

SELECT region, category, SUM(amount) as total
FROM sales
GROUP BY CUBE(region, category);

Result:

region | category    | total
-------+-------------+-------
East   | Clothing    | 50.0
East   | Electronics | 250.0
West   | Electronics | 200.0
West   | Clothing    | 135.0
East   | NULL        | 300.0    -- Subtotal for East
West   | NULL        | 335.0    -- Subtotal for West
NULL   | Clothing    | 185.0    -- Subtotal for Clothing
NULL   | Electronics | 450.0    -- Subtotal for Electronics
NULL   | NULL        | 635.0    -- Grand total

How CUBE Works

For CUBE(region, category), it produces all 2^n combinations:

  1. (region, category) - detail rows
  2. (region, NULL) - totals by region
  3. (NULL, category) - totals by category
  4. (NULL, NULL) - grand total

GROUPING SETS

GROUPING SETS provides explicit control over which grouping combinations to generate, allowing you to specify exactly which aggregation levels you need.

Syntax

SELECT columns, aggregate_functions
FROM table
GROUP BY GROUPING SETS ((columns1), (columns2), ...);

Example

-- Specify exact grouping combinations
SELECT region, category, SUM(amount) as total
FROM sales
GROUP BY GROUPING SETS ((region, category), (region), ());

Result:

region | category    | total
-------+-------------+-------
East   | Clothing    | 50.0
East   | Electronics | 250.0
West   | Electronics | 200.0
West   | Clothing    | 135.0
East   | NULL        | 300.0    -- region subtotal
West   | NULL        | 335.0    -- region subtotal
NULL   | NULL        | 635.0    -- grand total

How GROUPING SETS Works

Each inner parentheses defines a grouping:

  • (region, category) - group by both columns (detail rows)
  • (region) - group by region only (region subtotals)
  • () - empty set produces grand total

Equivalence with ROLLUP and CUBE

GROUPING SETS can express any ROLLUP or CUBE:

-- These are equivalent:
GROUP BY ROLLUP(a, b)
GROUP BY GROUPING SETS ((a, b), (a), ())

-- These are equivalent:
GROUP BY CUBE(a, b)
GROUP BY GROUPING SETS ((a, b), (a), (b), ())

Selective Subtotals

Unlike ROLLUP and CUBE, GROUPING SETS lets you pick specific combinations:

-- Only region and category subtotals, no detail rows
SELECT region, category, SUM(amount) as total
FROM sales
GROUP BY GROUPING SETS ((region), (category));

Result:

region | category    | total
-------+-------------+-------
East   | NULL        | 300.0    -- region subtotal
West   | NULL        | 335.0    -- region subtotal
NULL   | Clothing    | 185.0    -- category subtotal
NULL   | Electronics | 450.0    -- category subtotal

GROUPING() Function

The GROUPING() function identifies whether a NULL value in the result represents an actual NULL in the data or indicates a super-aggregate row (subtotal/grand total).

Syntax

GROUPING(column)

Returns:

  • 0 if the column is part of the current grouping (normal row)
  • 1 if the column is aggregated (super-aggregate row)

Example

SELECT
    region,
    category,
    SUM(amount) as total,
    GROUPING(region) as is_region_aggregated,
    GROUPING(category) as is_category_aggregated
FROM sales
GROUP BY GROUPING SETS ((region, category), (region), ());

Result:

region | category    | total | is_region_aggregated | is_category_aggregated
-------+-------------+-------+----------------------+-----------------------
East   | Clothing    | 50.0  | 0                    | 0
East   | Electronics | 250.0 | 0                    | 0
West   | Electronics | 200.0 | 0                    | 0
West   | Clothing    | 135.0 | 0                    | 0
East   | NULL        | 300.0 | 0                    | 1   -- category aggregated
West   | NULL        | 335.0 | 0                    | 1   -- category aggregated
NULL   | NULL        | 635.0 | 1                    | 1   -- both aggregated

Distinguishing NULL Values

Use GROUPING() to handle NULL values properly:

SELECT
    CASE WHEN GROUPING(region) = 1 THEN '(All Regions)' ELSE COALESCE(region, 'Unknown') END as region,
    CASE WHEN GROUPING(category) = 1 THEN '(All Categories)' ELSE COALESCE(category, 'Unknown') END as category,
    SUM(amount) as total
FROM sales
GROUP BY ROLLUP(region, category);

Comparison: ROLLUP vs CUBE vs GROUPING SETS

Feature ROLLUP CUBE GROUPING SETS
Subtotals Hierarchical only All combinations User-defined
Groupings n + 1 2^n As specified
Use case Hierarchical reports Cross-tabulation Custom reports
Flexibility Low Medium High

Grouping Count Example

For 3 columns (a, b, c):

ROLLUP(a, b, c) produces 4 groupings:

  • (a, b, c)
  • (a, b, NULL)
  • (a, NULL, NULL)
  • (NULL, NULL, NULL)

CUBE(a, b, c) produces 8 groupings:

  • (a, b, c)
  • (a, b, NULL)
  • (a, NULL, c)
  • (NULL, b, c)
  • (a, NULL, NULL)
  • (NULL, b, NULL)
  • (NULL, NULL, c)
  • (NULL, NULL, NULL)

Use Cases

ROLLUP for Hierarchical Reports

Time-based hierarchies (year > quarter > month):

SELECT
    EXTRACT(YEAR FROM order_date) as year,
    EXTRACT(QUARTER FROM order_date) as quarter,
    SUM(amount) as total
FROM orders
GROUP BY ROLLUP(
    EXTRACT(YEAR FROM order_date),
    EXTRACT(QUARTER FROM order_date)
);

Geographic hierarchies (country > region > city):

SELECT country, region, city, SUM(sales) as total
FROM stores
GROUP BY ROLLUP(country, region, city);

CUBE for Cross-Tabulation

Analyze sales by multiple dimensions:

SELECT product_type, customer_segment, SUM(revenue) as total
FROM sales
GROUP BY CUBE(product_type, customer_segment);

This gives totals for:

  • Each product_type + customer_segment combination
  • Each product_type (all segments)
  • Each customer_segment (all products)
  • Grand total

Working with NULL Markers

NULL in the result indicates an aggregation level. To distinguish from actual NULL data:

SELECT
    COALESCE(region, '(All Regions)') as region,
    COALESCE(category, '(All Categories)') as category,
    SUM(amount) as total
FROM sales
GROUP BY ROLLUP(region, category);

Result:

region         | category          | total
---------------+-------------------+-------
East           | Clothing          | 50.0
East           | Electronics       | 250.0
West           | Electronics       | 200.0
West           | Clothing          | 135.0
East           | (All Categories)  | 300.0
West           | (All Categories)  | 335.0
(All Regions)  | (All Categories)  | 635.0

Performance Considerations

  • CUBE generates 2^n groupings, which can be expensive for many columns
  • ROLLUP is more efficient for hierarchical data (n+1 groupings)
  • Consider adding a WHERE clause to reduce input data
  • Indexes don’t help with ROLLUP/CUBE aggregations
Columns ROLLUP Groupings CUBE Groupings
2 3 4
3 4 8
4 5 16
5 6 32
6 7 64

For more than 4-5 columns, CUBE may produce too many rows.

Complete Example

-- Sales analysis with ROLLUP
CREATE TABLE quarterly_sales (
    id INTEGER PRIMARY KEY,
    year INTEGER,
    quarter INTEGER,
    product TEXT,
    revenue FLOAT
);

INSERT INTO quarterly_sales VALUES
(1, 2024, 1, 'Widget', 10000),
(2, 2024, 1, 'Gadget', 15000),
(3, 2024, 2, 'Widget', 12000),
(4, 2024, 2, 'Gadget', 18000),
(5, 2024, 3, 'Widget', 11000),
(6, 2024, 3, 'Gadget', 16000);

-- Hierarchical report: Year > Quarter > Product
SELECT
    year,
    quarter,
    product,
    SUM(revenue) as total_revenue,
    COUNT(*) as transactions
FROM quarterly_sales
GROUP BY ROLLUP(year, quarter, product)
ORDER BY year, quarter, product;

This produces a report with:

  • Detail rows per product per quarter
  • Quarterly totals (all products)
  • Yearly totals (all quarters, all products)
  • Grand total