-- Pass 1: Aggregate sales by product, store, day
WITH sales AS (
  SELECT 
    product_id,
    store_id,
    date_id,
    SUM(sales_amount) AS total_sales
  FROM Sales_Fact
  GROUP BY product_id, store_id, date_id
),

-- Pass 2: Aggregate inventory by product, store, day
inventory AS (
  SELECT 
    product_id,
    store_id,
    date_id,
    SUM(on_hand_quantity) AS total_inventory
  FROM Inventory_Fact
  GROUP BY product_id, store_id, date_id
)

-- Pass 3: Combine the aggregated results
SELECT 
  s.product_id,
  s.store_id,
  s.date_id,
  s.total_sales,
  i.total_inventory
FROM sales s
-- LEFT JOIN keeps every sales row; combinations with inventory but no sales are dropped
LEFT JOIN inventory i
  ON s.product_id = i.product_id
 AND s.store_id   = i.store_id
 AND s.date_id    = i.date_id;

✅ The join now runs between small aggregated result sets, not the massive fact tables.
✅ Avoids mismatched-grain joins, because each fact table is aggregated to the same grain (product, store, day) before the join.
✅ More efficient and semantically correct.
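To make the mismatched-grain risk concrete, here is a minimal sketch with made-up rows. The table and column names come from the query above; the column types and data values are assumptions for illustration only. It shows what a direct fact-to-fact join does when both tables hold more than one row for the same product, store, and day.

```sql
-- Hypothetical sample data: the column types and values are assumptions.
CREATE TABLE Sales_Fact (
  product_id   INT,
  store_id     INT,
  date_id      INT,
  sales_amount DECIMAL(10, 2)
);

CREATE TABLE Inventory_Fact (
  product_id       INT,
  store_id         INT,
  date_id          INT,
  on_hand_quantity INT
);

-- Two sales rows and two inventory rows for the same product, store, and day.
INSERT INTO Sales_Fact VALUES (1, 10, 20250818, 100.00);
INSERT INTO Sales_Fact VALUES (1, 10, 20250818, 50.00);
INSERT INTO Inventory_Fact VALUES (1, 10, 20250818, 30);
INSERT INTO Inventory_Fact VALUES (1, 10, 20250818, 20);

-- Direct fact-to-fact join: every sales row pairs with every inventory row
-- (2 x 2 = 4 joined rows), so each measure is counted twice.
SELECT
  s.product_id,
  s.store_id,
  s.date_id,
  SUM(s.sales_amount)     AS total_sales,      -- 300.00, but the true total is 150.00
  SUM(i.on_hand_quantity) AS total_inventory   -- 100, but the true total is 50
FROM Sales_Fact s
JOIN Inventory_Fact i
  ON s.product_id = i.product_id
 AND s.store_id   = i.store_id
 AND s.date_id    = i.date_id
GROUP BY s.product_id, s.store_id, s.date_id;
```

Run against the same four rows, the multipass query returns 150.00 and 50, because each fact table is summed on its own before the join.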

3. When to Use Multipass SQL

When users want to combine metrics from different processes (sales, inventory, shipments, returns, etc.).
When fact tables are at different grains (transaction-level vs. snapshot-level).
When a fact-to-fact join would otherwise cause performance or correctness problems.
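As a rough sketch of the first case, the same pattern extends to a third business process. `Returns_Fact`, `return_amount`, and the column types below are hypothetical and not taken from any real schema. The `all_keys` step is one possible way to keep product/store/day combinations that appear in only one of the passes; a `FULL OUTER JOIN` would be an alternative where the database supports it.

```sql
-- Assumed schemas; Returns_Fact and return_amount are hypothetical names.
CREATE TABLE Sales_Fact     (product_id INT, store_id INT, date_id INT, sales_amount DECIMAL(10, 2));
CREATE TABLE Inventory_Fact (product_id INT, store_id INT, date_id INT, on_hand_quantity INT);
CREATE TABLE Returns_Fact   (product_id INT, store_id INT, date_id INT, return_amount DECIMAL(10, 2));

-- One pass per business process, each aggregated to the same grain.
WITH sales AS (
  SELECT product_id, store_id, date_id, SUM(sales_amount) AS total_sales
  FROM Sales_Fact
  GROUP BY product_id, store_id, date_id
),
inventory AS (
  SELECT product_id, store_id, date_id, SUM(on_hand_quantity) AS total_inventory
  FROM Inventory_Fact
  GROUP BY product_id, store_id, date_id
),
returns_agg AS (
  SELECT product_id, store_id, date_id, SUM(return_amount) AS total_returns
  FROM Returns_Fact
  GROUP BY product_id, store_id, date_id
),
-- Every product/store/day combination seen in any of the three passes.
all_keys AS (
  SELECT product_id, store_id, date_id FROM sales
  UNION
  SELECT product_id, store_id, date_id FROM inventory
  UNION
  SELECT product_id, store_id, date_id FROM returns_agg
)
-- Final pass: attach each aggregated measure to the shared key set.
SELECT
  k.product_id,
  k.store_id,
  k.date_id,
  s.total_sales,
  i.total_inventory,
  r.total_returns
FROM all_keys k
LEFT JOIN sales s
  ON k.product_id = s.product_id
 AND k.store_id   = s.store_id
 AND k.date_id    = s.date_id
LEFT JOIN inventory i
  ON k.product_id = i.product_id
 AND k.store_id   = i.store_id
 AND k.date_id    = i.date_id
LEFT JOIN returns_agg r
  ON k.product_id = r.product_id
 AND k.store_id   = r.store_id
 AND k.date_id    = r.date_id;
```

A measure comes back as NULL wherever its process has no rows for that combination. Wrap it in `COALESCE(..., 0)` if zero is the intended meaning.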

4. Related Design Principle

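Multipass SQL is part of the "separate fact tables, conformed dimensions" strategy in dimensional modeling: each business process gets its own fact table, and the shared dimensions are what make the separate results combinable.
BI and OLAP tools (such as MicroStrategy, BusinessObjects, and Cognos) often generate multipass SQL automatically behind the scenes to support cross-fact analysis.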
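The role of the conformed dimension can be sketched as follows. `Product_Dim`, its `category` column, and the column types are assumptions made for this example. Because both fact tables reference the same product dimension, each pass can roll up to the same attribute, and the final join happens on that attribute instead of on fact-table keys.

```sql
-- Assumed schemas; Product_Dim and its category column are hypothetical.
CREATE TABLE Product_Dim    (product_id INT PRIMARY KEY, category VARCHAR(50));
CREATE TABLE Sales_Fact     (product_id INT, store_id INT, date_id INT, sales_amount DECIMAL(10, 2));
CREATE TABLE Inventory_Fact (product_id INT, store_id INT, date_id INT, on_hand_quantity INT);

-- Pass 1: sales rolled up to category and day through the shared dimension.
WITH sales_by_category AS (
  SELECT p.category, f.date_id, SUM(f.sales_amount) AS total_sales
  FROM Sales_Fact f
  JOIN Product_Dim p
    ON f.product_id = p.product_id
  GROUP BY p.category, f.date_id
),
-- Pass 2: inventory rolled up to the same category and day.
-- date_id stays in the grain so on-hand quantities are never summed across days.
inventory_by_category AS (
  SELECT p.category, f.date_id, SUM(f.on_hand_quantity) AS total_inventory
  FROM Inventory_Fact f
  JOIN Product_Dim p
    ON f.product_id = p.product_id
  GROUP BY p.category, f.date_id
)
-- Pass 3: combine on the conformed attributes, not on fact-table keys.
SELECT
  s.category,
  s.date_id,
  s.total_sales,
  i.total_inventory
FROM sales_by_category s
LEFT JOIN inventory_by_category i
  ON s.category = i.category
 AND s.date_id  = i.date_id;
```

This only works if `category` means the same thing for both fact tables, which is what a conformed dimension provides.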

👉 In short:
Multipass SQL means aggregating each fact table separately, then combining the smaller result sets, instead of directly joining giant fact tables.
It’s all about performance and avoiding mismatched-grain mistakes.

Posted on 2025-08-18 09:56 by ZhangZhihuiAAA

博客园 © 2004-2025