公告

日历

branch_key	branch_name	branch_type	parent_branch_key	pathstring	pathstring_names	depth
1	USA	Country	NULL	/1/	/USA/	1
2	Northeast	Region	1	/1/2/	/USA/Northeast/	2
3	Boston	City	2	/1/2/3/	/USA/Northeast/Boston/	3
4	Boston-01	Branch	3	/1/2/3/4/	/USA/Northeast/Boston/B01/	4
5	Midwest	Region	1	/1/5/	/USA/Midwest/	2
6	Chicago	City	5	/1/5/6/	/USA/Midwest/Chicago/	3
7	Chicago-01	Branch	6	/1/5/6/7/	/USA/Midwest/Chicago/C01/	4

Columns

branch_key: Surrogate key for the node.
branch_name: Name of the node (e.g., "Chicago").
branch_type: What kind of node it is (Country, Region, City, Branch, etc.).
parent_branch_key: Links to the parent node’s surrogate key.
pathstring: Path of IDs from root to this node, delimited (e.g., /1/5/6/7/).
pathstring_names: Human-readable path for reporting.
depth: Numeric depth level (root=1).

3. Advantages of Pathstring

Variable depth friendly — you don’t need to redesign when levels change.

Easy to filter for descendants:

Easy to find ancestors:

4. Best Practices

Delimiter choice: Use / or another character unlikely to appear in IDs.
Indexing:
- For descendant queries, index the pathstring column.
- For ancestor queries, consider materialized paths with reverse strings or store both directions.
Depth column: Store depth for quick filtering (e.g., find only level 3).
SCD handling: If node names or parents can change, consider SCD Type 2 so you preserve historical paths.
Pathstring vs. path IDs:
- Keep pathstring with surrogate keys for joins.
- Keep pathstring_names for display purposes.

5. When to Use Pathstring

✅ When the hierarchy is irregular and changes over time.
✅ When you need fast descendant/ancestor queries without recursive CTEs.
✅ When you can afford a slightly larger dimension table (path redundancy).

posted on 2025-08-16 09:57 ZhangZhihuiAAA 阅读(14) 评论(0) 收藏举报

刷新页面返回顶部

导航

1. The Problem

2. Pathstring Approach

Example Table: `dim_branch`

Columns

3. Advantages of Pathstring

4. Best Practices

5. When to Use Pathstring


博客园 © 2004-2025 浙公网安备 33010602011771号浙ICP备2021040463号-3

导航

1. The Problem

2. Pathstring Approach

Example Table: dim_branch

Columns

3. Advantages of Pathstring

4. Best Practices

5. When to Use Pathstring

Example Table: `dim_branch`