Understanding Filtered Indexes

Getting Started

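Filtered indexes are a feature introduced in SQL Server 2008. A filtered index is a nonclustered index built over only a subset of the rows in a table.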

Basic syntax:

create nonclustered index index_name on <object> (columns) where <filter_predicate>
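As a concrete illustration of this syntax, here is a minimal sketch against a hypothetical dbo.orders table. The table, its columns, and the index name are assumptions made up for illustration and are not part of the test script used later. The index covers only rows whose shipped_date is still NULL.

```sql
-- Hypothetical table, used only to illustrate the syntax.
create table dbo.orders (
    order_id     int identity primary key,
    customer_id  int not null,
    shipped_date datetime null
);

-- Index only the unshipped orders; rows with a non-NULL shipped_date
-- are not stored in this index.
create nonclustered index ix_orders_unshipped
    on dbo.orders (customer_id)
    where shipped_date is null;
```

A predicate like this suits a table where most rows are "finished" and queries mostly target the small remaining subset.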

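In certain workloads, a filtered index has the following advantages over a traditional full-table nonclustered index:

Improved query performance and plan quality
Reduced index storage costs
Reduced index maintenance costs

Next, I will use examples to illustrate these three advantages.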

Improved query performance and plan quality

Create two tables (table_a and table_b) in the TestDB database, each holding the same rows (1,000,000 rows per table). The test script is shown below:

use TestDB

if object_id('table_a') is not null drop table table_a

if object_id('table_b') is not null drop table table_b

create table table_a (id int identity,col1 int,col2 nvarchar(128),constraint pk_table_a primary key(id))

create table table_b (id int identity,col1 int,col2 nvarchar(128),constraint pk_table_b primary key(id))

insert into table_a(col1,col2)

select top(1000000) a.object_id as col1,b.name as col2

from sys.all_objects a,

sys.all_columns b

insert into table_b(col1,col2)

select col1,col2 from table_a

Open a new query in Microsoft SQL Server Management Studio and run the SQL statements above.
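After the script finishes, a quick sanity check can confirm that both tables were loaded identically. This is only a sketch; the exact counts depend on the contents of the system catalog views on your instance, so no expected values are shown.

```sql
use TestDB;

-- Both row counts should match if the load succeeded.
select 'table_a' as table_name, count(*) as row_count from dbo.table_a
union all
select 'table_b', count(*) from dbo.table_b;

-- Number of rows in the col1 range used by the test queries.
select count(*) as rows_in_range
from dbo.table_a
where col1 between -200 and 10;
```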

Without an index:

Suppose I want the id and col1 values of the rows that satisfy "col1 between -200 and 10". The corresponding SQL statement is:

select id,col1 from table_a a where a.col1 between -200 and 10

To capture the execution plan and I/O information, I enable "set statistics profile,io on":

use TestDB

set statistics profile,io on

select id,col1 from table_a a where a.col1 between -200 and 10

set statistics profile,io off

The query returns 17,540 rows. The execution plan uses a clustered index scan (pk_table_a), with 4,311 logical reads:

Figure 1.

Filtered index vs. full-table nonclustered index:

To improve query performance, one would normally create a nonclustered index on column col1, for example ix_table_a_col1:

create nonclustered index ix_table_a_col1 on dbo.table_a(col1)

To compare a filtered index with a full-table nonclustered index, I also create a filtered index on table_b, ix_table_b_col1_Filtered:

create nonclustered index ix_table_b_col1_Filtered on dbo.table_b(col1) where col1>=-200
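To confirm that both indexes exist and that only the one on table_b carries a filter, the index metadata can be inspected. The following is a sketch using the sys.indexes catalog view, whose has_filter and filter_definition columns are available from SQL Server 2008 onward.

```sql
use TestDB;

-- List the indexes on both test tables together with their filter predicate, if any.
select object_name(i.object_id) as table_name,
       i.name                   as index_name,
       i.type_desc,
       i.has_filter,
       i.filter_definition
from sys.indexes as i
where i.object_id in (object_id('dbo.table_a'), object_id('dbo.table_b'))
order by table_name, i.index_id;
```

For ix_table_b_col1_Filtered, has_filter should be 1 and filter_definition should show the col1>=-200 predicate; for ix_table_a_col1, has_filter should be 0.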

Next, query both tables for the id and col1 values where "col1 between -200 and 10":

use TestDB

set statistics profile,io on

select id,col1 from table_a a where a.col1 between -200 and 10

select id,col1 from table_b a where a.col1 between -200 and 10

set statistics profile,io off

Figure 2.

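Figure 2. In the actual execution plan statistics for table_a and table_b, compare the TotalSubtreeCost values (the estimated cumulative cost of an operator and all of its child operations). table_b, which uses the filtered index (TotalSubtreeCost=0.02331454), is clearly lower than table_a, which uses the full-table nonclustered index (TotalSubtreeCost=0.05036455). In other words, the cost with the filtered index is roughly half the cost with the full-table nonclustered index.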
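One caveat worth keeping in mind: the optimizer can use a filtered index only when the query predicate is guaranteed to fall within the index filter. The sketch below is an extra experiment, not part of the original test, that runs two queries against table_b: one whose range lies inside the col1>=-200 filter and one whose lower bound (-300, an arbitrary value chosen for illustration) lies outside it.

```sql
use TestDB;

set statistics profile on;
set statistics io on;

-- Predicate lies inside the filter (col1 >= -200):
-- ix_table_b_col1_Filtered is a candidate for this query.
select id,col1 from dbo.table_b as b where b.col1 between -200 and 10;

-- Lower bound -300 is outside the filter: the filtered index holds no rows
-- with col1 < -200, so a different access path should be chosen
-- (for example, a scan of pk_table_b).
select id,col1 from dbo.table_b as b where b.col1 between -300 and 10;

set statistics io off;
set statistics profile off;
```

Comparing the two plans and their logical reads shows how closely the filter predicate needs to match the queries the index is meant to serve.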