Longest Increasing Subsequences(最长递增子序列)的两种DP实现

一、本文内容
最长递增子序列的两种动态规划算法实现,O(n^2)及O(nlogn).
 
 
二、问题描述
最长递增子序列:给定一个序列,从该序列找出最长的 升序/递增 子序列。
特点:1、子序列不要求连续; 2、子序列在原序列中按严格(strictly)升序排序; 3、最长递增子序列不唯一。
 
注:下文最长递增子序列用缩写LIS表示。
 
example:
0, 8, 4, 12, 2, 10, 6, 14, 1, 9, 5, 13, 3, 11, 7, 15
 
对应的LIS:
0, 2, 6, 9, 13, 15
0, 4, 6, 9, 11, 15
 
 
三、算法描述
1、考察第i+1个元素时,不考虑前面i个元素的状态
给定长度为n的序列A[0..n-1],对于给定某个的阶段状态,它以前各阶段的状态无法直接影响它未来的决策,而只能间接通过当前状态来影响;换句话说,每个状态都是过去历史的完整总结。即LIS[i]当前的状态与LIS[0..i-1]已无关。这几句话可以用Fibonacci的递归树来验证,对LIS的递归树同样适用。 如下面的递归树:
 
                                    LIS(3)          
                                /       |      \
                      LIS(2)     LIS(1)    LIS(0) 
                      /     \       /        
             LIS(1)  LIS(0)  LIS(0)
            /   
         LIS(0) 
 
很显然,递归树有很多重复/重叠的子问题,而大问题的最优解又可以由这些子问题的最优解得到,符合DP的两个条件,故可以用DP来解决LIS问题。
那么给定序列A[0..n-1],求它的LIS,可以top-down分解任务,得到递归树;递归树bottom-up就可以建立子问题的查询表以供求解当前问题时查询。
这就是DP的两种处理方式:Memoization(top-down) 和 Tabulation (bottom-up).
 
假设LIS[i]表示A[0..i]的最长递增子序列的长度,那么
LIS[i+1] = max{1, LIS[j]+1}, A[i+1]>A[j], for any j<=i;   注意:到A[i]的子序列长度不一定大于A[j]的子序列长度(如上例当A[j]=12, LIS[j]={0, 4, 12}和当A[i]=2, LIS[i]={0, 2})
若A[i+1]>A[j],则A[i+1]可以接在LIS[j]长的子序列后面构成一个更长的子序列。
同时, 从A[i+1]开始又构成一个长度为1的子序列。故两者取大值。
 
 
2、考察第i+1个元素时,考虑前面i个元素的状态
那么,什么时候在一个已存在的序列中添加或者替换一个元素是安全的呢?(DP算法都是offline algorithm,需要全局考虑)
显然,我们需要维护所有递增的子序列(称其为active list,即可能成为max{LIS}的子序列)。而这些子序列的长度都不同,
可以按照插入排序的思想,一个一个元素地从前面找到其应该归属的子序列(active list)。
 
有A[i]>A[i-1]和A[i]<A[i-1]两种情况,A[i]<A[i-1]又有一种特殊情况A[i]是当前序列中的最小元素,即A[i]<A[j], for any j<i;
 
case 1. If A[i] is smallest among all end candidates of active lists, we will start new active list of length 1. 
(when we encounter new smallest element in the array, it can be a potential candidate to start new sequence,{2, 5, 3, 1, 2, 3, 4, 5, 6})
 
case 2. If A[i] is largest among all end candidates of active lists, we will clone the largest active list, and extend it by A[i].
 
case 3. If A[i] is in between, we will find a list with largest end element that is smaller than A[i]. Clone and extend this list by A[i]. We will discard all other lists of same length as that of this modified list. 
(找到end element小于A[i]的active list之后,其他相同长度的active list将被删除,因为A[i]小于这些等长active list的end element,用新的active list代替将被删除的active list:复制小于A[i]的active list,并添加A[i])
 
处理过程如下
example:A[ ] = {0, 8, 4, 12, 2, 10, 6, 14, 1, 9, 5, 13, 3, 11, 7, 15}
A[0] = 0. Case 1. There are no active lists, create one.
0.
-----------------------------------------------------------------------------
A[1] = 8. Case 2. Clone and extend.
0.
0, 8.
-----------------------------------------------------------------------------
A[2] = 4. Case 3. Clone, extend and discard.
0.
0, 4.
0, 8. Discarded
-----------------------------------------------------------------------------
A[3] = 12. Case 2. Clone and extend.
0.
0, 4.
0, 4, 12.
-----------------------------------------------------------------------------
A[4] = 2. Case 3. Clone, extend and discard.
0.
0, 2.
0, 4. Discarded.
0, 4, 12.
-----------------------------------------------------------------------------
A[5] = 10. Case 3. Clone, extend and discard.
0.
0, 2.
0, 2, 10.
0, 4, 12. Discarded.
-----------------------------------------------------------------------------
A[6] = 6. Case 3. Clone, extend and discard.
0.
0, 2.
0, 2, 6.
0, 2, 10. Discarded.
-----------------------------------------------------------------------------
A[7] = 14. Case 2. Clone and extend.
0.
0, 2.
0, 2, 6.
0, 2, 6, 14.
-----------------------------------------------------------------------------
A[8] = 1. Case 3. Clone, extend and discard.
0.
0, 1.
0, 2. Discarded.
0, 2, 6.
0, 2, 6, 14.
-----------------------------------------------------------------------------
A[9] = 9. Case 3. Clone, extend and discard.
0.
0, 1.
0, 2, 6.
0, 2, 6, 9.
0, 2, 6, 14. Discarded.
-----------------------------------------------------------------------------
A[10] = 5. Case 3. Clone, extend and discard.
0.
0, 1.
0, 1, 5.
0, 2, 6. Discarded.
0, 2, 6, 9.
-----------------------------------------------------------------------------
A[11] = 13. Case 2. Clone and extend.
0.
0, 1.
0, 1, 5.
0, 2, 6, 9.
0, 2, 6, 9, 13.
-----------------------------------------------------------------------------
A[12] = 3. Case 3. Clone, extend and discard.
0.
0, 1.
0, 1, 3.
0, 1, 5. Discarded.
0, 2, 6, 9.
0, 2, 6, 9, 13.
-----------------------------------------------------------------------------
A[13] = 11. Case 3. Clone, extend and discard.
0.
0, 1.
0, 1, 3.
0, 2, 6, 9.
0, 2, 6, 9, 11.
0, 2, 6, 9, 13. Discarded.
-----------------------------------------------------------------------------
A[14] = 7. Case 3. Clone, extend and discard.
0.
0, 1.
0, 1, 3.
0, 1, 3, 7.
0, 2, 6, 9. Discarded.
0, 2, 6, 9, 11.
----------------------------------------------------------------------------
A[15] = 15. Case 2. Clone and extend.
0.
0, 1.
0, 1, 3.
0, 1, 3, 7.
0, 2, 6, 9, 11.
0, 2, 6, 9, 11, 15. <-- LIS List
----------------------------------------------------------------------------
 
注:观察上面的处理过程,我们都只在处理所有active list的最后一个元素end element(粗体),那么仅仅需要维护所有active list构成的end element集合(粗体斜边),
      可以用一维数组来存储。discard操作可以用replace操作来模拟。
 
 
四、算法实现
 
第三节的算法描述序号分别对应下面的算法实现
1、时间复杂度为O(n^2)的DP实现
 1 /**
 2  @description: Longest Increasing Subsequence
 3  @author:   seiyagoo
 4  @create:   2013.10.25
 5  @modified: 2013.10.26
 6 **/
 7 int LIS_1(int A[], int size){
 8 
 9     int *LIS   = new int[size];
10     vector<int> *vec = new vector<int>[size];
11 
12     /* Compute optimized LIS values in bottom up manner */
13     for(int i=0; i < size; i++){
14         LIS[i]=1;                        //初始化默认长度
15         int max_j=0, flag=0;
16         for(int j=0; j < i; j++){        //查表,找出前面最长的序列, 若将A[i]加入LIS[j](LIS[j]+1的含义)的递增子序列比当前的LIS[i]更长, 则更新LIS[i]
17             if(A[i] > A[j] && LIS[i] < LIS[j]+1){
18                 LIS[i] = LIS[j]+1;
19                 max_j=j;
20                 flag=1;
21             }
22         }
23         if(flag)                        //copy前面最长子序列到vec[i]
24             vec[i].insert(vec[i].end(), vec[max_j].begin(), vec[max_j].end());
25         vec[i].push_back(A[i]);       //最后放入A[i]
26     }
27     
28     /*Show LIS of the current state*/
29     vector<int>::iterator it;
30     cout<<left;
31     for(int i=0; i<size; i++){
32         cout<<setw(2)<<A[i]<< "  -->  ";
33         for(it = vec[i].begin(); it!=vec[i].end(); it++)
34             cout<<*it<<" ";
35         cout<<endl;
36     }
37     
38     /* Pick maximum of all LIS values, namely max{LIS[i]} */
39     int max_len=0;
40     for(int i = 0; i < size; i++ )
41       if( max_len < LIS[i] )
42          max_len = LIS[i];
43          
44     delete[] LIS;
45     delete[] vec;
46 
47     return max_len;
48 }

 

2、时间复杂度为O(nlogn)的DP实现
 1 /**
 2  @description: Longest Increasing Subsequence
 3  @author:   seiyagoo
 4  @create:   2013.10.25
 5  @modified: 2013.10.26
 6 **/
 7 
 8 // Binary search (note boundaries in the caller)
 9 // A[] is ceilIndex in the caller
10 int CeilIndex(int A[], int l, int r, int key) {
11     int m;
12  
13     while( r - l > 1 ) {
14         m = l + (r - l)/2;
15         (A[m] >= key ? r : l) = m; // ternary expression returns an l-value
16     }
17  
18     return r;
19 }
20 
21 int LIS_2(int A[], int size) {
22     // boundary case: when array size is one
23     if( 1 == size ) return 1;
24  
25     int *tailTable   = new int[size];
26     vector<int> *vec = new vector<int>[size];
27     int len; // always points empty slot
28  
29     //memset(tailTable, INT_MAX, sizeof(tailTable[0])*size);  @bug
30 
31     for(int i = 0; i < size; i++)
32         tailTable[i] = INT_MAX;
33     
34     tailTable[0] = A[0];          //tailTable[0] store the smallest value
35     vec[0].push_back(A[0]);
36     
37     len = 1;
38     for( int i = 1; i < size; i++ ) {
39         if( A[i] < tailTable[0] ) {  //case 1: new smallest value
40             tailTable[0] = A[i];
41 
42             /*discard and create*/
43             vec[0].clear();
44             vec[0].push_back(A[i]);
45         }
46         else if( A[i] > tailTable[len-1] ) { //case 2: A[i] wants to extend largest subsequence
47             tailTable[len++] = A[i];
48             
49             /*clone and extend*/
50             vec[len-1] = vec[len-2];
51             vec[len-1].push_back(A[i]);
52         }
53         else { //case 3: A[i] wants to be current end candidate of an existing subsequence, It will replace ceil value in tailTable
54             int ceilIndex = CeilIndex(tailTable, -1, len-1, A[i]);
55             tailTable[ceilIndex] = A[i];
56 
57             /*discard, clone and extend*/
58             vec[ceilIndex].clear();
59             vec[ceilIndex] = vec[ceilIndex-1];
60             vec[ceilIndex].push_back(A[i]);
61         }
62         
63         /*Printf all the active lists*/
64         vector<int>::iterator it;
65         cout<<left;
66         cout<<"A["<<i<<"] = "<<A[i]<<endl<<endl;
67         cout<<"active lists:"<<endl;
68         for(int i=0; i<len; i++){
69             for(it = vec[i].begin(); it!=vec[i].end(); it++)
70                 cout<<*it<<" ";
71             cout<<endl;
72         }
73 
74         /*Printf end elements of all the active lists*/
75         cout<<endl<<"end elements array:"<<endl;
76         for(int i = 0; i < size; i++)
77             if(tailTable[i] != INT_MAX)
78                 cout<<tailTable[i]<<" ";
79         cout<<endl;
80         cout<<"-------------------------"<<endl;
81     }
82     
83     
84     delete[] tailTable;
85     delete[] vec;
86  
87     return len;
88 }

 

五、运行结果

example:

0, 8, 4, 12, 2, 10, 6, 14, 1, 9, 5, 13, 3, 11, 7, 15
 
算法实现一
 
算法实现二
 
 
参考:
《编程之美》
posted on 2013-10-26 15:15  Seiyagoo  阅读(1829)  评论(2编辑  收藏  举报