决策变元选择_决策分支策略——文献学习SAT Solving with Reference Points

要点：使用DMRP algorithm (decision making with a reference point)在重启阶段为解决问题提供好的方向。

引文来自：

Kottler S. (2010) SAT Solving with Reference Points. In: Strichman O., Szeider S. (eds) Theory and Applications of Satisfiability Testing – SAT 2010. SAT 2010. Lecture Notes in Computer Science, vol 6175. Springer, Berlin, Heidelberg

Abstract

Many state-of-the-art SAT solvers use the VSIDS heuristic to make branching decisions based on the activity of variables or literals. In combination with rapid restarts and phase saving this yields a powerful decision heuristic in practice. However, there are approaches that motivate more in-depth reasoning to guide the search of the SAT solver. But more reasoning often requires more information and comes along with more complex data structures. This may sometimes even cause strong concepts to be inapplicable in practice.

译文：然而，有一些方法可以激发更深入的推理来指导SAT求解器的搜索。

译文：但更多的推理往往需要更多的信息，并伴随着更复杂的数据结构。这有时甚至可能导致强概念在实践中不适用。

In this paper we present a suitable data structure for the DMRP approach to overcome the problem above. Moreover, we show how DMRP can be combined with CDCL solving to be competitive to state-of-the-art solvers and to even improve on some families of industrial instances.

译文：在本文中，我们提出了一个适用于DMRP方法的数据结构来克服上述问题。

译文：此外，我们还展示了如何将DMRP与CDCL解决方案结合在一起，以便与最先进的解决方案相竞争，甚至改进一些工业实例系列。

1 Introduction

Research in satisfiability checking (SAT) has managed to bridge the gap between theory and practice in many aspects. There are several kinds of real-world problems that are actually tackled by modelling those problems as SAT instances like hardware and software verification [21,10], planning [11], automotive product configuration [13] and haplotype inference in bioinformatics [15] (cf. [16]).

译文：可满足性检验的研究在许多方面弥补了理论与实践的差距。译文：通过将这些问题建模为SAT实例，实际上可以解决几种实际问题，如硬件和软件验证[21,10]、规划[11]、汽车产品配置[13]和生物信息学[15]中的单倍型推断(cf.[16])。

In the domain of SAT solving there are different schemes and even more variants of these schemes to decide whether there exists a satisfying assignment to the variables of a Boolean formula in CNF or if a formula cannot be satisfied by any assignment.译文：在SAT求解的领域中，有不同的方案，甚至这些方案的变种，以决定是否存在一个布尔公式的变量分配CNF或如果一个公式不能满足分配。

Both experiments and applications show that there is no perfect SAT solving approach that is suited for all different categories and families of problem instances. 译文：实验和应用表明，没有一种完美的SAT解决方法适合所有不同类别和家庭的问题实例。

However, conflict-driven solving has proven itself to be very successful on a wide range of benchmarks. In this paper we study the quite new DMRP algorithm (decision making with a reference point) [8,9] from a practical point of view. Moreover, a hybrid approach that combines DMRP and CDCL solving is presented which is also motivated by experimental evaluations.

The paper is organised as follows: In section 2 we sketch related work in the domain of CDCL and DMRP solving. Section 3 examines the DMRP approach from a practical point of view and we introduce a new implementation for this approach. 译文：从实用的角度研究了DMRP方法，并介绍了该方法的新实现。In section 4 we motivate the combination of CDCL and DMRP to a new hybrid approach. In section 5 some experimental results are presented.

2 Related Work

CDCL采用了从冲突赋值中学习的思想。

In this chapter two different SAT solving approaches are sketched. The state-ofthe-art conflict-driven solving and the quite recent DMRP approach that operates on complete assignments. By V(F) resp. Γ(F) we state the set of variables resp. clauses of a formula F (we omit F if evident). A clause consists of literals l_i that are variables v or their negations v'. The polarity of a literal is true or f alse respectively. var(l) indicates the variable of literal l.

Conflict-Driven Solving.

Conflict-driven solving with clause learning (CDCL) is a leading approach and is especially but not only successful for industrial problems. It is based on the GRASP algorithm [17] which extends the original DPLL branch-and-bound procedure [5,4] by the idea of learning from conflicting assignments.

Moreover, conflicts are analysed to jump over parts of the search space that would cause further conflicts. 此外，对冲突进行分析，以跳过搜索空间中可能导致进一步冲突的部分。

There are several improvements to the original algorithm like the two-watched-literal data structure and the VSIDS (variable state independent decaying sum) variable selection heuristic [18].

In recent years further improvements have been achieved by developing different restart strategies like the concept of rapid and adaptive restarts [3,2] and so-called Luby restarts [14].

In combination with phase-saving [19] frequent restarts constitute a strong concept especially for industrial SAT instances.

此外，对冲突进行分析，以跳过搜索空间中可能导致进一步冲突的部分。

Decision Making with a Reference Point.

DMRP is a new SAT solving approach that was proposed by Goldberg in [8,9]. Even though DMRP uses Boolean constraint propagation (BCP) with backtracking and learning from conflicting assignments it is not a simple variant of CDCL.译文：尽管DMRP使用布尔约束传播(BCP)来回溯并从冲突的赋值中学习，但它不是CDCL的一个简单变体。

In difference to CDCL solvers DMRP additionally holds a complete assignment (a so-called reference point).

译文：DMRP还持有一个完整的分配(一个所谓的参考点)

The algorithm aims for modifying the current reference point P to P’ in order to satisfy a clause under consideration. 译文：该算法的目的是将当前参考点P修改为P’，以满足考虑中的一个子句。

Furthermore, it is crucial that all clauses being satisfied by P remain satisfied by the modified reference point P‘ .

译文：此外，至关重要的是，所有被P所满足的子句都必须被修改的参考点P’所满足。

Algorithm 1 gives an overview of the DMRP approach, though this notation varies in some ways from the original notation in [9].译文：算法1概述了DMRP方法，尽管这种表示法在某些方面与[9]中的原始表示法不同。

One invocation of the DMRP subsolver (line 7 of Algorithm 1) takes a clause and a reference point as arguments.

译文：对DMRP子解算器(算法1的第7行)的一次调用采用子句和引用点作为参数。

DMRP子解算器(算法1的第7行)以子句和引用点作为参数

It may either compute a modification to the reference point or it may learn the empty clause or else it times out.

译文：它可以计算对参考点的修改，或者学习空子句，或者超时。

The latter case causes the surrounding algorithm to call the DMRP subsolver with another unsatisfied clause.

译文：后一种情况会导致周围的算法调用带有另一个不满意子句的DMRP子解算器。

3 A Closer Look at DMRP

Taking the set of clauses that are not satisfied by a current assignment as basis for branching decisions requires the solver to know this set of clauses.译文：如果将当前赋值不满足的子句集作为分支决策的基础，则求解器需要知道该子句集。

This could be realized analogously to how it is implemented in many local search approaches [20,7] where the solver keeps track of clauses that change their state from ’satisfied’ to ’not satisfied’ and vice versa whenever the value of a variable changes.译文：这可以类似于在许多本地搜索方法中实现的方法[20,7]，在这些方法中，求解器跟踪子句的状态从‘满意’变为‘不满意’，反之亦然，只要变量的值发生变化

However, for any variable v this implies the solver to know all clauses where v resp. v' occurs in.

Since the two watched literals scheme was introduced [18] most CDCL based solvers do not maintain complete occurrence-lists of variables.译文：由于两个观察的文字方案被引入，所以大多数基于CDCL的求解器不维护变量的完整的事件发生列表。

快速知道当前赋值条件下哪些是unsat子句哪些是sat的子句（剩余的是未知结论子句）。

In this section we present a data structure that allows for a fast computation of the most frequently required information in the DMRP approach by simultaneously avoiding the maintenance of complete occurrence-lists.

译文：在本节中，我们将介绍一种数据结构，它允许在DMRP方法中快速计算最常用的信息，同时避免维护完整的事件发生列表。

3.1 Different States of Variables

In CDCL solvers each variable v can actually have three values: val(v) ∈ {true, f alse, unknown}. In general, any variable whose value is known has either been chosen as decision variable or its value was implied by BCP. To undo decisions and their implications both types of assignments (decisions and implications) are placed on a stack (often called trail) in the order they are assigned [6,3].

译文：为了撤销决策及其含义，两种类型的分配(决策和含义)都按照分配的顺序放在堆栈(通常称为trail)上[6,3]。

In the DMRP algorithm we introduce two different kinds of values expressed by the functions pval and tval: The DMRP algorithm maintains a reference point P which is an assignment to all the variables in the formula. Hence, for any variable v in the formula the reference point P either contains v or its negation v‘.

译文：在DMRP算法中，我们引入了pval和tval函数表示的两种不同的值:

译文：DMRP算法维护一个参考点P，它是公式中所有变量的赋值。译文：因此，对于公式中的任意变量v，参考点P要么包含v，要么包含其否定v '。

For a variable v we refer to its value in P by pval(v) ∈ {true, f alse}.译文：对于一个变量v，我们通过pval(v)∈{true, f alse}来表示它在P中的值。

The second kind of value tval(v) is used to state a temporary modification of pval(v).译文：第二类值tval(v)用于声明pval(v)的临时修改。

The default of tval(v) ∈ {true, f alse, ref} is ref which indicates that the corresponding variable is not affected by the current temporary modification of P and hence the value given by pval(v) is valid.译文：tval(v)∈{true, f alse, ref}的默认值是ref，表示对应的变量不受当前P临时修改的影响，因此pval(v)给出的值是有效的。

During the search for a modification of P to P‘ (line 14 of Algorithm 1) that reduces the set of unsatisfied clauses M to M’ ⊂ M the temporary value tval(v) ！= ref hides pval(v) for any variable v.译文：在搜索P到P '的一个修改(算法1第14行)，将不满足子句M的集合减少为M '⊂M时，临时值tval(v) != ref隐藏了任何变量v的pval(v)。

For any literal l with polarity b the function pval(l) (resp. tval(l)) is true iff pval(var(l)) = b (resp. tval(var(l)) = b) and it is tval(l) = ref iff tval(var(l)) = ref. 译文：对于任何极性为b的文字l，使用函数pval(l) (resp。如果pval(var(l)) = b (resp)，则tval(l))为真。tval(var(l)) = b)，它是tval(l) = ref iff tval(var(l)) = ref

3.2 Clauses Satisfied by the Reference Point

In addition to standard SAT solving the algorithm has to maintain a reference point P. Obviously, if all clauses Γ are satisfied by P the algorithm has found a model for the formula. Hence, for the remaining section we assume the set of clauses M that contains all clauses not satisfied by P to be non-empty

After any initialisation of P the set M can be computed by simply traversing Γ. However, whilst the algorithm tries to modify P in order to satisfy more clauses of M we have to keep track of those clauses in Γ \ M that become temporarily unsatisfied by a temporarily modified reference point. These clauses are put onto a stack D which is described further below. The first matter is how to compute the clauses that become unsatisfied by a modification of the reference point.

Similar to the concept of watched literals [18] for each clause C in Γ \ M we choose one literal l ∈ P to take on responsibility for C regarding its satisfiability by the current reference point P. By definition for any clause in Γ \ M at least one such literal l ∈ C has to exist with pval(l) = true. We say a literal l is responsible for a set of clauses R(l). Whenever the value of a variable v := var(l) changes from tval(v) = ref to ¬pval(v) all clauses in R(l) have to be traversed. For each clause C ∈ R(l) a new literal from the current (modified) reference point has to be found that takes on responsibility for C.

Note that - in addition to the responsibilities regarding the reference point - there are also two literals per clause that watch this clause in the sense of the usual two-watched-literal scheme [18]. This is necessary to notice whenever a temporary modification (tval) generates a unit clause or completely unsatisfies a clause. Let the set of clauses that are watched by a literal l be W(l). We examine this in more detail now.

Whenever the value of a variable is changed whilst searching for a modified reference point P (tval is changed) we have to take care of W(l) and R(l) of the corresponding literal l that became false under tval. When examining W(l) the usual three cases may happen for any affected clause C that is watched by l and any other literal lw ∈ C. For these cases only the values of tval act a part:

W.1 Another literal lj ∈ {C \ lw} with tval(lj ) = f alse can watch C. W.2 There is no other literal in {C \ lw} that is not false. Hence tval(lw) has to be set to true to satisfy C. W.3 If in the second case above tval(lw) is already set to f alse a conflicting assignment is generated and the algorithm jumps back to resolve the conflict.

The following update is done after the list W(l) was examined successfully:

R.1 If tval(l) equals pval(l) nothing has to be done.

R.2 tval(l) differs from pval(l) and another literal in C can be found to take responsibility for C. This might be any literal lj ∈ C for which it is tval(lj ) = ref and pval(lj) = true. In that case C is removed from R(l) and put into R(lj ). Or we might find a literal with tval(lj) = true. In that case C remains in R(l) since tval(lj ) was obviously assigned before the current modification of l in the reference point.

R.3 tval(l) differs from pval(l) but no other literal ∈ C satisfies C under the current temporary point. In that case C is put on the stack D that keeps track of all clauses that are not satisfied by the current temporary reference point. Note that since W(l) was examined first there are at least two literals li, lj ∈ C for which tval(li) = tval(lj) = ref and pval(li) = pval(lj ) = f alse. If this did not hold one of the cases W.2, W.3 or R.2 would apply

Note that this implementation (sketched in Algorithm 2) allows for backtracking without any updates of the sets R(l) of any literal l. The responsibility list1R(l) only has to be examined when tval of a variable changes from ref to true or f alse not for the opposite case. Moreover, the data structure is sound in the sense that no clause that becomes unsatisfied by P will be missed.

3.3 Keeping Track of Temporarily Unsatisfied Clauses

While trying to modify a reference point P to P to reduce the set M of clauses that are unsatisfied by P to M ⊂ M a data structure D is used to store those clauses that are unsatisfied by any temporary reference point Pt. In the subsection above we described when clauses are added to D. The data structure D has to meet three main demands:

– Clauses that are not satisfied by the current point Pt have to be found in reasonable time without having to traverse the clause’s literals at each lookup in the data structure.

– Backjumping over parts of the temporary modification (due to a conflict - see case W.3) has to be very fast with least possible overhead to update D.

– When a temporarily unsatisfied clause C is chosen from D by the decision procedure the set ΛC = {l ∈ C : tval(l) = ref} contains those literals whose value in Pt may possibly be modified to satisfy C (as stated in case R.3 above it is |ΛC | ≥ 2). Our data structure has to support finding that literal of ΛC which satisfies the most clauses in D.

Fast Backjumping.

The main data structure is depicted in Figure 1. Since the second issue above is fundamental we use a stack to realise D which has one entry pL for each decision level. Each entry itself basically points to a set of clauses L that became unsatisfied by the current point Pt at this level. In addition each L has a flag that indicates if the referred clauses still have to be considered to belong to D. This allows for very fast backjumping: For each level we jump back the according flag in L is set to false, the set of clauses in L is deleted and pL is popped from the stack. This means a negligible overhead compared to backjumping in CDCL solving. Note, that an entry pL which is removed from the stack does not destroy the corresponding set L which allows other data still to refer to L. These invalid references may be updated lazily later on. Another important advantage of this implementation will become evident further below.

Finding clauses not satisfied by Pt.

To find those clauses in D that still have to be satisfied by further modifications of the current point Pt the clause sets L that are referred by entries pL of the stack have to be traversed. Let this procedure be called findUnsat(D). We do not remove any satisfied clause C from any set L that is still flagged to be valid since this would require to put such clauses back into L whenever the satisfying modification to C is undone. Instead, we additionally cache a literal for each clause in L as a kind of representative.

Fig. 1. Basic data structure to represent D

Thus any entry in L (besides the flag) is an element LC which itself consists of a clause C and one representative literal l ∈ C. When a clause C in a set L is found to be satisfied by a temporary point Pt the representative literal l is set to that one which actually satisfies this clause (tval(l) = true).

When searching for unsatisfied clauses in D we first check the state of the representative literal before the entire clause is checked. On the one hand this guarantees that a satisfied clause is not checked twice unless a modification makes this necessary. On the other hand significant changes of Pt to the satisfiability state of any clause are not missed. The latter issue would require extra maintenance if only Boolean flags were used to mark satisfied clauses. To find clauses in D that are still unsatisfied by the current point Pt we traverse the stack from its top2 to its bottom. This prefers the most recently added clauses. The first clause that is found to be unsatisfied by Pt is taken as basis for the next branching decision.

Computation of the MakeCount of variables. Given a clause C∗ that is unsatisfied under the current point Pt the algorithm has to find that literal of ΛC∗ = {l ∈ C∗ : tval(l) = ref} which satisfies most clauses in D (or optionally D∪M) when its value in Pt is changed. To compute this so-called MakeCount of a variable we use another data structure that interplays with the above one. This data structure allows for lazy computation of the MakeCount of a variable and is basically organised as follows:

Each variable v that is not yet affected by the temporary modification of the reference point Pt (tval(v) = ref) is associated with a list Ωv of elements M. Each element represents a clause in D that can be satisfied by flipping the current value of v in the point Pt. Due to the laziness of the data structure it might be that an element M is out of date. More precisely (see Figure 2) each element M (representing a clause C) in a list of variable v consists of two fields:

The first field references the set L of clauses in which C is contained. The second field is an index into L that indicates the particular clause C (i.e. the according element LC) that can be satisfied by flipping the value of v in P_t.

Whenever case R.3 from above applies for a clause C a pointer to C and any representing literal are wrapped into an object LC. This data is appended to the set of clauses L which is referenced from the topmost entry on the stack D. At this point we also add an entry M to the Make-lists (Ωv) of all variables v of ΛC = {l ∈ C : tval(l) = ref}. We take a closer look at the different cases when computing the valid MakeCount from the possibly out-of-date information:

– It might be that an element M refers to a clause that has already been removed from D. In that case the flag of the structure L referenced by M has been invalidated during backtracking. Hence, this case can be realised immediately and M can also be deleted from the list.

– We can assume now that the clause C indirectly referenced by M is still contained in D. Let us assume for now that C is already satisfied by a further modification of the point Pt. Recall, that what we actually get from M is a reference to LC

- the clause C and a representing literal l of C. We can distinguish between two cases:

• C might have been already considered by the procedure findUnsat(D) to find unsatisfied clauses in D as described above. In that case findUnsat() has changed the representative literal l such that by checking the value (tval) of l in Pt we know that C is satisfied and we are done.

• If C was not considered by findUnsat(D) yet, the satisfiability state of the clause has to be computed by checking its literals. Given that C is satisfiable under Pt a literal that satisfies C will be found and will be made the representative literal for this clause in LC. This allows for fast detection of the satisfiability of C later on and will relieve findUnsat(D) from checking all literals of C. The representing literal guarantees that for each temporary point Pt there is at most one traversal through all literals of a clause to recognise that this clause is satisfied by Pt.

– In case C is not satisfied by the current point Pt this is recognised by a check of all literals in C.

The realisation of the set D and the data structure to compute MakeCounts of variables follows the concept of lazy data structures and avoids to store complete occurrence lists for literals. MakeCount lists do not require any update operation on backjumping. Even though indices in M become undefined when the referred set L is cleared during backjumping, this is not problematic since an index is only used after L is asserted to be still valid by its flag.

The size of a list Ωv of a variable v gives an upper bound Ω v on the valid MakeCount of v. Hence, to determine the variable with maximum MakeCount of a clause, the variables are traversed in descending order regarding Ω. This allows for early termination when Ω becomes smaller than the actual valid maximum MakeCount.

3.4 Learning

Two aspects that have to be considered for the realisation of the DMRP approach are related to learning (Algorithm 1, line 22 ) as mentioned in case W.3:

Whenever a unit clause C = (l) is learned the algorithm jumps back to decision level 0, assigns l to be true and propagates all implications of this assignment. This also requires a modification of the current reference point P to P with a difference to previously described modifications: The set of clauses M unsatisfied by P does not necessarily have to be a subset of M – the set of clauses unsatisfied by the previous reference point P.

Secondly, for any learned lemma that is generated when a conflict is analysed the data structure has to be updated properly. We use the following property.

Property: Any lemma generated by the function analyze-conflict in line 22 of Algorithm 1 contains at least one literal l with l ∈ P (pval(l) = true) regarding the current valid reference point P. In other words: No generated lemma extends3 the current set M.

Proof: We prove this property by the construction of learned lemmata. The surrounding function dmrpTryModifyPoint (lines 7, 14 of Algorithm 1) only considers clauses in Γ \M∪C where a modification of P is wanted that additionally satisfies C. Since C is always the base for the decision at the first decision level any temporary point Pt = P will always satisfy C during one execution of dmrpTryModifyPoint. Hence, C can never be an assign-reason to an assignment of a variable, since assign-reasons are clauses that become unit during the search. Thus, all assign-reasons are clauses from the set Γ \ M that, by definition, are satisfied by at least one literal ∈ P.

Running into a conflict means that for a clause C0 (conflicting clause) all literals are set to false. The lemma λ∗ is generated by recursively resolving out variables (that were no decisions) from the conflicting clause by using the according assign-reasons. C0 can be seen as the first version (λ0) of the generated lemma λ∗. Given that C0 ∈ Γ \ M one of the literals of λ0 is in P. Let l∗ be one literal ∈ λi with l∗∈P (pval(l∗) = true). If any literal l = l∗ is resolved out from λi the resolvent λi+1 still contains literal l∗. If on the other hand l∗ is resolved out by the use of its assign reason C∗, clause C∗ has to contain literal l∗. Since C∗ ∈ Γ \ M it also has to contain a literal l ∈ P. Moreover, with l∗∈P it is l = l∗ and the new resolvent λi+1 contains literal l ∈ P. By induction the final lemma contains at least one literal that is in P.

For any generated lemma λ∗ we chose that literal l ∈P∩λ∗ which was assigned at the highest decision level d (most recently). By the above property at least one such literal l has to exist. Literal l takes on responsibility for λ∗: R(l) ← R(l) ∪ {λ∗}. The functions in lines 24 and 25 of Algorithm 1 determine a new point Pt. If l /∈ Pt the lemma λ∗ is also appended to the list L that is referred by the stack D for decision level d and considered for the MakeCounts as described in the previous section 3.3. These two actions guarantee a proper update of the entire data structure and no more special treatments are needed.

4 Combining DMRP and CDCL to a Hybrid Solver

In Algorithm 1 we assume the initial reference point to be given from outside. In the original paper [9] reference points are chosen at random and are then tried to modify by a call to function solveDMRP. In case no result can be computed within a certain amount of time (i.e. number of conflicts) solveDMRP will be invoked with a new initial point. This is similar to local search restarts but with the difference that the DMRP algorithm itself carefully reasons on how to modify a reference point. However, the choices of initial points are crucial for the algorithm as presented in section 5.

As mentioned in section 2 CDCL solvers perform restarts quite frequently. At a restart activity values of variables or literals are kept and also a subset of the learned clauses is carried along for the next start. However, the current partial assignment (all literals in the trail) is almost completely rejected, even though phase saving keeps some information. This motivates a hybrid approach that reasonably alternates the CDCL and the DMRP algorithms. The DMRP approach offers a suitable possibility to take a closer look at the drawback of a partial assignment before it is rejected. It may focus on the not yet satisfied clauses.

Our recent implementation that is shown in Algorithm 3 combines both approaches by the use of the Luby et al. restart strategy [14] which proved itself successful in both theory and practice. The Luby strategy assumes that the algorithm does not have any external information and does not know when it is best to perform a restart. In that case the available computation time is shared almost equally among different restart strategies [14]. The function maxConflictCount in Algorithm 3 returns the number of conflicts for the next run due to the Luby strategy. That is the product of a constant factor f and the next number of the sequence (1, 1, 2, 1, 1, 2, 4, 1, 1, 2, 4, 8, 1,...) (see [14] for details).

The function chooseAlgo decides on which algorithm to use for the next run. On average we achieved the best results when running the DMRP algorithm exactly for the smallest conflict limit (when cl = f).

Since the DMRP algorithm requires a reference point i.e. an assignment to all variables the last partial assignment of the CDCL solver has to be extended to a complete assignment (extendPartialAssignmToRefPoint). This is done by continuing the previous CDCL search with the last partial assignment. However, within this execution only binary clauses are considered during search until all variables are assigned a value. This assignment constitutes the initial reference point for the DMRP algorithm. In this phase the solver may also realise that the formula is unsatisfiable. For the case the partial assignment is empty (at algorithm start) this function simply computes a reference point that satisfies all binary clauses. Taking care of binary clauses at first is also motivated by the work in [22] and [1] where the idea to primarily focus on binary clauses has also improved solving for some families of instances. This also guarantees an additional invariant for our data structure that a binary clause can neither be contained in the set M nor in the delta stack D (resp. its elements).

Some Adaptions for the Hybrid Approach

In addition to standard CDCL solving each clause of the formula is assigned an activity value initialised to zero at the beginning. Whenever a clause is involved in a conflict (i.e. it is used for resolution during the generation of a lemma) its activity value is increased. In some solvers (for instance [6]) this technique is common for learned clauses to clear the clause database of inactive learned clauses periodically. Our hybrid solver maintains an activity value for every clause.

This activity value of a clause is taken into account when the next clause from set M has to be chosen (line 6 of Algorithm 1) to be handled by the function dmrpTryModifyPoint. We always choose the clause with the highest activity value for the next attempt to modify the current reference point. However, if the call to dmrpTryModifyPoint times out or for two subsequent calls to solveDMRP the next clause with the second highest activity value is chosen.

In contrast to the original DMRP algorithm the conflict limit (timeout) for the function solveDMRP depends on the success of its subroutine dmrpTryModifyPoint in line 7 of Algorithm 1. If the current reference point could be improved the initial conflict limit is reset.

The solver also differs in the computation of the MakeCount of a variable. For the MakeCount one can count only the clauses currently in D to get the most local improvement or all clauses in D∪M can be considered to make decisions more globally. For variables that have the same MakeCount ties can be broken in favour of different issues which is explained in more detail in the next section.

5 Experiments and Evaluation

For the evaluation presented in this section we have run our solver for all industrial (application) instances of the SAT competitions resp. SAT Races of the years 2006 - 2009 that add up to 564 non-trivial4 instances. Each instance is preprocessed in advance and the timeout for the solvers was set to 1200 seconds. As a reference and also to check results we have run our CDCL solver using the Luby restart strategy (without DMRP) and MiniSAT 2.0 [6].

译文：每个实例都被预先预处理，并且求解器的超时设置为1200秒。作为参考和检查结果，我们使用Luby重启策略(没有DMRP)和MiniSAT 2.0运行了CDCL求解器

Figure 3 shows the results of different configurations of the hybrid approach. Furthermore, there are results that show performance of a pure DMRP solver. The presented configurations differ in the following issues that are related to decision making: As mentioned above the MakeCount may consider all clauses in D∪M (global) or only clauses in D (local). If two variables have the same MakeCount ties are broken in favour of the variable v that:

译文：图3显示了混合方法的不同配置的结果。译文：提出的配置在以下决策相关的问题上有所不同:

译文：如上所述，MakeCount可以考虑D∪M(全局)中的所有子句，也可以只考虑D(局部)中的子句；两个变量具有相同的MakeCount情况下，依照变量的下述属性区分：

(Act) has the highest activity value similar to the VSIDS heuristic [18].

译文：(Act) 具有最高的活动值，类似于VSIDS启发式[18]

(BC) has the smallest set R(v). This can be seen as a simple approximation of the BreakCount of the variable. In difference to the MakeCount the BreakCount of a variable v states the number of clauses that become unsatisfied by a flip of the value of variable v. 译文：这可以看作是对变量的BreakCount的简单近似。与MakeCount不同，变量v的BreakCount表示变量v的值翻转后不能满足的子句的数量。

(DC) was chosen least often for DMRP decisions. This avoids flipping always the same variables back and forth in different calls to solveDMRP. 译文：为DMRP决策选择最少的次数。这样就避免了在对solveDMRP的不同调用中总是来回翻转相同的变量。

The left plot of Figure 3 shows clearly that pure DMRP solving could not compete with CDCL solving. 译文：单纯的DMRP求解无法与CDCL求解相竞争。

Both pure DMRP configurations (global and local MakeCount) solve around 224 of 564 instances within 1200 seconds. Initial reference points are always chosen at random. Timeouts for the analysis of one reference point (one call to solveDMRP) are changed according to the Luby sequence. Modifying the strategy on how to choose initial reference points showed quite some impact. Our assumption was that DMRP requires a better guidance on where to start search and how to choose its initial reference points. That motivates our hybrid approach where DMRP gets its initial reference points indirectly from the CDCL solver. As the plots show this clearly improves the performance of the solver.

Fig. 3. The left plot compares DMRP, CDCL and our hybrid approach on 564 industrial benchmarks. The right plot compares CDCL and the hybrid approach on 51 instances from hardware verification. A point (x, y) states that x instances were solved within y sec. by that solver. Legends are ordered regarding the number of solved instances after 1200 seconds. Using local (resp. global) MakeCount and smaller decision count (resp. Activity or BreakCount) to break ties is indicated by [local MC > DC].

A previous version of our hybrid approach [12] has taken part in the SAT competition 2009. That version mainly differs from the presented one regarding the restart strategy and the choice of when to perform DMRP resp. CDCL solving. It also implemented a more extensive solving of particular subsets of clauses which is only done for binary clauses in this improved approach. However, the results indicate that the older version did not utilize the DMRP approach in a sufficient way. Compared to MiniSAT 2.0 our hybrid approach also performs much better. Admittedly, this is not only due to the hybridisation with DMRP. This version of MiniSAT does neither use the Luby restart strategy nor phase saving. However, the hybrid approach also clearly outperforms our CDCL implementation with Luby restarts and phase saving.

The hybrid configuration where the MakeCount is computed locally outperformes the other configurations. It is interesting to notice that using the activity of variables to break ties does not achieve the best results. It turns out that it is better to prefer variables that were flipped least often at the current call of solveDMRP.

The right plot of Figure 3 compares pure CDCL with the hybrid approach on the 51 “Velev” instances of last years SAT competitions. For these instances that stem from the domain of hardware verification the hybrid approach clearly outperforms pure CDCL by solving 8 more instances.

Even though the hybrid implementation beats our pure CDCL solver on the entire benchmark set it turns out that for the most instances solved by the hybrid solver the answer was given by the CDCL part. Only about 6% were finally solved by the DMRP subsolver. Moreover, the improvement due to the hybridisation was mainly for unsatisfiable instances (17 more unsat results).

Our conjecture about this phenomenon is that DMRP generates some important lemmata: When the CDCL solver reaches the (current) maximum number of conflicts it delivers work to the DMRP solver. DMRP starts with an extension R of the last partial assignment P of the CDCL solver and hence focuses on a nearby part of the search space. When analysing this part it purposely examines clauses M that are not satisfied by R. In CDCL these clauses in M could likely become conflicting clauses if decisions were made similar to the values in R. Up to a certain point phase saving would do this after a normal CDCL restart. However, DMRP immediately considers clauses in M for search and resolution.

6 Conclusion

In this paper we have presented a data structure to implement the DMRP approach in an efficient way. Similar to the two-watched-literals scheme we choose one literal for each clause. The literal takes on responsibility so that a clause which is satisfied by a reference point is also satisfied by a modification of the point.

Moreover, we present a way how to determine that variable which satisfies the most previously unsatisfied clauses when its value is flipped (MakeCount). Based on this implementation we motivate a hybrid SAT solver that combines CDCL and DMRP solving.

Experiments have shown that our hybrid approach is competetive to the highly optimised state-of-the-art CDCL solvers and that the maintenance of complete assignments may definitely turn to account.

译文：在本文中，我们提出了一种数据结构来有效地实现DMRP方法。译文：与两种文字模式相似，我们为每个子句选择一个文字。译文：文字承担了责任，因此一个由参考点满足的子句也由该点的修改满足。

译文：此外，我们提出了一种方法来确定变量在其值被翻转时最满足之前未满足的子句(MakeCount)。译文：在此基础上，我们提出了一个结合CDCL和DMRP求解的混合SAT求解器。

译文：维持完成的赋值肯定会成为问题的关键。

References

1.

Bacchus, F.: Exploring the computational tradeoff of more reasoning and less searching. In: SAT 2002, pp. 7–16 (2002)Google Scholar
2.

Biere, A.: Adaptive restart strategies for conflict driven SAT solvers. In: Kleine Büning, H., Zhao, X. (eds.) SAT 2008. LNCS, vol. 4996, pp. 28–33. Springer, Heidelberg (2008)CrossRefGoogle Scholar
3.

Biere, A.: Picosat essentials. JSAT 4, 75–97 (2008)zbMATHGoogle Scholar
4.

Davis, M., Logemann, G., Loveland, D.: A machine program for theorem-proving. ACM Commun. 5(7), 394–397 (1962)MathSciNetCrossRefzbMATHGoogle Scholar
5.

Davis, M., Putnam, H.: A computing procedure for quantification theory. J. ACM 7(3), 201–215 (1960)MathSciNetCrossRefzbMATHGoogle Scholar
6.

Eén, N., Sörensson, N.: An extensible SAT-solver. In: Giunchiglia, E., Tacchella, A. (eds.) SAT 2003. LNCS, vol. 2919, pp. 502–518. Springer, Heidelberg (2004)CrossRefGoogle Scholar
7.

Fukunaga, A.S.: Efficient Implementations of SAT Local Search. In: SAT (2004)Google Scholar
8.

Goldberg, E.: Determinization of resolution by an algorithm operating on complete assignments. In: Biere, A., Gomes, C.P. (eds.) SAT 2006. LNCS, vol. 4121, pp. 90–95. Springer, Heidelberg (2006)CrossRefGoogle Scholar
9.

Goldberg, E.: A decision-making procedure for resolution-based SAT-solvers. In: Kleine Büning, H., Zhao, X. (eds.) SAT 2008. LNCS, vol. 4996, pp. 119–132. Springer, Heidelberg (2008)CrossRefGoogle Scholar
10.

Ivancic, F., Yang, Z., Ganai, M., Gupta, A., Ashar, P.: Efficient SAT-based bounded model checking for software verification. Theoretical Computer Science 404(3) (2008)Google Scholar
11.

Kautz, H.A., Selman, B.: Planning as satisfiability. In: Proceedings of the Tenth European Conference on Artificial Intelligence ECAI 1992, pp. 359–363 (1992)Google Scholar
12.

Kottler, S.: Solver descriptions for the SAT competition (2009), satcompetition.org
13.

Küchlin, W., Sinz, C.: Proving consistency assertions for automotive product data management. J. Automated Reasoning 24(1-2), 145–163 (2000)CrossRefzbMATHGoogle Scholar
14.

Luby, M., Sinclair, A., Zuckerman, D.: Optimal speedup of las vegas algorithms. In: ISTCS, pp. 128–133 (1993)Google Scholar
15.

Lynce, I., Marques-Silva, J.: SAT in bioinformatics: Making the case with haplotype inference. In: Biere, A., Gomes, C.P. (eds.) SAT 2006. LNCS, vol. 4121, pp. 136–141. Springer, Heidelberg (2006)CrossRefGoogle Scholar
16.

Marques-Silva, J.P.: Practical Applications of Boolean Satisfiability. In: Workshop on Discrete Event Systems, WODES 2008 (2008)Google Scholar
17.

Marques-Silva, J.P., Sakallah, K.A.: Grasp: A search algorithm for propositional satisfiability. IEEE Trans. Comput. 48(5), 506–521 (1999)MathSciNetCrossRefGoogle Scholar
18.

Moskewicz, M.W., Madigan, C.F., Zhao, Y., Zhang, L., Malik, S.: Chaff: engineering an efficient SAT solver. In: DAC (2001)Google Scholar
19.

Pipatsrisawat, K., Darwiche, A.: A lightweight component caching scheme for satisfiability solvers. In: Marques-Silva, J., Sakallah, K.A. (eds.) SAT 2007. LNCS, vol. 4501, pp. 294–299. Springer, Heidelberg (2007)CrossRefGoogle Scholar
20.

Selman, B., Levesque, H., Mitchell, D.: A new method for solving hard satisfiability problems. In: Tenth National Conference on Artificial Intelligence (1992)Google Scholar
21.

Velev, M.N.: Using rewriting rules and positive equality to formally verify wide-issue out-of-order microprocessors with a reorder buffer. In: DATE 2002 (2002)Google Scholar
22.

Zheng, L., Stuckey, P.J.: Improving SAT using 2SAT. In: Proceedings of the 25th Australasian Computer Science Conference, pp. 331–340. E (2002)Google Scholar

posted on 2020-06-30 18:47 海阔凭鱼跃越阅读(443) 评论(0) 收藏举报

刷新页面返回顶部