Specializing U for the useless clause problem

At first sight, it seems that plain algorithm U suffices in flagging useless clauses. Indeed, one hardly sees what additional, concise and useful, information could be given to programmers, whose expected reaction is to suppress the useless clause before recompiling. However or-patterns introduce their specific anomaly, which is related to the useless clause anomaly but does not reduce to it.

6.1 Useless clause is (almost) enough

Let us assume that we write a function to detect lists of type mylist (Section 2) whose first element is 1.

Intuitively, something is wrong: the last pattern looks too complicated. Indeed, the following code is more concise and equivalent.

Unfortunately, algorithm U silently accepts the first, “bad”, code. A good compiler should suggest that we might replace this bad code by the second, “good”, code.

In fact, algorithm U already does such a suggestion in the case of the following, “bad”, code, where the or-pattern is expanded.

Here, the compiler can tell us that the last two clauses are useless and we normally react by deleting them.

Clause Nil | One _ | Cons (_,_) -> false is useful because of pattern Nil. However, patterns One _ and Cons (_,_) are useless, since they can be deleted without altering f behavior. Moreover, a more positive definition of useless patterns is easily built upon the standard notion of useless clause by expanding or-patterns.

On the practical side, the Objective Caml compiler will here flag two useless patterns (and no useless clause):

6.2 Expansion of or-patterns

Because there can be many or-patterns, it is our interest to consider the expansion of exactly one or-pattern amongst many, so as to avoid producing code of exponential size. Consider function f below.

And we can safely assert that patterns 2k−1 and 2k are useful by using known algorithm U, which does not exhibit exponential behavior here, provided that we compute disjunctions sequentially.

Expansion considers that, in or-pattern (p₁∣p₂), the left alternative p₁ has a higher priority than the right alternative p₂. This left-to-right bias allows a clear decision in the following boundary examples.

Expansion shows in what sense the right alternative of or-patterns is useful for f1 and useless in the remaining cases.

It may seem that giving good diagnostics forces us into reconsidering the definition of matching, which does not specify any order for trying to match or-pattern arguments (except for Haskell matching). In fact, for strict and Laville’s matching, we still can avoid specifying such an order: those definitions of pattern matching rely on what row is matched and not on how it is matched. However, in practice, for the sake of consistency between diagnostics and produced code (because of variables in or-patterns, execution can indeed reveal the matched alternative), the pattern matching compiler must take left-to-right order into account. This is easily done by the (strict) compiler of Le Fessant & Maranget [2001], which performs the expansion during pattern matching compilation, as by any compiler that features or-patterns by performing expansion before pattern matching compilation such as the SML/NJ compiler Appel and MacQueen [1991].

In the next section we describe a refinement of our algorithm U. This refinement aims at finding useless patterns and relies on the expansion of or-patterns. As a preliminary, we first note that expansion does not alter the output of algorithm U.

Proof: For Haskell matching, the equality follows from definitions — H((r₁∣r₂), v₁) = T, if and only if H(r₁, v₁) = T or H(r₁, v₁) = F ∧ H(r₂, v₁) = T. □

6.3 Rules for finding useless patterns

It is certainly easier to first consider finding useless sub-patterns in the case of one or-pattern. Let P be a pattern matrix and let q^→ be a pattern vector, with q₁ being the or-pattern (r₁∣r₂). We wish to make a distinction between four possibilities, r₁ and r₂ are both useful, r₁ alone is useless, r₂ alone is useless, and both r₁ and r₂ are useless. More concretely, we design a new function U′(P,q^→ ) that returns a set of useless patterns (more exactly a set of useless pattern positions); that is, ∅ in the first case, {r₁} in the second case, {r₂} in the third case, and the distinguished set ⊤ in the fourth case.

Where P @ q^→ ′ means adding row q^→ ′ to the bottom of matrix P. Then, we use U to compute the utility of both expansions and we write E_r1 and E_r2 for the results of these computations, logically encoding True by ∅ and False by ⊤ (E sets are sets of useless patterns). Then we combine E_r1 and E_r2 into E_q1 by the rules given in the left table of Figure 1.

If r₁ and r₂ are themselves or-patterns, we would like to compute the utility of their arguments. To do so, U′ is called recursively. As a consequence, results other than ∅ and ⊤ are possible, when r₁ or r₂ are partially useless. We combine those new results as described in the second table of figure 1. Those extra rules complete the definition of pattern utility by expansion. As an example of such nested expansions, assume q₁ = ((r₁∣r₂)∣(r₃∣r₄)), the utility of r₄ is computed on the expansion consisting of matrixP @ ((r₁∣r₂) q₂⋯q_n) @ (r₃ q₂⋯q_n) and vector (r₄ q₂⋯q_n).

If several components of q^→ are or-patterns, we perform several independent expansions. For instance, let us assume that both q₁ and q₂ are or-patterns, we first proceed as described above, yielding one result E_q1. Then, we expand P and q^→ along their second column. Such an expansion can be defined easily as the composition of swapping the first two columns of P and q^→ and of the expansion introduced at the beginning of this section. This process yields another result E_q2.

We now need to combine the two results E_q1 and E_q2. Let us first consider the case when neither E_q1 nor E_q2 is ⊤. Then, those two results are (possibly empty) sets of useless patterns which we combine by set union, yielding the new result E_q1 ∪ E_q2. Let us now consider the case when E_q1 is ⊤. We assume, as we show later, that U′ is a conservative extension of U. That is U′(P, q^→ ) = ⊤, if and only if U(P, q^→ ) = False. Hence, E_q1 is ⊤ implies U(P, q^→ ) is False — i.e. q^→ is useless w.r.t. P. However, swapping two columns in P (and q^→) does not change U result (Lemma 3). Thus we also have E_q2 = ⊤. Conversely, if E_q2 is ⊤, then E_q1 necessarily is ⊤. Overall, whether expansion is performed along first or second column does not matter and the value of U′(P, q^→ ) should be ⊤. As a conclusion, we can define the combination of the utility of two disjoint or-patterns to be E_q1 ∪ E_q2, provided we adopt the extra definition ⊤ ∪ ⊤ = ⊤.

6.4 Computation of useless patterns

We now give a precise description of algorithm U′, as implemented in the Objective Caml compiler. The key idea is to use specialization (S(c, P) of Section 3.1) as a tool to discover or-patterns, before performing expansions as we did in the previous section. In practice, it is convenient to partition the columns of matrices and vectors into three subparts. We note those separations with “•”. That is, U′ takes such “dotted” matrices and vectors as arguments, written P • Q • R and p^→ • q^→ • r^→. Dotted matrices and vectors stand for triples of matrices and vectors. Later in this section, component q^→ will hold patterns that cannot contain or-patterns (i.e. wildcards), while all the components of r^→ will be or-patterns.

Dotted matrices and vectors define matchings in the ordinary sense, provided we erase the dots. More precisely we concatenate the subparts column-wise, written “&”, and consider U(P &Q & R, p^→ &q^→ &r^→). This new notation emphasizes the distinction between column-wise (or vertical) concatenation and row-wise (or horizontal) concatenation, which we write “@”.

Figure 2 defines some useful operations on dotted matrices. It is assumed that sub-matrix P has n columns (n > 0).

Informally, the first phase of algorithm U′ destructures the patterns of p^→ (using S from figure 2), looking for or-patterns. When or-patterns are found, the corresponding columns are transferred to the R subpart (using ⇒₂), ready for the expansion phase. Other columns are transferred to the Q subpart (using ⇒₁).

To compute the utility of clause number i in match … with p₁ -> e₁ | p₂ -> e₂ | … | p_m -> e_m, we perform the initial call

The typical call U′(P • Q • R, p^→ • q^→ • r^→) yields four situations. First three situations are the “search for or-patterns” phase and apply when P has columns.

We now prove that the new algorithm U′ is a conservative extension of the original algorithm U.

Proof: We prove the following stronger property.

U′(P • Q • R,

→

•

→

•

→

) = ⊤ ⇐⇒ U(P &Q &R,

→

) = False

Proof is by induction on the definition of U′. Most cases are obvious, case 4-(a) is the base case, inductive cases 2. and 3. follow from Lemma 3 on the irrelevance of column order, while inductive case 1. is like inductive case 1. in Proposition 2.

Case 4-(b) (P is empty, R is not empty) is the most interesting. We first consider one expansion. In order to simplify notations a bit, we define S = Q &R and s^→ = q^→ &r^→. Furthermore, we express the expansion of r_j = (t₁∣t₂) as follows.

E_t1 = U′(P′ • Q′ • , (t₁) •

→

′ • ), E_t2 = U′(P″ • Q″ • , (t₂) •

→

′ • )

Where it should be clear that P″ is P′@(t₁) and Q″ is Q′ @ q^→ ′. We further define S′ to be P′ &Q′ and S″ to be P″ &Q″. Notice that S″ is S′ with the row (t₁) &q^→ ′ added. Let also s^→ ′ be the vector (t₁∣t₂) &q^→ ′. Matrix S′ and vector s^→ ′ are the images of S and s^→ by the same permutation of columns. By lemmas 3 and 4, we have:

U(S,

→

) = U(S′,

→

′) = U(S′, (t₁) &

→

′) ∨ U(S″, (t₂) &

→

′).

By induction, we have the following two equivalences.

E_t1 = ⊤ ⇐⇒ U(S′, (t₁) &

→

′) = False E_t2 = ⊤ ⇐⇒ U(S″, (t₂) &

→

′) = False

Then, since E_(t1∣t2) = ⊤ if and only if E_t1 = ⊤ and E_t2 = ⊤, we have:

E_rj = ⊤ ⇐⇒ U(S,

→

) = False.

And we can conclude, since U′( • Q • R, • q^→ • r^→) = E_r1 ∪ ⋯ ∪ E_rz. □

One can make the computation of U′ slightly more efficient by the following two techniques:

6 Specializing U for the useless clause problem

6.1 Useless clause is (almost) enough

6.2 Expansion of or-patterns

6.3 Rules for finding useless patterns

6.4 Computation of useless patterns