[Haskell-cafe] ANN: unification-fd 0.7.0

wren ng thornton Mon, 19 Mar 2012 00:05:17 -0700

--------------------------------------------
-- unification-fd 0.7.0
--------------------------------------------

The unification-fd package offers generic functions for single-sortedfirst-order structural unification (think of programming in Prolog, orof the metavariables in type inference)[1][2]. The library *is*sufficient for implementing higher-rank type systems a la [Peyton Jones,Vytiniotis, Weirich, Shields], but bear in mind that unificationvariables are the metavariables of type inference--- not the type-variables.

An effort has been made to make the package as portable as possible.However, because it uses the ST monad and the mtl-2 package it can't beH98 nor H2010. However, it only uses the following common extensionswhich should be well supported[3]:


    Rank2Types
    MultiParamTypeClasses
    FunctionalDependencies -- Alas, necessary for type inference
    FlexibleContexts       -- Necessary for practical use of MPTCs
    FlexibleInstances      -- Necessary for practical use of MPTCs
    UndecidableInstances   -- For Show instances due to two-level types


--------------------------------------------
-- Changes (since 0.6.0)
--------------------------------------------

This release is another major API breaking release. Apologies, butthings are a lot cleaner now and hopefully the API won't break again fora while. The biggest change is that the definition of terms has changedfrom the previous:


    data MutTerm v t
        = MutVar  !v
        | MutTerm !(t (MutTerm v t))

To the much nicer:

    data UTerm t v
        = UVar  !v
        | UTerm !(t (UTerm t v))

The old mnemonic of "mutable terms" was inherited from the code'sprevious life implementing a logic programming language; but when I wasplaying around with implementing a type checker I realized that thenames don't really make sense outside of that original context. So thenew mnemonic is "unification terms". In addition to being a bit shorter,it should help clarify the separation of concerns (e.g., betweenunification variables vs lambda-term variables, type variables, etc.).

The swapping of the type parameters is so that UTerm can have instancesfor Functor, Monad, etc. This change should've been made along with there-kinding of variable types back in version 0.6.0, since the UTerm typeis the free monad generated by t. I've provided all the categorytheoretic instances I could imagine some plausible reason for wanting.Since it's free, there are a bunch more I haven't implemented since theydon't really make sense for structural terms (e.g., MonadTrans,MonadWriter, MonadReader, MonadState, MonadError, MonadCont). If you cancome up with some compelling reason to want those instances, I can addthem in the future.

Since the order of type parameters to BindingMonad, UnificationFailure,Rank, and RankedBindingMonad was based on analogy to the order forterms, I've also swapped the order in all of them for consistency.

I've removed the eqVar method of the Variable class, and instead addedan Eq superclass constraint. Again, this should've happened with there-kinding of variables back in version 0.6.0. A major benefit of thischange is that now you can use all those library functions which requireEq (e.g., many of the set-theoretic operations on lists, like (\\) andelem).

I've added new functions: getFreeVarsAll, applyBindingsAll, freshenAll;which are like the versions without "All", except they're lifted tooperate over Foldable/Traversable collections of terms. This is crucialfor freshenAll because it allows you to retain sharing of variablesamong the collection of terms. Whereas it's merely an optimization forthe others (saves time for getFreeVarsAll, saves space forapplyBindingsAll).

The type of the seenAs function has also changed, to ensure thatvariables can only be seen as structure rather than as any UTerm.


Thanks to Roman Cheplyaka for suggesting many of these changes.


--------------------------------------------
-- Description
--------------------------------------------

The unification API is generic in the type of the structures beingunified and in the implementation of unification variables, followingthe two-level types pearl of Sheard (2001). This style mixes well withSwierstra (2008), though an implementation of the latter is not includedin this package.

That is, all you have to do is define the functor whose fixed-point isthe recursive type you're interested in:


    -- The non-recursive structure of terms
    data S a = ...

    -- The recursive term type
    type PureTerm = Fix S

And then provide an instance for Unifiable, where zipMatch performs onelevel of equality testing for terms and returns the one-level spinefilled with pairs of subterms to be recursively checked (or Nothing ifthis level doesn't match).


    class (Traversable t) => Unifiable t where
        zipMatch :: t a -> t b -> Maybe (t (a,b))

The choice of which variable implementation to use is defined bysimilarly simple classes Variable and BindingMonad. We store thevariable bindings in a monad, for obvious reasons. In case it's notobvious, see Dijkstra et al. (2008) for benchmarks demonstrating thecost of naively applying bindings eagerly.

There are currently two implementations of variables provided: one basedon STRefs, and another based on a state monad carrying an IntMap. Theformer has the benefit of O(1) access time, but the latter is plentyfast and has the benefit of supporting backtracking. Backtracking itselfis provided by the logict package and is described in Kiselyov et al.(2005).

In addition to this modularity, unification-fd implements a number ofoptimizations over the algorithm presented in Sheard (2001)--- which isalso the algorithm presented in Cardelli (1987).

* Their implementation uses path compression, which we retain. Though wemodify the compression algorithm in order to make sharing observable.

* In addition, we perform aggressive opportunistic observable sharing, apotentially novel method of introducing even more sharing than isprovided by the monadic bindings. Basically, we make it so that we canuse the observable sharing provided by the modified path compression asmuch as possible (without introducing any new variables).

* And we remove the notoriously expensive occurs-check, replacing itwith visited-sets (which detect cyclic terms more lazily and without theasymptotic overhead of the occurs-check). A variant of unification whichretains the occurs-check is also provided, in case you really need tofail fast.

* Finally, a highly experimental branch of the API performs *weighted*path compression, which is asymptotically optimal. Unfortunately, thecurrent implementation is quite a bit uglier than the unweightedversion, and I haven't had a chance to perform benchmarks to see how theconstant factors compare. Hence moving it to an experimental branch.

These optimizations pass a test suite for detecting obvious errors. Ifyou find any bugs, do be sure to let me know. Also, if you happen tohave a test suite or benchmark suite for unification on hand, I'd loveto get a copy.



--------------------------------------------
-- Notes and limitations
--------------------------------------------

[1] At present the library does not appear amenable for implementinghigher-rank unification itself; i.e., for higher-ranked metavariables,or higher-ranked logic programming. To be fully general we'd have toabstract over which structural positions are co/contravariant, whetherthe unification variables should be predicative or impredicative, aswell as the isomorphisms of moving quantifiers around. It's on my todolist, but it's certainly non-trivial. If you have any suggestions, feelfree to contact me.

[2] At present it is only suitable for single-sorted (aka untyped)unification, a la Prolog. In the future I aim to support multi-sorted(aka typed) unification, however doing so is complicated by the factthat it can lead to the loss of MGUs; so it will likely be offered as analternative to the single-sorted variant, similar to how the weightedpath-compression is currently offered as an alternative.

[3] With the exception of fundeps which are notoriously difficult toimplement. However, they are supported by Hugs and GHC 6.6, so I don'tfeel bad about requiring them. Once the API stabilizes a bit more I planto release a unification-tf package which uses type families instead,for those who feel type families are easier to implement or use. Therehave been a couple requests for unification-tf, so I've bumped it up onmy todo list.



--------------------------------------------
-- References
--------------------------------------------

Luca Cardelli (1987) /Basic polymorphic typechecking/.
    Science of Computer Programming, 8(2):147--172.

Atze Dijkstra, Arie Middelkoop, S. Doaitse Swierstra (2008)
    /Efficient Functional Unification and Substitution/,
    Technical Report UU-CS-2008-027, Utrecht University.
    <http://www.cs.uu.nl/research/techreps/repo/CS-2008/2008-027.pdf>

Simon Peyton Jones, Dimitrios Vytiniotis, Stephanie Weirich, Mark
    Shields /Practical type inference for arbitrary-rank types/,
    to appear in the Journal of Functional Programming.
    (Draft of 31 July 2007.)

Oleg Kiselyov, Chung-chieh Shan, Daniel P. Friedman, and
    Amr Sabry (2005) /Backtracking, Interleaving, and/
    /Terminating Monad Transformers/, ICFP.
    <http://www.cs.rutgers.edu/~ccshan/logicprog/LogicT-icfp2005.pdf>

Tim Sheard (2001) /Generic Unification via Two-Level Types/
    /and Paramterized Modules/, Functional Pearl, ICFP.
    <http://web.cecs.pdx.edu/~sheard/papers/generic.ps>

Tim Sheard & Emir Pasalic (2004) /Two-Level Types and/
    /Parameterized Modules/. JFP 14(5): 547--587. This is
    an expanded version of Sheard (2001) with new examples.
    <http://web.cecs.pdx.edu/~sheard/papers/JfpPearl.ps>

Wouter Swierstra (2008) /Data types a la carte/, Functional
    Pearl. JFP 18: 423--436.
    <http://www.cs.ru.nl/~wouters/Publications/DataTypesALaCarte.pdf>


--------------------------------------------
-- Links
--------------------------------------------

Homepage:
    http://code.haskell.org/~wren/

Hackage:
    http://hackage.haskell.org/package/unification-fd

Darcs:
    http://community.haskell.org/~wren/unification-fd

Haddock (Darcs version):

http://community.haskell.org/~wren/unification-fd/dist/doc/html/unification-fd

--
Live well,
~wren

_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

[Haskell-cafe] ANN: unification-fd 0.7.0

Reply via email to