Re: [Haskell-cafe] What's this pattern called?

wren ng thornton Thu, 22 Oct 2009 21:27:54 -0700

Martijn van Steenbergen wrote:

Bonjour café,
data ExprF r
  =  Add  r  r
  |  Sub  r  r
  |  Mul  r  r
  |  Div  r  r
  |  Num  Int
This is a well-known pattern that for example allows nice notation ofmorphisms. But what is it called? I've heard fixed-point view, opendatatypes and some others, but I'm curious where this pattern comes upin literature and what it is called there.

This is an example of "open recursion", which is when you take somerecursive function/datatype and rewrite it without recursion by passingthe function/type in as an argument to itself. It's the datatypeequivalent of doing:


    fibF _ 0 = 0
    fibF _ 1 = 1
    fibF f n = f(n-1) + f(n-2)

    fib = fix fibF

Which can be useful for functions because we can use a differentfixed-point operator, e.g. one that adds memoization abilities or otherfeatures in addition to the recursion.

As others've mentioned, the open-recursive version of a recursive datatype happens to be a "functor". Or rather, the recursive type happens tobe isomorphic to the least fixed point of a generating functor[1][2]because the functor is also, in the terms of recursion theory, an"initial algebra". Part of why this pattern is so nice comes from thefact that it's a functor (so we can use fmap to apply a function one plydown), but part of it also comes from the isomorphism of using anexplicit fixed-point operator (which allows us to un-fix the type and dothings like storing the accumulators of a fold directly in the normalconstructors, rather than needing to come up with an ad-hoc isomorphicset of constructors[3]), and the fact that it's an initial algebra tiesthese two things together nicely.

This is also an example of Tim Sheard's "two-level types", albeit atrivial one since the fixed-point operator doesn't add anything otherthan recursion. One of the particular ideas behind Sheard's two-leveltypes is that we can split the original recursive type in a differentplace where one of the levels contains some constructors and the otherlevel contains other constructors. This can be helpful when you have afamily (informally speaking) of similar types, as for example withimplementing unification. All types that can be unified shareconstructors for unification variables; but the constructors for thestructural components of the type are left up to another level. Thus wecan reuse the variable processing code for unifying different types, andalso be modular about the type being unified.

[1] This should be somewhat obvious if you're familiar with theinductive phrasing of constructing the set of all values for some type.E.g. "Basis: [] is a list. Induction: (:) takes a value and a list intoa list". So we have some functor and we keep applying it over and overto generate the set of all values, building up from the base cases.

[2] Do note that in Haskell the least fixed point and the greatest fixedpoint coincide. Technically, whether the least or greatest fixed pointis used depends on the construction (e.g. catamorphisms use least,anamorphisms use greatest). This is also related to the topic of"codata" which is the fixed point of a terminal coalgebra.

[3] Data.List.unfoldr is a prime example of an ad-hoc isomorphic set ofconstructors. Instead of the current type, we could instead use animplementation where:


    newtype Fix   f   = Fix { unFix :: f (Fix f) }
    data    ListF a r = Nil | Cons a r
    type    List  a   = Fix (ListF a)

    unfoldr :: (b -> ListF a b) -> b -> List a
    unfoldr = ...

which is a bit more obviously correlated with anamorphisms in recursiontheory.


--
Live well,
~wren
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] What's this pattern called?

Reply via email to