Re: [Haskell-cafe] about Haskell code written to be "too smart"

wren ng thornton Wed, 25 Mar 2009 14:57:30 -0700

Dan Weston wrote:

> So to be clear with the terminology:
>
> inductive   = good consumer?
> coinductive = good producer?
>
> So fusion should be possible (automatically? or do I need a GHC rule?) with
>   inductive . coinductive
>
> Or have I bungled it?

Not quite. Induction means starting from base cases and building things upwards from those. Coinduction is the dual and can be thought of as starting from the ceiling and building your way downwards (until you hit the base cases, or possibly forever).

So, if you have potentially infinite data (aka co-data) coming in, then you need to use coinduction because you may never see the basis but you want to make progress anyways. In formal terms, coinduction on co-data gives the same progress guarantees as induction on data, though termination is no longer a conclusion of progress (since coinduction may produce an infinite output from infinite input).

Haskell doesn't distinguish data and co-data, but you can imagine data as if all the data constructors are strict, and co-data as if all the constructors are lazy. Another way to think of it is that finite lists (ala OCaml and SML) are data, but streams are co-data.
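
To make the strict/lazy picture concrete, here is a minimal sketch. The names List, Stream, lengthL, fromList, nats, and takeS are made up for illustration and come from no library, and the strictness annotations only approximate "data", since Haskell itself draws no such line.

```haskell
-- "Data": strict fields, so a fully built value is always finite.
data List a = Nil | Cons !a !(List a)

-- "Co-data": a lazy constructor, so a value may be infinite.
data Stream a = SCons a (Stream a)

-- Induction: structural recursion that bottoms out at the base case Nil.
lengthL :: List a -> Int
lengthL Nil         = 0
lengthL (Cons _ xs) = 1 + lengthL xs

-- Only terminates on finite input, because Cons forces its tail.
fromList :: [a] -> List a
fromList = foldr Cons Nil

-- Coinduction: produce an infinite Stream one constructor at a time.
nats :: Stream Integer
nats = go 0
  where go n = SCons n (go (n + 1))

-- Observe only a finite prefix of the stream.
takeS :: Int -> Stream a -> [a]
takeS n (SCons x xs)
  | n <= 0    = []
  | otherwise = x : takeS (n - 1) xs
```

Loaded into GHCi, takeS 3 nats makes progress even though nats never reaches a base case, whereas lengthL only makes sense because a List is finite.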

For fusion there's the build/fold type and its dual unfold/destroy, where build/unfold are producers and fold/destroy are consumers. To understand how fusion works, let's look at the types of build and fold.


    GHC.Exts.build      :: (forall b. (a -> b -> b) -> b -> b) -> [a]
    flip (flip . foldr) :: [a] -> ( (a -> b -> b) -> b -> b )

Together they give an isomorphism between lists as an ADT [a] and as a catamorphism (forall b. (a -> b -> b) -> b -> b), aka Church encoding. When we have build followed by foldr, we can remove the intermediate list and pass the F-algebra down directly:


    foldr cons nil (build k) = k cons nil
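
As a rough illustration of the isomorphism and of the rule above, the following sketch spells out both directions. Only build (from GHC.Exts) and foldr are real library functions; toChurch, fromChurch, upTo, sumViaList, and sumFused are hypothetical names introduced here.

```haskell
{-# LANGUAGE RankNTypes #-}
import GHC.Exts (build)

-- List -> Church encoding: this is flip (flip . foldr) written out.
toChurch :: [a] -> (forall b. (a -> b -> b) -> b -> b)
toChurch xs cons nil = foldr cons nil xs

-- Church encoding -> list.
fromChurch :: (forall b. (a -> b -> b) -> b -> b) -> [a]
fromChurch k = build k

-- A producer written directly in Church form (hypothetical example):
-- it "conses" 1..n onto nil without mentioning (:) or [].
upTo :: Int -> (forall b. (Int -> b -> b) -> b -> b)
upTo n cons nil = go 1
  where
    go i | i > n     = nil
         | otherwise = i `cons` go (i + 1)

sumViaList, sumFused :: Int -> Int
sumViaList n = foldr (+) 0 (build (upTo n))  -- left-hand side of the rule
sumFused   n = upTo n (+) 0                  -- right-hand side of the rule
```

Both sums compute the same value; the second simply never goes through a list.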

For unfold/destroy fusion the idea is the same except that we use unfold (an anamorphism on the greatest fixed point) instead of fold (a catamorphism on the least fixed point). The two fixed points coincide in Haskell.

Since GHC does build/fold fusion, "good producer" requires that the function was written using build, and "good consumer" requires it's written using foldr. Using these functions allows us to apply the rule, though it's not sufficient for "good fusion". Why the functions have the particular types they do and why this is safe has to do with induction and coinduction, but the relationship isn't direct.

The reason a coinductive function is easy to make into a good producer has to do with that relationship. Take a canonically coinductive function like


    f []     = []
    f (x:xs) = x : f xs

Once we've made one step of recursion, we've generated (x:) and then have a thunk for recursing. Most important is that no matter how we evaluate the rest of the list, the head of the return value is already known to be (:), thus we can get to WHNF after one step. Whatever function is consuming this output can then take x and do whatever with it, and then pull on f xs, which then takes a single step and returns (x':) along with a thunk f xs'. Because all of those (:) are being produced immediately, it's easy to abstract it out as a functional argument; thus we can use build.
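
A small sketch of that one-step behaviour, using the f from above. The name headAfterOneStep is hypothetical, and undefined stands in for a tail that must never be evaluated.

```haskell
f :: [a] -> [a]
f []     = []
f (x:xs) = x : f xs

-- One step of f is enough to expose the outermost (:). The tail of the
-- input is undefined, so this would crash if f looked any further.
headAfterOneStep :: Int
headAfterOneStep = case f (1 : undefined) of
  (x:_) -> x
  []    -> error "unreachable: f of a cons is a cons"

-- The same property is what lets f make progress on infinite input.
firstThree :: [Integer]
firstThree = take 3 (f [1 ..])
```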

Coinduction doesn't need to do 1-to-1 mapping of input to output, there just needs to be the guarantee that we only need to read a finite amount of input before producing a non-zero finite amount of output. These functions are also coinductive:


    p []       = []
    p [x]      = [x]
    p (x:y:ys) = y : x : p ys

    q []       = []
    q [x]      = []
    q (x:y:ys) = y : q ys

    r []     = []
    r (x:xs) = x : x : r xs

They can also be written using build, though they're chunkier about reading input or producing output. These functions are not coinductive because there's no finite bound on how long it takes to reach WHNF:


    bad []     = []
    bad (x:xs) = bad xs

    reverse []     = []
    reverse (x:xs) = reverse xs ++ [x]
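
Returning to the remark that p, q, and r can be written using build, here is one possible sketch for p. The name pBuild is hypothetical; build is the real function from GHC.Exts. The worker go reads two elements at a time and emits two abstract conses, which is the "chunkier" behaviour mentioned above.

```haskell
import GHC.Exts (build)

-- p rewritten against abstract cons/nil instead of (:) and [].
pBuild :: [a] -> [a]
pBuild xs0 = build (\cons nil ->
  let go (x:y:ys) = y `cons` (x `cons` go ys)
      go [x]      = x `cons` nil
      go []       = nil
  in go xs0)
```

Because only a bounded amount of input is read before output appears, take 4 (pBuild [1 ..]) still terminates. The same exercise with bad would not help: it can be expressed with build as well, but it never calls cons, so a consumer waits forever on infinite input.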

Because build/fold is an isomorphism, we can technically use build for writing *any* function that produces a list. However, there's more to fusion than just using the build/fold isomorphism. The big idea behind it all is that when producers and consumers are in 1-to-1 correlation, then we can avoid allocating that 1 (the cons cell) and can just pass the arguments of the constructor directly to the consumer. For example:


    let buildF []     = []
        buildF (x:xs) = x : buildF xs

        consumeF []     = 0
        consumeF (x:xs) = 1 + consumeF xs
    in
        consumeF . buildF
==
    let buildF = \xs -> build (f xs)
            where
            f []     cons nil = nil
            f (x:xs) cons nil = x `cons` f xs cons nil

        consumeF = foldr consumeCons consumeNil
            where
            consumeNil       = 0
            consumeCons x rs = 1 + rs
    in
        consumeF . buildF
==
    let f []     cons nil = nil
        f (x:xs) cons nil = x `cons` f xs cons nil

        consumeNil       = 0
        consumeCons x rs = 1 + rs
    in
        foldr consumeCons consumeNil . \xs -> build (f xs)
==
    let... in
        \xs -> foldr consumeCons consumeNil (build (f xs))
==
    let... in
        \xs -> (f xs) consumeCons consumeNil

And now f never allocates any (:) or [], it just calls the two consumers directly. The first step of choosing to use build and foldr instead of primitive recursion is what enables the compiler to automatically do all the other steps.
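
For anyone who wants to load the derivation into GHCi, here is the same example as a standalone module. It only checks that the first and last lines of the derivation agree; it does not show what GHC's optimiser actually does. The names unfused and fused are added here for illustration.

```haskell
import GHC.Exts (build)

-- The producer, abstracted over cons and nil.
f :: [a] -> (a -> b -> b) -> b -> b
f []     _    nil = nil
f (x:xs) cons nil = x `cons` f xs cons nil

buildF :: [a] -> [a]
buildF xs = build (f xs)

-- The consumer, split into its two cases.
consumeNil :: Int
consumeNil = 0

consumeCons :: a -> Int -> Int
consumeCons _ rs = 1 + rs

consumeF :: [a] -> Int
consumeF = foldr consumeCons consumeNil

-- First line of the derivation: goes through an intermediate list.
unfused :: [a] -> Int
unfused = consumeF . buildF

-- Last line of the derivation: f calls the consumers directly.
fused :: [a] -> Int
fused xs = f xs consumeCons consumeNil
```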

Leaving it at that is cute since we can avoid allocating the list; however, due to laziness we may still end up allocating a spine of calls to consumeCons, which isn't much better than a spine of calls to (:). This is why "good producers" are ones which are capable of producing a single cons at a time: they never construct a spine before it is needed by the consumer. And this is why "good consumers" are ones which are capable of consuming a single cons at a time: they never force the production of a spine without immediately consuming it. We can relax this goodness from 1-to-1 to chunkier things, but that also reduces the benefits of fusion.
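
One way to see the difference is to compare two consumers for the same f. This is a sketch, not something from the derivation above: countLazy and countStrict are hypothetical names, and the accumulator-passing style in countStrict is a swapped-in technique (the usual trick for writing a left fold via foldr), shown only as an example of a consumer that deals with each cons as it arrives.

```haskell
f :: [a] -> (a -> b -> b) -> b -> b
f []     _    nil = nil
f (x:xs) cons nil = x `cons` f xs cons nil

-- Same shape as consumeCons: each step must wait for the rest of the
-- input before it can add, so pending (1 + ...) calls pile up.
countLazy :: [a] -> Int
countLazy xs = f xs (\_ rs -> 1 + rs) 0

-- Here b is instantiated to (Int -> Int): each "cons" takes the running
-- count, bumps it, forces it, and hands it to the continuation k.
countStrict :: [a] -> Int
countStrict xs = f xs step id 0
  where
    step _ k acc = let acc' = acc + 1 in acc' `seq` k acc'
```

Both return the length of the list; how well either version is compiled in practice is not something this example demonstrates.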




All of this can be generalized to other types besides lists, of course.

--
Live well,
~wren