Re: [Haskell-cafe] Higher-order algorithms

wren ng thornton Mon, 23 Aug 2010 21:29:42 -0700

Eugene Kirpichov wrote:

> Do there exist other nontrivial higher-order algorithms and datastructures?
> Is the field of higher-order algorithms indeed as unexplored as it seems?

Many algorithms in natural language processing can be captured by higher-order algorithms parameterized by the choice of semiring (or module space).

For example, consider the inference problem for hidden Markov models (which are often used for things like determining the part-of-speech tags for some sentence in natural language). To figure out the total probability that the HMM is in some state at some time, you use the Forward algorithm.[1] To figure out the probability of the most likely state sequence that has a specific state at some time, you use the Viterbi algorithm. To figure out not only the probability of the most likely state sequence but also what that tag sequence actually is, you can modify Viterbi to store back pointers.

All of these are the same algorithm, just with different (augmented) semirings. In order to prevent underflow for very small probabilities, we usually run these algorithms with probabilities in the log-domain. Those variants are also the same algorithm, just taking the image of the semiring under the logarithm functor:


Forward     : FW ([0,1], +, 0, *, 1)

Log Forward : FW ([-Inf,0], <+>, -Inf, +, 0)
    where
    -- Ignoring infinities...
    x <+> y | x >= y    = x + log (1 + exp (y-x))
            | otherwise = y + log (1 + exp (x-y))

Viterbi     : FW ([0,1], max, 0, *, 1)

Log Viterbi : FW ([-Inf,0], max, -Inf, +, 0)

ViterbiBP Q
    : FW ( Maybe([0,1],Maybe Q)
         , argmax, Nothing
         , <*>, Just(1,Nothing))
    where
    -- Q = the type of the states in your HMM
    mx <*> my = do
        (px,x) <- mx
        (py,y) <- my
        return (px*py, y `mappend` x)


Log (ViterbiBP Q)
    : FW ( Maybe([-Inf,0],Maybe Q)
         , argmax, Nothing
         , <+>, Just(0,Nothing))
    where
    mx <+> my = do
        (px,x) <- mx
        (py,y) <- my
        return (px+py, y `mappend` x)
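
To make the "same algorithm, different semiring" point concrete, here is a minimal sketch of the first three entries in the table above. The names Semiring, <.>, HMM, forward and toy are hypothetical, chosen for this illustration only; the HMM record is an assumed representation, and the extra guard in the log-domain <+> handles the -Inf case that the listing above ignores.

```haskell
import Data.List (foldl')

infixl 6 <+>
infixl 7 <.>

-- (zero, <+>) is a commutative monoid, (one, <.>) is a monoid,
-- <.> distributes over <+>, and zero annihilates under <.>.
class Semiring a where
  zero, one    :: a
  (<+>), (<.>) :: a -> a -> a

-- Forward: ([0,1], +, 0, *, 1)
newtype Prob = Prob Double deriving (Show, Eq)
instance Semiring Prob where
  zero = Prob 0
  one  = Prob 1
  Prob x <+> Prob y = Prob (x + y)
  Prob x <.> Prob y = Prob (x * y)

-- Viterbi: ([0,1], max, 0, *, 1)
newtype MaxProb = MaxProb Double deriving (Show, Eq)
instance Semiring MaxProb where
  zero = MaxProb 0
  one  = MaxProb 1
  MaxProb x <+> MaxProb y = MaxProb (max x y)
  MaxProb x <.> MaxProb y = MaxProb (x * y)

-- Log Forward: ([-Inf,0], log-sum-exp, -Inf, +, 0)
newtype LogProb = LogProb Double deriving (Show, Eq)
instance Semiring LogProb where
  zero = LogProb (-1 / 0)
  one  = LogProb 0
  LogProb x <+> LogProb y
    | isInfinite x && x < 0 = LogProb y  -- x is -Inf; avoids NaN from (-Inf) - (-Inf)
    | x >= y    = LogProb (x + log (1 + exp (y - x)))
    | otherwise = LogProb (y + log (1 + exp (x - y)))
  LogProb x <.> LogProb y = LogProb (x + y)

-- Assumed HMM representation, with weights drawn from any semiring w.
data HMM s o w = HMM
  { states  :: [s]
  , initial :: s -> w
  , trans   :: s -> s -> w  -- trans from to
  , emit    :: s -> o -> w
  }

-- Semiring sum of a list of weights.
ssum :: Semiring w => [w] -> w
ssum = foldl' (<+>) zero

-- One generic pass over the observations. The semiring decides whether
-- the result is a total probability (Prob, LogProb) or the weight of
-- the best state sequence (MaxProb).
forward :: Semiring w => HMM s o w -> [o] -> w
forward _   []     = one
forward hmm (o:os) = ssum (foldl' step start os)
  where
    start = [ initial hmm s <.> emit hmm s o | s <- states hmm ]
    step alpha o' =
      [ ssum [ a <.> trans hmm r s | (r, a) <- zip (states hmm) alpha ]
          <.> emit hmm s o'
      | s <- states hmm ]

-- A made-up two-state HMM; inj embeds plain probabilities into the
-- chosen semiring (Prob, MaxProb, or LogProb . log).
toy :: (Double -> w) -> HMM Bool Char w
toy inj = HMM
  { states  = [False, True]
  , initial = \_ -> inj 0.5
  , trans   = \r s -> inj (if r == s then 0.7 else 0.3)
  , emit    = \s o -> inj (if (o == 'a') == s then 0.9 else 0.1)
  }
```

With this sketch, forward (toy Prob) "ab" sums over all state sequences, forward (toy MaxProb) "ab" keeps only the best one, and forward (toy (LogProb . log)) "ab" computes the logarithm of the first result without ever leaving the log domain. Only the argument to toy changes.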

Using augmented semirings we can simplify the backpointer version significantly in order to incorporate the optimizations usually encountered in practice. That is, the Maybes are required to make it a semiring, but we can optimize both of them away in practice, yielding an augmented semiring over (Prob,Q) or (Log Prob, Q).
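
As a rough sketch of the unoptimized backpointer semiring, the following keeps the outer Maybe (Nothing is the zero) but, as a simplifying assumption, replaces the inner Maybe Q with a list of states kept in forward order rather than reversed. The names Semiring, Best and step are hypothetical, and ties in argmax are broken towards the left argument.

```haskell
infixl 6 <+>
infixl 7 <.>

class Semiring a where
  zero, one    :: a
  (<+>), (<.>) :: a -> a -> a

-- Nothing means "no derivation at all"; Just (p, path) is the best
-- derivation found so far together with the states it passed through.
newtype Best q = Best (Maybe (Double, [q])) deriving (Show, Eq)

instance Semiring (Best q) where
  zero = Best Nothing
  one  = Best (Just (1, []))
  -- argmax: keep whichever side has the higher probability
  Best Nothing <+> b = b
  a <+> Best Nothing = a
  a@(Best (Just (p1, _))) <+> b@(Best (Just (p2, _)))
    | p1 >= p2  = a
    | otherwise = b
  -- multiply the probabilities and concatenate the paths
  Best mx <.> Best my = Best $ do
    (px, xs) <- mx
    (py, ys) <- my
    return (px * py, xs ++ ys)

-- A single weighted step through state s.
step :: Double -> q -> Best q
step p s = Best (Just (p, [s]))
```

For example, (step 0.5 'A' <.> step 0.9 'B') <+> (step 0.5 'B' <.> step 0.2 'A') selects the first alternative, carrying both its probability and the path "AB".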

We get the same sort of thing for variants of the Backward algorithm used in the Forward--Backward algorithm. Of course, there's nothing special about HMMs here. We can extend the Forward--Backward algorithm to operate over tree structures instead of just list structures. That version is called the Inside--Outside algorithm. And semirings show up all over the place in other algorithms too.

Of course, in hindsight this makes perfect sense: the powerset of the free semiring over S is the set of all (automata theoretic) languages over S. So semirings capture languages exactly; in the same way that commutative monoids capture multisets, and monoids capture sequences. This insight also extends to cover things like weighted-logic programming languages, since we can use any semiring we like, not just the Boolean probability semiring. Automata theoretic languages are everywhere.

[1] Or you combine the Forward and Backward algorithms, depending on what exactly you want. Same goes for the others.


--
Live well,
~wren
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe
