On Oct 20, 2006, at 12:17 PM, Sylvain Schmitz wrote:
> A serious study could attempt to find deeper connections, but the connections seem likely to be roughly the same as the ones with context-free grammars.
This level of analysis is exactly what I had in mind. The notational similarities are nothing but a red herring, one that I agree is apt to mislead the casual reader. (Indeed, it is this sort of notational overloading that leads us to seek Quasi Natural Language representations in which terms of art can stand directly for themselves, rather than forcing the reader to disambiguate every group's preferred interpretation of common characters like '/'.)
Steedman's discussion of efficient chart-based CCG parsing strategies struck me as vaguely analogous to Packrat Parsing. The "fully lexicalized" nature of the grammar, which makes its dynamic extension painless; its ability to tackle phenomena like "argument cluster coordination"; and its functional-programming orientation in directly deriving logical forms rather than parse trees are all very appealing, and strike me as more elegant than other approaches to context-free grammars.
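To make the "deriving logical forms rather than parse trees" point concrete, here is a toy sketch (mine, not from the thread) of the two core CCG combination rules, forward and backward application. The category encoding, lexicon entries, and semantic representation are all invented for illustration:

```python
# A category is either a primitive string ("NP", "S") or a triple
# (result, slash, argument); e.g. (S\NP)/NP for a transitive verb.
# Each lexical entry pairs a category with a semantic function, so
# combining categories composes the logical form directly.

def fapp(f, a):
    """Forward application: X/Y applied to a following Y yields X."""
    (res, slash, arg), sem = f
    assert slash == '/' and arg == a[0]
    return (res, sem(a[1]))

def bapp(a, f):
    """Backward application: Y followed by X\\Y yields X."""
    (res, slash, arg), sem = f
    assert slash == '\\' and arg == a[0]
    return (res, sem(a[1]))

# Hypothetical toy lexicon.
lexicon = {
    'Alice': ('NP', 'alice'),
    'Bob':   ('NP', 'bob'),
    'likes': ((('S', '\\', 'NP'), '/', 'NP'),
              lambda obj: lambda subj: ('likes', subj, obj)),
}

# Derive "Alice likes Bob": the verb combines with its object by
# forward application, then with its subject by backward application,
# yielding category S and a logical form with no parse tree in sight.
verb_obj = fapp(lexicon['likes'], lexicon['Bob'])   # category S\NP
sentence = bapp(lexicon['Alice'], verb_obj)         # category S
print(sentence)  # ('S', ('likes', 'alice', 'bob'))
```

A real CCG also needs composition and type-raising rules to handle constructions like argument cluster coordination; the two applications above are just the minimal core.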
In short, I would hypothesize that CCGs may be a viable complement to PEGs in moving from Quasi Natural Language utterances (the kind you might make in explaining your code to a fellow programmer used to working in another programming language) to unambiguous programming-language semantics amenable to execution. One might then posit interleaving PEGs and CCGs as follows: a PEG would tokenize the input; a CCG lexicon-and-parse phase would fill in local anaphora, address control and binding issues, and in effect de-scramble natural language constructs that would be very hard to model directly with a PEG; and the resulting logical forms would then pass to a strictly PEG phase for the final composition of an AST to be fed to a programming-language interpreter or compiler.
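As a rough sketch of that three-stage interleaving, here is a hypothetical pipeline in which trivial stand-ins play the roles of the PEG tokenizer, the CCG phase (with a shared discourse dictionary for anaphora), and the final PEG composition step. Every function name and the "grammar" itself are invented; no real PEG or CCG implementation is assumed:

```python
def peg_tokenize(source):
    """Stage 1 stand-in for a PEG tokenizer: split the QNL source
    into statements and then into tokens."""
    return [stmt.strip().split() for stmt in source.split('.') if stmt.strip()]

def ccg_phase(tokens, discourse):
    """Stage 2 stub for the CCG parse: resolve anaphora against shared
    discourse state and return a canonical logical form."""
    resolved = tuple(discourse.get(tok, tok) for tok in tokens)
    if tokens and tokens[0] != 'it':
        discourse['it'] = tokens[0]  # naive antecedent tracking
    return resolved

def peg_compose(logical_forms):
    """Stage 3 stand-in for the final PEG phase: compose the
    per-statement logical forms into an AST (here a nested tuple)."""
    return ('program', *logical_forms)

def parse(source):
    discourse = {}
    forms = [ccg_phase(toks, discourse) for toks in peg_tokenize(source)]
    return peg_compose(forms)

ast = parse("x holds 3. increment it.")
print(ast)  # ('program', ('x', 'holds', '3'), ('increment', 'x'))
```

The point of the sketch is only the data flow: the "it" in the second statement is resolved by the middle phase before the outer phase ever sees it.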
In effect, the CCG could be invoked to parse individual statements, which might share some global state reflecting a discourse representation structure to resolve inter-statement anaphora, while the PEG would select a canonical reading of each statement and derive the overall parse of the program. This way we could optimize our handling of state by encapsulating the stateful aspects of our code in a CCG/discourse world outside of the PEG proper (somewhat akin to the IO Monad in Haskell).
As a further optimization, the first-phase PEG could detect and immediately parse simple statement forms, such as traditional function calls, that don't require CCG processing at all; only QNL text containing complex NL forms (which by language definition would be reduced to one canonical reading) would invoke the CCG. This way the language designer would pay a minimal CCG overhead only when using NL constructs.
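The fast-path dispatch could look something like the following sketch, where a simple regular expression stands in for the first-phase PEG, recognizing plain function-call statements directly and handing everything else off to a stubbed (and notionally expensive) CCG phase. All names here are illustrative:

```python
import re

# Matches a conventional call like f(x, y); a stand-in for the
# simple-statement rules of the first-phase PEG.
CALL = re.compile(r'^(\w+)\((\s*\w+(\s*,\s*\w+)*)?\s*\)$')

def ccg_parse(stmt):
    """Stub for the costly CCG phase; a real implementation would
    return the single canonical reading of the NL statement."""
    return ('ccg', stmt)

def parse_statement(stmt):
    m = CALL.match(stmt.strip())
    if m:  # simple function-call form: no CCG overhead incurred
        args = [a.strip() for a in (m.group(2) or '').split(',') if a.strip()]
        return ('call', m.group(1), args)
    return ccg_parse(stmt)

print(parse_statement("f(x, y)"))              # ('call', 'f', ['x', 'y'])
print(parse_statement("add one to the total")) # ('ccg', 'add one to the total')
```

Statements in the conventional form never touch the CCG machinery, which is exactly the "pay only when you use NL constructs" property described above.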
But making the final call on the feasibility of such a hybrid approach, and determining whether it could help us resolve the ambiguity vs. ordering conundrum, is a bit outside the ambit of my personal expertise. Likewise, it may be possible to parse these same QNL constructs with a cleverly crafted and highly optimized PEG, although I have yet to find any work applying PEGs to NL text. Other open questions are whether a Packrat parser could be used as a component of a CCG implementation, and whether a CCG rule type could be embedded in a PEG grammar.
In any case, I would direct anyone curious about CCGs to an excellent tutorial on them at: <ftp://ftp.cogsci.ed.ac.uk/pub/steedman/ccg/manifesto.pdf>
There are also some CCG implementations, other papers, and many links available at: <http://groups.inf.ed.ac.uk/ccg/index.html>
Cheers,
Peter

_______________________________________________
PEG mailing list
PEG@lists.csail.mit.edu
https://lists.csail.mit.edu/mailman/listinfo/peg