Re: Parser Generator?

2007-08-27 Thread Steven Bethard
Paul McGuire wrote: > On Aug 26, 10:48 pm, Steven Bethard <[EMAIL PROTECTED]> wrote: >> In Japanese and Chinese tokenization, word boundaries are not marked by >> different classes of characters. They only exist in the mind of the >> reader who knows which sequences of characters could be words giv

Re: Parser Generator?

2007-08-27 Thread Paul McGuire
On Aug 26, 10:48 pm, Steven Bethard <[EMAIL PROTECTED]> wrote: > Paul McGuire wrote: > > On Aug 26, 8:05 pm, "Ryan Ginstrom" <[EMAIL PROTECTED]> wrote: > >> The only caveat being that since Chinese and Japanese scripts don't > >> typically delimit "words" with spaces, I think you'd have to pass the

RE: Parser Generator?

2007-08-26 Thread Ryan Ginstrom
> On Behalf Of Paul McGuire > > On Aug 26, 8:05 pm, "Ryan Ginstrom" <[EMAIL PROTECTED]> wrote: > > The only caveat being that since Chinese and Japanese scripts don't > > typically delimit "words" with spaces, I think you'd have > to pass the > > text through a tokenizer (like ChaSen for Japane

Re: Parser Generator?

2007-08-26 Thread Steven Bethard
Paul McGuire wrote: > On Aug 26, 8:05 pm, "Ryan Ginstrom" <[EMAIL PROTECTED]> wrote: >> The only caveat being that since Chinese and Japanese scripts don't >> typically delimit "words" with spaces, I think you'd have to pass the text >> through a tokenizer (like ChaSen for Japanese) before using Py

Re: Parser Generator?

2007-08-26 Thread Paul McGuire
On Aug 26, 8:05 pm, "Ryan Ginstrom" <[EMAIL PROTECTED]> wrote: > > On Behalf Of Jason Evans > > Parsers typically deal with tokens rather than individual > > characters, so the scanner that creates the tokens is the > > main thing that Unicode matters to. I have written > > Unicode-aware scanners

RE: Parser Generator?

2007-08-26 Thread Ryan Ginstrom
> On Behalf Of Jason Evans > Parsers typically deal with tokens rather than individual > characters, so the scanner that creates the tokens is the > main thing that Unicode matters to. I have written > Unicode-aware scanners for use with Parsing-based parsers, > with no problems. This is pret

Re: Parser Generator?

2007-08-26 Thread Jack
Thanks Json. There seem to be a few options that I can pursue. Having a hard time chooing one now :) "Jason Evans" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] > On Aug 24, 1:21 pm, "Jack" <[EMAIL PROTECTED]> wrote: >> "Jason Evans" <[EMAIL PROTECTED]> wrote in message >> >htt

Re: Parser Generator?

2007-08-26 Thread Jack
Good to know, thanks Paul. ! "Paul McGuire" <[EMAIL PROTECTED]> wrote in message > Pyparsing was already mentioned once on this thread. Here is an > application using pyparsing that parses Chinese characters to convert > to English Python. > > http://pypi.python.org/pypi/zhpy/0.5 > > -- Paul -

Re: Parser Generator?

2007-08-26 Thread Jason Evans
On Aug 24, 1:21 pm, "Jack" <[EMAIL PROTECTED]> wrote: > "Jason Evans" <[EMAIL PROTECTED]> wrote in message > >http://www.canonware.com/Parsing/ > > Thanks Jason. Does Parsing.py support Unicode characters (especially CJK)? > I'll take a look. Parsers typically deal with tokens rather than indi

Re: Parser Generator?

2007-08-24 Thread Paul McGuire
On Aug 18, 11:37 pm, "Jack" <[EMAIL PROTECTED]> wrote: > Thanks for all the replies! > > SPARK looks promising. Its doc doesn't say if it handles unicode > (CJK in particular) encoding though. > > Yapps also looks powerful:http://theory.stanford.edu/~amitp/yapps/ > > There's also PyGgyhttp://lava.n

Re: Parser Generator?

2007-08-24 Thread Jack
Thanks Jason. Does Parsing.py support Unicode characters (especially CJK)? I'll take a look. "Jason Evans" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] > On Aug 18, 3:22 pm, "Jack" <[EMAIL PROTECTED]> wrote: >> Hi all, I need to do syntax parsing of simple naturual languages, >> fo

Re: Parser Generator?

2007-08-22 Thread Jason Evans
On Aug 18, 3:22 pm, "Jack" <[EMAIL PROTECTED]> wrote: > Hi all, I need to do syntax parsing of simple naturual languages, > for example, "weather of London" or "what is the time", simple > things like these, with Unicode support in the syntax. > > In Java, there are JavaCC, Antlr, etc. I wonder wha

Re: Parser Generator?

2007-08-19 Thread Jack
Very interesting work. Thanks for the link! "Alex Martelli" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] > > > """ > NLTK ¡ª the Natural Language Toolkit ¡ª is a suite of open source Python > modules, data sets and tutorials suppor

Re: Parser Generator?

2007-08-19 Thread Alex Martelli
Jack <[EMAIL PROTECTED]> wrote: > Thanks for the suggestion. I understand that more work is needed for natural > language > understanding. What I want to do is actually very simple - I pre-screen the > user > typed text. If it's a simple syntax my code understands, like, Weather in > London, I'll

Re: Parser Generator?

2007-08-19 Thread Jack
Thanks for the suggestion. I understand that more work is needed for natural language understanding. What I want to do is actually very simple - I pre-screen the user typed text. If it's a simple syntax my code understands, like, Weather in London, I'll redirect it to a weather site. Or, if it's

Re: Parser Generator?

2007-08-19 Thread samwyse
Jack wrote: > Thanks for all the replies! > > SPARK looks promising. Its doc doesn't say if it handles unicode > (CJK in particular) encoding though. > > Yapps also looks powerful: http://theory.stanford.edu/~amitp/yapps/ > > There's also PyGgy http://lava.net/~newsham/pyggy/ > > I may also giv

Re: Parser Generator?

2007-08-18 Thread Jack
Thanks for all the replies! SPARK looks promising. Its doc doesn't say if it handles unicode (CJK in particular) encoding though. Yapps also looks powerful: http://theory.stanford.edu/~amitp/yapps/ There's also PyGgy http://lava.net/~newsham/pyggy/ I may also give Antlr a try. If anyone has ex

Re: Parser Generator?

2007-08-18 Thread Tommy Nordgren
On 19 aug 2007, at 00.22, Jack wrote: > Hi all, I need to do syntax parsing of simple naturual languages, > for example, "weather of London" or "what is the time", simple > things like these, with Unicode support in the syntax. > > In Java, there are JavaCC, Antlr, etc. I wonder what people use >

Re: Parser Generator?

2007-08-18 Thread beginner
On Aug 18, 5:22 pm, "Jack" <[EMAIL PROTECTED]> wrote: > Hi all, I need to do syntax parsing of simple naturual languages, > for example, "weather of London" or "what is the time", simple > things like these, with Unicode support in the syntax. > > In Java, there are JavaCC, Antlr, etc. I wonder wha

Re: Parser Generator?

2007-08-18 Thread Diez B. Roggisch
Jack schrieb: > Hi all, I need to do syntax parsing of simple naturual languages, > for example, "weather of London" or "what is the time", simple > things like these, with Unicode support in the syntax. > > In Java, there are JavaCC, Antlr, etc. I wonder what people use > in Python? Antlr also ha