You're probably right Paul.  But, my assumption is that the originators of 
legal documents pay a little more attention to getting the citation correct and 
in the right format then say Joe Bloggs does when completing an address block.  

I think that Kent has reached the end of his commendable effort.  I'll test out 
the latest version in anger over the coming weeks on large numbers of legal 
documents.

Dinesh



--------------------------------------------------------------------------------

Message: 2
Date: Tue, 10 Feb 2009 14:29:20 -0600
From: "Paul McGuire" <pt...@austin.rr.com>
Subject: Re: [Tutor] Picking up citations
To: <tutor@python.org>
Message-ID: <0a8f5cca89bf4b08becd3c4b86f18...@awa2>
Content-Type: text/plain; charset="us-ascii"

Dinesh and Kent -

I've been lurking along as you run this problem to ground.  The syntax you
are working on looks very slippery, and reminds me of some of the issues I
had writing a generic street address parser with pyparsing
(http://pyparsing.wikispaces.com/file/view/streetAddressParser.py).  Mailing
list companies spend beaucoup $$$ trying to parse addresses in order to
filter duplicates, to group by zip code, street, neighborhood, etc., and
this citation format looks similarly scary.  

Congratulations on getting to a 95% solution using PLY.

-- Paul



_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Reply via email to