You're probably right, Paul. But my assumption is that the originators of legal documents pay a little more attention to getting the citation correct and in the right format than, say, Joe Bloggs does when completing an address block.
I think that Kent has reached the end of his commendable effort. I'll test out the latest version in anger over the coming weeks on large numbers of legal documents.

Dinesh

--------------------------------------------------------------------------------

Message: 2
Date: Tue, 10 Feb 2009 14:29:20 -0600
From: "Paul McGuire" <pt...@austin.rr.com>
Subject: Re: [Tutor] Picking up citations
To: <tutor@python.org>
Message-ID: <0a8f5cca89bf4b08becd3c4b86f18...@awa2>
Content-Type: text/plain; charset="us-ascii"

Dinesh and Kent -

I've been lurking along as you run this problem to ground. The syntax you are working on looks very slippery, and reminds me of some of the issues I had writing a generic street address parser with pyparsing (http://pyparsing.wikispaces.com/file/view/streetAddressParser.py). Mailing list companies spend beaucoup $$$ trying to parse addresses in order to filter duplicates, to group by zip code, street, neighborhood, etc., and this citation format looks similarly scary. Congratulations on getting to a 95% solution using PLY.

-- Paul
_______________________________________________ Tutor maillist - Tutor@python.org http://mail.python.org/mailman/listinfo/tutor