Re: Usable street address parser in Python?

2010-04-21 Thread Albert van der Horst
In article 4bcddc5a$0$1630$742ec...@news.sonic.net, John Nagle na...@animats.com wrote: Iain King wrote: Not sure on the volume of addresses you're working with, but as an alternative you could try grabbing the zip code, looking up all addresses in that zip code, and then finding whatever one

Re: Usable street address parser in Python?

2010-04-20 Thread Tim Roberts
John Nagle na...@animats.com wrote: Unfortunately, now it won't run with the released version of pyparsing (1.5.2, from April 2009), because it uses originalTextFor, a feature introduced since then. I worked around that, but discovered that the new version is case-sensitive. Changed Keyword

Re: Usable street address parser in Python?

2010-04-20 Thread John Yeung
My response is similar to John Roth's. It's mainly just sympathy. ;) I deal with addresses a lot, and I know that a really good parser is both rare/expensive to find and difficult to write yourself. We have commercial, USPS-certified products where I work, and even with those I've written a

Re: Usable street address parser in Python?

2010-04-20 Thread Iain King
On Apr 20, 8:24 am, John Yeung gallium.arsen...@gmail.com wrote: My response is similar to John Roth's. It's mainly just sympathy. ;) I deal with addresses a lot, and I know that a really good parser is both rare/expensive to find and difficult to write yourself. We have commercial,

Re: Usable street address parser in Python?

2010-04-20 Thread Grant Edwards
On 2010-04-20, Tim Roberts t...@probo.com wrote: This is a very tricky problem. Consider Salem, Oregon, which puts the direction after the street: 3340 Astoria Way NE Salem, OR 97303 In Minneapolis, the direction comes before the street in some quadrants and after it in others. I
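To make the directional problem above concrete, here is a minimal regex sketch that accepts a compass direction either before or after the street name. It is only an illustration, not anyone's posted parser: the names `DIRECTION`, `STREET_LINE`, and `split_street_line` are hypothetical, "100 N Main St" is a made-up test address, and the pattern only looks at the street line (no city, state, or ZIP). It will also misread streets that are themselves named after a letter or direction, which is part of why the problem is tricky.

```python
import re

# Abbreviated compass directions only; spelled-out forms ("North") are
# deliberately left out of this sketch.
DIRECTION = r"(?:N|S|E|W|NE|NW|SE|SW)"

# number, optional leading direction, street name, optional trailing direction
STREET_LINE = re.compile(
    rf"^(?P<number>\d+)\s+"
    rf"(?:(?P<pre>{DIRECTION})\s+)?"
    rf"(?P<street>.+?)"
    rf"(?:\s+(?P<post>{DIRECTION}))?$",
    re.IGNORECASE,
)


def split_street_line(line):
    """Return a dict of number/pre/street/post, or None if nothing matches."""
    match = STREET_LINE.match(line.strip())
    return match.groupdict() if match else None


# Direction after the street, as in the Salem example quoted above.
print(split_street_line("3340 Astoria Way NE"))
# Direction before the street (hypothetical address).
print(split_street_line("100 N Main St"))
```

Treating both positions as optional keeps the pattern short, but it pushes the ambiguity onto the caller, who still has to decide what to do when both or neither are present.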

Re: Usable street address parser in Python?

2010-04-20 Thread John Nagle
Iain King wrote: Not sure on the volume of addresses you're working with, but as an alternative you could try grabbing the zip code, looking up all addresses in that zip code, and then finding whatever one of those address strings most closely resembles your address string (smallest Levenshtein
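A rough sketch of the ZIP-then-nearest-match idea quoted above, assuming you already have some table of known addresses keyed by ZIP code. The table `addresses_by_zip`, the helper names, and the second candidate address are all hypothetical; the Levenshtein function is a plain textbook dynamic-programming version, not taken from any particular library.

```python
import re


def levenshtein(a, b):
    """Edit distance counting single-character inserts, deletes, substitutions."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # delete from a
                               current[j - 1] + 1,       # insert into a
                               previous[j - 1] + cost))  # substitute
        previous = current
    return previous[-1]


# Five-digit ZIP with an optional +4 suffix; only the five digits are captured.
ZIP_RE = re.compile(r"\b(\d{5})(?:-\d{4})?\b")


def extract_zip(address):
    """Return the last five-digit group in the string, or None."""
    matches = ZIP_RE.findall(address)
    return matches[-1] if matches else None


def normalize(text):
    """Uppercase and collapse runs of whitespace before comparing."""
    return " ".join(text.upper().split())


def closest_address(raw, addresses_by_zip):
    """Pick the known address in the same ZIP with the smallest edit distance."""
    candidates = addresses_by_zip.get(extract_zip(raw), [])
    if not candidates:
        return None
    target = normalize(raw)
    return min(candidates, key=lambda c: levenshtein(target, normalize(c)))


# Hypothetical lookup table; a real one would come from an address database.
addresses_by_zip = {
    "97303": [
        "3340 ASTORIA WAY NE SALEM OR 97303",
        "3350 ASTORIA WAY NE SALEM OR 97303",
    ],
}
print(closest_address("3340 Astoria Wy NE, Salem, OR 97303", addresses_by_zip))
```

Taking the last five-digit group is a guess that the ZIP comes at the end, so a five-digit house number earlier in the string is not mistaken for it. The approach also depends entirely on having the per-ZIP address list in the first place.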

Re: Usable street address parser in Python?

2010-04-19 Thread Paul McGuire
On Apr 17, 2:23 pm, John Nagle na...@animats.com wrote: Is there a usable street address parser available? There are some bad ones out there, but nothing good that I've found other than commercial products with large databases. I don't need 100% accuracy, but I'd like to be able to

Re: Usable street address parser in Python?

2010-04-19 Thread Stefan Behnel
John Nagle, 17.04.2010 21:23: Is there a usable street address parser available? What kind of street address are you talking about? Only US-American ones? Because street addresses are spelled differently all over the world. Some have house numbers, some use letters or a combination, some

Re: Usable street address parser in Python?

2010-04-19 Thread John Nagle
John Nagle wrote: Is there a usable street address parser available? There are some bad ones out there, but nothing good that I've found other than commercial products with large databases. I don't need 100% accuracy, but I'd like to be able to extract street name and street number for at

Re: Usable street address parser in Python?

2010-04-18 Thread John Roth
On Apr 17, 1:23 pm, John Nagle na...@animats.com wrote: Is there a usable street address parser available? There are some bad ones out there, but nothing good that I've found other than commercial products with large databases. I don't need 100% accuracy, but I'd like to be able to

Usable street address parser in Python?

2010-04-17 Thread John Nagle
Is there a usable street address parser available? There are some bad ones out there, but nothing good that I've found other than commercial products with large databases. I don't need 100% accuracy, but I'd like to be able to extract street name and street number for at least 98% of US