30 million records isn't really a lot with an index and shouldn't be taking quite that long?
Johnathan On May 29, 2011, at 12:28 AM, Stephen Woodbridge <wood...@swoodbridge.com> wrote: > I have had similar performance experiences working with tiger data in other > applications, ie not this geocoder, where queries cost about 200-400ms > initially querying a 30 million record streets table and then go to 20-40ms > afterwards. I have always attributed this to page caching. My queries are > typically spatial in nature and I cluster my data based on the spatial index. > But for the geocoder, I would expect similar performance if you cluster your > data by zipcode and then sort your input data by zipcode, you should get very > good performance depending on your queries and indexes. > > -Steve > > On 5/28/2011 5:24 PM, Paragon Corporation wrote: >> Mikal, >> Can you send me the change you made and the indexes you added. When >> adding some more data, I realized I had hardcoded an index for our local >> state (MA) and I know without that index that that particular query does >> run pretty slow. So just wondering if its along the same lines. >> I've also fixed I think all the issues with running the loader on >> Unix/Linux -- well at least I was able to get it to run on my CentOS. >> Thanks to all who contributed input to that. I took bits and pieces from >> many people's comments but couldn't apply a full diff from anyones since >> I had already changed the code too much to safely apply any of those >> patches. >> How many states do you have loaded BTW? I just have CA loaded on my >> CentOS -- which is an 8GB/8 core cloud server. I'm getting around 38ms - >> 450ms per test, but I have yet to load the other states. >> It also seems to cache very well so that if I geocode an address on the >> same street (not necessarily same address), the first call might take >> 450ms and the second 38ms. I suspect this might be because I also marked >> a good chunk of the functions STABLE or IMMUTABLE. >> Thanks, >> Regina >> http://www.postgis.us >> >> *From:* postgis-users-boun...@postgis.refractions.net >> [mailto:postgis-users-boun...@postgis.refractions.net] *On Behalf Of >> *Mikal Laster >> *Sent:* Friday, May 27, 2011 8:18 AM >> *To:* postgis-users@postgis.refractions.net >> *Subject:* [postgis-users] Fw: re: Geocoder (from extras) >> >> >> >> --- On *Fri, 5/27/11, Mikal Laster /<orcl...@yahoo.com>/* wrote: >> >> >> From: Mikal Laster <orcl...@yahoo.com> >> Subject: re:[postgis-users] Geocoder (from extras) >> To: postgis-users@postgis.refractions.net >> Date: Friday, May 27, 2011, 7:22 AM >> >> in response to >> >> http://postgis.refractions.net/pipermail/postgis-users/2011-May/029566.html. >> After creating some indexes and rewriting geocode_address. I was >> able to get geocode to run in 483-523 ms for "5775 Perimeter Dr >> Dublin, Ohio". This used to take 1700-2000 ms for me. I'm removing >> the main inner qui >> >> >> >> _______________________________________________ >> postgis-users mailing list >> postgis-users@postgis.refractions.net >> http://postgis.refractions.net/mailman/listinfo/postgis-users > > _______________________________________________ > postgis-users mailing list > postgis-users@postgis.refractions.net > http://postgis.refractions.net/mailman/listinfo/postgis-users _______________________________________________ postgis-users mailing list postgis-users@postgis.refractions.net http://postgis.refractions.net/mailman/listinfo/postgis-users