Hi,

ETL gig involving bulk-geocoding a table of Location names.  You'll need to be 
familiar with the Geonames service (and perhaps Yahoo/Google geocoding APIs) 
and have solid SQL skills.

You'll be given 3 MySQL tables:

1.) A "Locations" table containing 7400 international locations.  Each location 
row has a "parent id" defined, creating a hierarchical relationship between the 
various locations.  So, more precise names can be created by recursively 
concatenating the location names together.  For example:

-- World (id = 0, parent = null)
   -- United States  (id = 100, parent = 0)
      -- New York (id = 200, parent = 100)
          -- Montauk (id = 300, parent = 200)

2.) A table of US zip codes (will help to disambiguate some US locations).  
3.) A table of country codes (will help to break up the lookups into country 
chunks)

Your delivarable:
 
1.) A plan/approach to geocoding.  This will outline the steps you'll take 
before doing any work.
     For example, perhaps you'd want to join countries -> locations, pull 
geonames feeds for those countries, break down the lookups by country.  Also, 
what's your plan for disambuating the lookup results?  If you want to write a 
loop that hits geonames 7500 times with a raw location name, thank you in 
advance for not replying.  :)

2.) An updated Locations table containing 5 new fields:
     a.) lat coord
     b.) long coord
     c.) foreign key to the US zip code table (if applicable)
     d.) foreign key to the Country table
     e.) geonames ID
 
Cheers,
Matt
 
 
 
_______________________________________________
New York PHP Community Jobs Mailing List
http://lists.nyphp.org/mailman/listinfo/jobs

NYPHPCon 2006 Presentations Online
http://www.nyphpcon.com

Show Your Participation in New York PHP
http://www.nyphp.org/show_participation.php

Reply via email to