Thanks v much Raphael. This is great.

On Friday 1 April 2016, Raphael Susewind <li...@raphael-susewind.de> wrote:

> Dear all,
>
> following up on my earlier email, I just pushed a list of pincodes for
> all electoral booths across India to GitHub and made a pull request to
> the datameet repository:
>
> https://github.com/datameet/pincodes/pull/2
>
> Please note that this can be incomplete, and is based on a rather
> brutish, quick and dirty hack - see comments in rolls2pincode.pl. But it
> does use the same IDs as those in the 2014 elections, and hence can be
> combined with my GIS shapefiles for polling booths:
>
> http://dx.doi.org/10.4119/unibi/2674065
>
> I leave it to others to double-check accuracy and create actual pincode
> maps. I hope this is useful,
>
> Best,
> Raphael
>
> On 28.03.2016 07:50, Raphael Susewind wrote:
>
> > Dear Avinash and all,
> >
> > I will try to make some time this week to scrape the pincodes from
> > electoral rolls for all polling booths in my electoral GIS shapefiles.
> >
> > Since pincode is in latin script, this should not be affected by the
> > much discussed PDF scraping issues with electoral rolls.
> >
> > We could then either go down the voronoi route, or alternatively use the
> > heatmap processing chain that I used to generate AC boundaries - this
> > latter would have the advantage of dealing with wrong coordinates in the
> > booth point dataset (basically, not all electoral booth coordinates are
> > correct; consequently, if we only voronoi, we would have a blip of
> > pincode B within a see of pincode A quite frequently. The heatmap stuff
> > takes care of this).
> >
> > Since I am not familiar with postal boundaries: can anyone here confirm
> > whether pincode areas are contiguous, and whether each pincode has only
> > one area? Or can it be that several non-contiguous areas have the same
> > pincodem intersparsed with other pincodes? (In which case voronoi would
> > perhaps be the better solution at last)
> >
> > In any case, I hope to give you the pincode for each polling booth by
> > end of the week or so (based on all-India 2014 electoral rolls),
> >
> > Best,
> > Raphael
> >
> > On 28.03.2016 06:33, Avinash Celestine wrote:
> >
> >> perhaps one way is to avoid using postal data altogether.
> >>
> >> All header pages in electoral rolls(the first page) contain the name of
> >> the polling station related to that roll, the PS number, and importantly
> >> the pin code.
> >>
> >>  A site like psleci.nic.in <http://psleci.nic.in> has geog coordinates
> >> of polling stations (though Raphael had collected the data earlier*).
> >> Matching the two will give a fairly dense scattering of points  - in
> >> fact much more dense than if we used some of the methods earlier in this
> >> thread.
> >>
> >> We thus have a way of associating a pin code with a geo coordinate. We
> >> can then use the voronoi method.
> >>
> >> Electoral rolls are mostly in pdf which make them difficult to scrape.
> >> But from what i have seen, for any given state, the location on the
> >> header page, of the pincode number is more or less constant, making it
> >> possible to target just that part of the page with any pdf parser.
> >>
> >> Electoral rolls have become difficult to download in bulk( a good
> >> thing!) but i understand different people on this group have the pdfs
> >> for different states. Putting this stuff together should give us
> >> comprehensive data on header pages for atleast some states.
> >> Alternatively, we can file RTIs for just the header pages of electoral
> >> rolls, though i dont know how successful that would be.
> >>
> >> * Raphael's data is
> >> at https://github.com/raphael-susewind/india-election-data
> >>
> >>
> >>
> >> On Sun, Mar 27, 2016 at 12:07 PM, srinivas kodali <
> iota.kod...@gmail.com <javascript:;>
> >> <mailto:iota.kod...@gmail.com <javascript:;>>> wrote:
> >>
> >>     Well, There were postal delivery zones in the past and the postal
> >>     department even used to make maps of these zones. The Delhi postal
> >>     delivery zone map
> >>     <
> https://drive.google.com/file/d/0B1RcWLku0ZOWWVBHMldrZWdfZEU/view?usp=sharing>
> had
> >>     boundaries for delhi. I am not sure if other cities had them or how
> >>     long the postal department was doing this, but it certainly can help
> >>     with the boundaries for cities.
> >>
> >>     Regards,
> >>     Srinivas Kodali
> >>     www.lostprogrammer.com <http://www.lostprogrammer.com>
> >>     /"Not everyone who wanders is lost, I am probably a bit"/
> >>
> >>     On Tue, Mar 22, 2016 at 9:29 PM, Arun Ganesh <arungra...@gmail.com
> <javascript:;>
> >>     <mailto:arungra...@gmail.com <javascript:;>>> wrote:
> >>
> >>         Shravan, crowdsourcing the boundaries of pincodes is not as
> >>         trivial as you think. To start with, an area does not fall under
> >>         a pincode, rather a street does based on the post office that
> >>         services it. Read
> >>         this:
> http://www.georeference.org/doc/zip_codes_are_not_areas.htm
> >>
> >>         You may also want to do some background reading of existing
> >>         research that has been done by the group
> >>         here: https://datameet.hackpad.com/M4hPFJVV2Gm?eid=v4YoXN4tTw5
> >>
> >>         To sum up, nobody has precise pincode boundaries like how you
> >>         imagine them, not even the postal department. Any existing
> >>         datasets are an estimate at best using some data processing on a
> >>         large volume of address data.
> >>
> >>         --
> >>         Datameet is a community of Data Science enthusiasts in India.
> >>         Know more about us by visiting http://datameet.org
> >>         ---
> >>         You received this message because you are subscribed to the
> >>         Google Groups "datameet" group.
> >>         To unsubscribe from this group and stop receiving emails from
> >>         it, send an email to datameet+unsubscr...@googlegroups.com
> <javascript:;>
> >>         <mailto:datameet+unsubscr...@googlegroups.com <javascript:;>>.
> >>         For more options, visit https://groups.google.com/d/optout.
> >>
> >>
> >>     --
> >>     Datameet is a community of Data Science enthusiasts in India. Know
> >>     more about us by visiting http://datameet.org
> >>     ---
> >>     You received this message because you are subscribed to the Google
> >>     Groups "datameet" group.
> >>     To unsubscribe from this group and stop receiving emails from it,
> >>     send an email to datameet+unsubscr...@googlegroups.com
> <javascript:;>
> >>     <mailto:datameet+unsubscr...@googlegroups.com <javascript:;>>.
> >>     For more options, visit https://groups.google.com/d/optout.
> >>
> >>
> >> --
> >> Datameet is a community of Data Science enthusiasts in India. Know more
> >> about us by visiting http://datameet.org
> >> ---
> >> You received this message because you are subscribed to the Google
> >> Groups "datameet" group.
> >> To unsubscribe from this group and stop receiving emails from it, send
> >> an email to datameet+unsubscr...@googlegroups.com <javascript:;>
> >> <mailto:datameet+unsubscr...@googlegroups.com <javascript:;>>.
> >> For more options, visit https://groups.google.com/d/optout.
> >
>
> --
> Dr Raphael Susewind | Associate, Contemporary South Asia Studies, Oxford
>          Snail Mail | Melanchthonstr. 4a, 33615 Bielefeld, Germany
>       Web & Twitter | https://www.raphael-susewind.de | @RaphaelSusewind
>              Impact | https://impactstory.org/raphael-susewind
>
> Please consider https://www.gnupg.org for encryption (key id 10AEE42F)
>
>
> --
> Datameet is a community of Data Science enthusiasts in India. Know more
> about us by visiting http://datameet.org
> ---
> You received this message because you are subscribed to the Google Groups
> "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to datameet+unsubscr...@googlegroups.com <javascript:;>.
> For more options, visit https://groups.google.com/d/optout.
>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to