Thanks for the mention Arun! I have now updated the crawler - removing tabs, unwanted newlines, leading and trailing spaces in the data columns.
Here is the latest links to download: JSON: https://api.apify.com/v1/execs/7t9roKQ9yp6T8ZnpR/results?format=json&simplified=1&attachment=1 CSV: https://api.apify.com/v1/execs/7t9roKQ9yp6T8ZnpR/results?format=csv&simplified=1&attachment=1 Hope this helps! @Jasvinder I think one solution to extract locations for the stations is via Overpass using the station codes and combining them to the spreadsheet. -- Srihari On Fri, 4 Jan 2019 at 12:43, Jasvinder Singh <jasvinsinghre...@gmail.com> wrote: > Dear Arun, > Exactly the type of simple data sheet that newbies can understand. However > how location (Coordinates) is linked in this file? > > On Fri, Jan 4, 2019 at 12:11 PM Arun Ganesh <arungra...@gmail.com> wrote: > >> Spreadsheet if anyone wants to explore: >> https://docs.google.com/spreadsheets/d/1AFwl_5cB9qD39VWNox1LoeL3tGaGB22f7p4vc7IyMqY/edit#gid=0 >> >> There are 16,770 station entries of which 11,660 seem to be currently >> operational according to the expiry date of 2999. >> >> Filtering out goods stations, there are 9835 entries. This still seems to >> include a few yards and cabins that are not legitimate stations. Also >> noticed quite a few spelling and formatting issues in the names. The >> station codes look correct. Some amount of manual cleanup is needed on this >> list. >> >> The official number of stations according to IR is 7349 stations (as of >> 2017) >> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/IRSP_2016-17/Facts_Figure/Fact_Figures%20English%202016-17.pdf> >> and 1817 halts/block huts (2013) >> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/downloads/Data_Bank.pdf>. >> >> >> On Fri, Jan 4, 2019 at 11:53 AM Arun Ganesh <arungra...@gmail.com> wrote: >> >>> Beauty of the internet, crawler got done by Srihari: >>> https://twitter.com/sriharithalla/status/1080801313707896837 >>> >>> JSON data: >>> https://api.apify.com/v1/execs/TsBwnYutP5u9FCKp5/results?format=json&simplified=1 >>> >>> I'm in the process of doing a little bit of cleanup using openrefine and >>> will share on a spreadsheet. >>> >>> On Fri, Jan 4, 2019 at 10:01 AM Jasvinder Singh < >>> jasvinsinghre...@gmail.com> wrote: >>> >>>> Dear All, >>>> >>>> Not all the members are familiar with intricacies of the data >>>> collection for such projects. Since this seems to be a crowd sourcing >>>> endeavour, I suggest that the basic data collection protocol be enumerated >>>> for newbies so that they can also contribute data which can then be put in >>>> proper format by professionals. >>>> >>>> Regards, >>>> >>>> Jasvinder Singh >>>> >>>> On Tue, Dec 25, 2018 at 10:32 AM Nikhil VJ <nikhil...@gmail.com> wrote: >>>> >>>>> Hi folks, >>>>> >>>>> There's a project afoot in the OpenStreetMap and Wikidata communities >>>>> to get together Indian Railways data. >>>>> >>>>> One major part of it: Properly mapping all the railway stations of >>>>> India, and ensuring they have wikidata entries. >>>>> >>>>> Here's a wiki page set up for it: >>>>> https://www.wikidata.org/wiki/Wikidata:WikiProject_Indian_Railways >>>>> >>>>> I'm cross-posting from OpenStreetMap India Telegram >>>>> <https://t.me/OSMIndia> group: >>>>> >>>>> (Arun Ganesh): There seems to be around 7000 stations located. There >>>>> still ~1.5k missing. A lot more need names, refs and wikidata links. >>>>> Overpass: *http://overpass-turbo.eu/s/EC4 >>>>> <http://overpass-turbo.eu/s/EC4>* >>>>> >>>>> >>>>> (Srihari Thalla) : Last year I created two MapRoulette Challenges to >>>>> tag station codes and add Wiki tags >>>>> *https://maproulette.org/mr3/challenge/2403 >>>>> <https://maproulette.org/mr3/challenge/2403>* >>>>> *https://maproulette.org/mr3/challenge/2404 >>>>> <https://maproulette.org/mr3/challenge/2404>* >>>>> >>>>> -------- >>>>> >>>>> The overpass query above queries the whole country and may be slow or >>>>> timeout. I adapted the query to work only on the map area being >>>>> viewed, so you can zoom into smaller regions. And changed a few things, >>>>> included a legend in the comments to explain. >>>>> >>>>> *https://overpass-turbo.eu/s/EL9 <https://overpass-turbo.eu/s/EL9>* >>>>> >>>>> >>>>> Want to get involved? Engage here. >>>>> >>>>> >>>>> >>>>> Regards >>>>> >>>>> Nikhil VJ >>>>> >>>>> Pune, India >>>>> >>>>> >>>>> PS: Posting on an older thread from '15 that had the perfect subject >>>>> line - didn't want to create yet another new thread. Pro tip: Use >>>>> Datameet from >>>>> google groups <https://groups.google.com/forum/#!forum/datameet> - >>>>> its more fun and you can find stuff that was posted long before you >>>>> joined. >>>>> >>>>> -- >>>>> Datameet is a community of Data Science enthusiasts in India. Know >>>>> more about us by visiting http://datameet.org >>>>> --- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "datameet" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to datameet+unsubscr...@googlegroups.com. >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> -- >>>> Datameet is a community of Data Science enthusiasts in India. Know more >>>> about us by visiting http://datameet.org >>>> --- >>>> You received this message because you are subscribed to the Google >>>> Groups "datameet" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to datameet+unsubscr...@googlegroups.com. >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> -- >> Datameet is a community of Data Science enthusiasts in India. Know more >> about us by visiting http://datameet.org >> --- >> You received this message because you are subscribed to the Google Groups >> "datameet" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to datameet+unsubscr...@googlegroups.com. >> For more options, visit https://groups.google.com/d/optout. >> > -- > Datameet is a community of Data Science enthusiasts in India. Know more > about us by visiting http://datameet.org > --- > You received this message because you are subscribed to the Google Groups > "datameet" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to datameet+unsubscr...@googlegroups.com. > For more options, visit https://groups.google.com/d/optout. > -- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org --- You received this message because you are subscribed to the Google Groups "datameet" group. To unsubscribe from this group and stop receiving emails from it, send an email to datameet+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.