Thanks for the mention, Arun!

I have now updated the crawler - removing tabs, unwanted newlines, leading
and trailing spaces in the data columns.
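
For anyone cleaning an older export themselves, here is a minimal sketch of that
kind of whitespace cleanup. It is not the crawler's actual code; the function
names are hypothetical, and it collapses every run of whitespace into a single
space, which may be slightly more aggressive than what the crawler does.

```python
import re

# Any run of whitespace: tabs, newlines, carriage returns, repeated spaces.
_WHITESPACE_RUN = re.compile(r"\s+")


def clean_field(value: str) -> str:
    """Collapse whitespace runs into single spaces and trim both ends."""
    return _WHITESPACE_RUN.sub(" ", value).strip()


def clean_row(row: dict) -> dict:
    """Clean every string column of one record; leave other types untouched."""
    return {
        key: clean_field(val) if isinstance(val, str) else val
        for key, val in row.items()
    }
```

Calling clean_row on each record of the JSON export before writing it out
should be enough for this kind of cleanup.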

Here are the latest links to download:
JSON:
https://api.apify.com/v1/execs/7t9roKQ9yp6T8ZnpR/results?format=json&simplified=1&attachment=1
CSV:
https://api.apify.com/v1/execs/7t9roKQ9yp6T8ZnpR/results?format=csv&simplified=1&attachment=1

Hope this helps!

@Jasvinder, I think one way to get the station locations is to pull them
from OpenStreetMap via Overpass using the station codes, and then join them
to the spreadsheet.
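
A rough sketch of that join, under a few assumptions that should be checked
against the real data: that OSM stores the station code in the "ref" tag, that
stations are mapped as nodes tagged railway=station or railway=halt, and that
the spreadsheet has a column named "code" (a hypothetical name). The function
names are made up for illustration.

```python
# Overpass QL: station/halt nodes in India that carry a "ref" tag.
# Assumption: "ref" holds the station code. Stations mapped as ways or
# relations are not covered by this query.
OVERPASS_QUERY = """
[out:json][timeout:180];
area["ISO3166-1"="IN"][admin_level=2]->.india;
node["railway"~"^(station|halt)$"]["ref"](area.india);
out body;
"""


def index_by_code(elements):
    """Map station code -> (lat, lon) from the "elements" list of Overpass JSON."""
    index = {}
    for el in elements:
        code = el.get("tags", {}).get("ref", "").strip().upper()
        if code and "lat" in el and "lon" in el:
            # Keep the first node seen for a code; duplicates need manual review.
            index.setdefault(code, (el["lat"], el["lon"]))
    return index


def add_coordinates(rows, index, code_column="code"):
    """Return copies of the spreadsheet rows with "lat" and "lon" added.

    Rows whose code has no match in OSM get None for both values.
    """
    joined = []
    for row in rows:
        code = (row.get(code_column) or "").strip().upper()
        lat, lon = index.get(code, (None, None))
        joined.append({**row, "lat": lat, "lon": lon})
    return joined
```

The query can be run in overpass-turbo and exported as JSON; the "elements"
list from that export is what index_by_code expects. Rows that come back with
None are the stations that still need a ref tag (or a node at all) in OSM.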

-- Srihari


On Fri, 4 Jan 2019 at 12:43, Jasvinder Singh <jasvinsinghre...@gmail.com>
wrote:

> Dear Arun,
> This is exactly the type of simple data sheet that newbies can understand.
> However, how is location (coordinates) linked in this file?
>
> On Fri, Jan 4, 2019 at 12:11 PM Arun Ganesh <arungra...@gmail.com> wrote:
>
>> Spreadsheet if anyone wants to explore:
>> https://docs.google.com/spreadsheets/d/1AFwl_5cB9qD39VWNox1LoeL3tGaGB22f7p4vc7IyMqY/edit#gid=0
>>
>> There are 16,770 station entries of which 11,660 seem to be currently
>> operational according to the expiry date of 2999.
>>
>> Filtering out goods stations, there are 9835 entries. This still seems to
>> include a few yards and cabins that are not legitimate stations. Also
>> noticed quite a few spelling and formatting issues in the names. The
>> station codes look correct. Some amount of manual cleanup is needed on this
>> list.
>>
>> The official number of stations according to IR is 7349 stations (as of
>> 2017)
>> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/IRSP_2016-17/Facts_Figure/Fact_Figures%20English%202016-17.pdf>
>> and 1817 halts/block huts (2013)
>> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/downloads/Data_Bank.pdf>.
>>
>>
>> On Fri, Jan 4, 2019 at 11:53 AM Arun Ganesh <arungra...@gmail.com> wrote:
>>
>>> Beauty of the internet, crawler got done by Srihari:
>>> https://twitter.com/sriharithalla/status/1080801313707896837
>>>
>>> JSON data:
>>> https://api.apify.com/v1/execs/TsBwnYutP5u9FCKp5/results?format=json&simplified=1
>>>
>>> I'm in the process of doing a little bit of cleanup using openrefine and
>>> will share on a spreadsheet.
>>>
>>> On Fri, Jan 4, 2019 at 10:01 AM Jasvinder Singh <
>>> jasvinsinghre...@gmail.com> wrote:
>>>
>>>> Dear All,
>>>>
>>>> Not all the members are familiar with the intricacies of the data
>>>> collection for such projects. Since this seems to be a crowd sourcing
>>>> endeavour, I suggest that the basic data collection protocol be enumerated
>>>> for newbies so that they can also contribute data which can then be put in
>>>> proper format by professionals.
>>>>
>>>> Regards,
>>>>
>>>> Jasvinder Singh
>>>>
>>>> On Tue, Dec 25, 2018 at 10:32 AM Nikhil VJ <nikhil...@gmail.com> wrote:
>>>>
>>>>> Hi folks,
>>>>>
>>>>> There's a project afoot in the OpenStreetMap and Wikidata communities
>>>>> to get together Indian Railways data.
>>>>>
>>>>> One major part of it: Properly mapping all the railway stations of
>>>>> India, and ensuring they have wikidata entries.
>>>>>
>>>>> Here's a wiki page set up for it:
>>>>> https://www.wikidata.org/wiki/Wikidata:WikiProject_Indian_Railways
>>>>>
>>>>> I'm cross-posting from OpenStreetMap India Telegram
>>>>> <https://t.me/OSMIndia> group:
>>>>>
>>>>> (Arun Ganesh): There seem to be around 7000 stations located. There are
>>>>> still ~1.5k missing. A lot more need names, refs and wikidata links.
>>>>> Overpass: http://overpass-turbo.eu/s/EC4
>>>>>
>>>>>
>>>>> (Srihari Thalla): Last year I created two MapRoulette Challenges to
>>>>> tag station codes and add Wiki tags:
>>>>> https://maproulette.org/mr3/challenge/2403
>>>>> https://maproulette.org/mr3/challenge/2404
>>>>>
>>>>> --------
>>>>>
>>>>> The Overpass query above queries the whole country and may be slow or
>>>>> time out. I adapted the query to work only on the map area being
>>>>> viewed, so you can zoom into smaller regions. I also changed a few
>>>>> things and included a legend in the comments to explain.
>>>>>
>>>>> https://overpass-turbo.eu/s/EL9
>>>>>
>>>>>
>>>>> Want to get involved? Engage here.
>>>>>
>>>>>
>>>>>
>>>>> Regards
>>>>>
>>>>> Nikhil VJ
>>>>>
>>>>> Pune, India
>>>>>
>>>>>
>>>>> PS: Posting on an older thread from '15 that had the perfect subject
>>>>> line - didn't want to create yet another new thread. Pro tip: Use
>>>>> Datameet from google groups
>>>>> <https://groups.google.com/forum/#!forum/datameet> - it's more fun and
>>>>> you can find stuff that was posted long before you joined.
>>>>>
>>>>> --
>>>>> Datameet is a community of Data Science enthusiasts in India. Know
>>>>> more about us by visiting http://datameet.org
>>>>> ---
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "datameet" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to datameet+unsubscr...@googlegroups.com.
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
