On 01/05/17 18:20, Ian Monat wrote:
> ...  I've written a script using the requests module but I
> think a web scraper like Scrapy, Beautiful Soup or Selenium may be
> required.

I'm not sure what you are looking for. Scrapy, BS, etc. will
help you read the HTML but not fetch the file. Also, do
you want to process the file (extract the text) in Python
too, or is it enough just to fetch the file?
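
To make the distinction between the two steps concrete, here is
a minimal sketch, not a finished solution. It uses only the
standard library so that it runs without extra installs; the
requests module could handle the fetch step just as well. The
names LinkCollector, find_file_links and fetch_file, the sample
HTML and the example.com URL are all made up for illustration,
and a real page may need a different way of locating the link.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def find_file_links(html, base_url, suffix):
    """Step 1 (reading the HTML): return absolute URLs ending in suffix."""
    parser = LinkCollector()
    parser.feed(html)
    return [urljoin(base_url, href) for href in parser.links
            if href.lower().endswith(suffix)]


def fetch_file(url, dest_path):
    """Step 2 (fetching the file): download url and save it to dest_path."""
    with urlopen(url) as response, open(dest_path, "wb") as out:
        out.write(response.read())


# Hypothetical page content and URL, for illustration only.
sample_html = '<a href="/reports/jan.csv">January</a> <a href="/about">About</a>'
links = find_file_links(sample_html, "https://example.com/index.html", ".csv")
print(links)  # ['https://example.com/reports/jan.csv']
```

Only the first step needs an HTML parser. Once you have the
URL, fetch_file(links[0], "jan.csv") would do the download, and
that part is independent of how the link was found.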

If the problem is with reading the HTML then you need to
give us more detail about the problem areas and HTML
format.

If the problem is fetching the file, it sounds like you
have already done that and it should be a case of fine
tuning/tidying up the code you've written.

What kind of help exactly are you asking for?

-- 
Alan G
Author of the Learn to Program web site
http://www.alan-g.me.uk/
http://www.amazon.com/author/alan_gauld
Follow my photo-blog on Flickr at:
http://www.flickr.com/photos/alangauldphotos

