If you do not need a pure R solution, you might also find it helpful to blend languages. For scraping and munging tasks such as this I generally turn to python to do extraction then feed data to R for analysis via rpy.
On Thu, Nov 20, 2014 at 8:57 PM, Spencer Graves < spencer.gra...@structuremonitoring.com> wrote: > The Ecfun package includes functions written to scrape data from web > pages. See, e.g., readUShouse, readUSsenate, readUSstateAbbreviations. > They use getURL{RCurl} and readHTMLTable{XML}. > > > Hope this helps. > > > Spencer Graves > > > > On 11/20/2014 5:42 PM, Matt Considine wrote: > >> Hi, >> I'm wondering if anyone can point me to code to parse data on Google >> Finance pages, i.e. parse the results of a URL request such as this >> http://www.google.com/finance?q=apple >> >> I know how to return the contents of the page; it's figuring out the best >> tools to parse it that I'm interested in and hopefully someone has already >> done this. >> >> (For what it is worth, the only info I am looking for are the ticker, >> exchange, currency and "Mkt Cap" datapoint) >> >> Thanks in advance for any help - scraping is not my strong suit. >> Matt >> >> >> --- >> This email is free from viruses and malware because avast! Antivirus >> protection is active. >> >> ______________________________________________ >> R-help@r-project.org mailing list >> https://stat.ethz.ch/mailman/listinfo/r-help >> PLEASE do read the posting guide http://www.R-project.org/ >> posting-guide.html >> and provide commented, minimal, self-contained, reproducible code. >> > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/ > posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.