[R] My First Attempt at Screen Scraping with R

Abraham Mathew Fri, 06 May 2011 17:49:51 -0700

Hello Folks,

I'm trying to scrape my first web site and ran into an issue
because I really
don't know anything about regular expressions in R.


library(XML)
library(RCurl)

site <- "http://thisorthat.com/leader/month";
site.doc <- htmlParse(site, ?, xmlValue)


At the ?, I realize that I need to insert a regex command which will
decipher the contents of the web page...right?
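
For context, a minimal sketch of how this step is commonly done with the XML package: htmlParse() builds a document tree, and the contents are then pulled out with an XPath query passed to xpathSApply(), rather than with a regex. The inline page, the "item" class, and the XPath expression below are assumptions for illustration only; they are not taken from the actual site.

```r
library(XML)

# Hypothetical stand-in for the real page; the markup is an assumption.
html <- '<html><body>
  <h1>Leaders</h1>
  <p class="item">First entry</p>
  <p class="item">Second entry</p>
</body></html>'

# Parse the HTML text into a document tree (asText = TRUE because we
# pass the markup itself rather than a file name or URL).
doc <- htmlParse(html, asText = TRUE)

# Select nodes with XPath and extract their text with xmlValue.
items <- xpathSApply(doc, "//p[@class='item']", xmlValue)
print(items)
```

With a live page, htmlParse(site) can be given the URL directly; the XPath expression would have to be adapted to whatever markup the page actually uses.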

First, I'm not sure if the contents of the site would be considered a table
and I'm also not sure how to disregard pictures
when scraping the site.
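
As a rough sketch of both points, assuming the page does use a <table> element: readHTMLTable() from the XML package returns every table it finds, so a length of zero would suggest the content is not a table. Images carry no text, so xmlValue simply skips them. The markup below is made up for illustration and is not the real site's.

```r
library(XML)

# Hypothetical markup: a table whose cells also contain images.
html <- '<html><body>
<table>
<tr><th>Rank</th><th>Name</th></tr>
<tr><td>1</td><td><img src="a.png"/>Alpha</td></tr>
<tr><td>2</td><td><img src="b.png"/>Beta</td></tr>
</table>
</body></html>'

doc <- htmlParse(html, asText = TRUE)

# One list element per <table> found; zero means no HTML tables.
tables <- readHTMLTable(doc, stringsAsFactors = FALSE)
print(length(tables))
leaders <- tables[[1]]
print(leaders)

# Text of each cell; <img> nodes contribute nothing to xmlValue.
cells <- xpathSApply(doc, "//td", xmlValue)
print(cells)
```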



> sessionInfo()
R version 2.13.0 (2011-04-13)
Platform: i686-pc-linux-gnu (32-bit)


Please Help!
Abraham


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
