On 21/05/2009, at 5:18 PM, Dylan Fogarty-MacDonald wrote:
> I was using a combination of Nokogiri, HTML tidy (for some really
> nasty pages) and net/http. Now that I look at it, I probably
> should've been using Mechanize. Definitely check out YQL.
Mechanize supports both Hpricot and Nokogiri
> Wow. That's insane Lachlan! Can it be configured to work with other sites?
If you look at the examples of 'data' queries towards the bottom of
the list on the right, you'll see that it can parse HTML, RSS, ATOM,
JSON, XML, Microformats and more directly from a URL.
> I need something for on th
I was using a combination of Nokogiri, HTML tidy (for some really nasty
pages) and net/http. Now that I look at it, I probably should've been using
Mechanize. Definitely check out YQL.
Dylan
2009/5/21 Lachlan Hardy
>
> > I used mechanize a little while back. What are your current preferred
> scr
Wow. That's insane Lachlan! Can it be configured to work with other sites?
I need something for on the plane tomorrow - I've saved off copies of some
sites and want to write a scraper/aggregator.
- Mike
On Thu, May 21, 2009 at 5:12 PM, Lachlan Hardy wrote:
>
> > I used mechanize a little whil
> I used mechanize a little while back. What are your current preferred screen
> scraping tools?
http://developer.yahoo.com/yql/console/
--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups "Ruby
or Rails Oceania" group.