http://nokogiri.org/ is great for this. You need parsing html, look at tutorial on their site: http://nokogiri.org/tutorials/parsing_an_html_xml_document.html
2012/9/6 Sybren Kooistra <[email protected]> > Hi all, > > I've collected a number of thousands of .hmtl documents and I need to > know how to parse through all these documents (that are in one folder) > automatically. > > So, I want to copy certain parts of all of these .html documents (for > example the header), but the websites are offline, on my hard disk, in > stead of online. > > What's the way to go? > > -- > Posted via http://www.ruby-forum.com/. > > -- You received this message because you are subscribed to the Google Groups ruby-talk-google group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at https://groups.google.com/d/forum/ruby-talk-google?hl=en
