I was able to use libs/xml to EASILY parse the reddit rss feeds.  Now, I want 
to get a little bit more information and actually parse the HTML document for 
a project I am working on.

The xhtml document looks like standard xhtml with the data represented in a 
table full of tr/tds, etc.  My question, is the xml library 'lazy' enough to 
allow for possible non standard patterns or should I use something like the 
pattern matcher.

My goal initially, is just to collect the rating, title, poster, see 
reddit.com (view source).

--
Berlin Brown
berlin dot brown at gmail dot com or 
bbrown at botspiritcompany dot com


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Factor-talk mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/factor-talk

Reply via email to