Ted Yu Fri, 11 Dec 2009 14:23:49 -0800
Hi, we want to strip out irrelevant content from the web pages we crawl. An example of irrelevant content is the display ads that surround the main body of an article on a web page.
Please share your experience. Thanks.