On Sun, 31 Jan 2010 20:05:46 +0800, Zhang Weiwu wrote:

> $ tidy -q -asxml -utf8 page_07_zh.html | xpath -e
> '//d...@class="advertisement"]'

Exactly. Glad that you found both tidy and libxml-xpath-perl, and solved
the problem yourself.
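
For anyone without the xpath tool at hand, the same kind of selection can be sketched with Python's standard library. This is only an illustration under assumptions: the sample markup is a made-up stand-in for what `tidy -asxml` might emit, the XHTML namespace handling is assumed, and the `div` element name is a guess because the quoted expression is truncated in the archive. The helper name `select_by_class` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for well-formed XHTML such as `tidy -asxml` might produce.
XHTML = """<html xmlns="http://www.w3.org/1999/xhtml">
  <body>
    <div class="content">Article text</div>
    <div class="advertisement">Buy now</div>
  </body>
</html>"""

# Assumption: the document declares the XHTML namespace, so the path
# must use a prefix that is mapped to it.
NS = {"x": "http://www.w3.org/1999/xhtml"}


def select_by_class(xml_text, tag="div", cls="advertisement"):
    """Return all <tag> elements whose class attribute equals cls exactly."""
    root = ET.fromstring(xml_text)
    # ElementTree supports a limited XPath subset, including [@attr='value'].
    return root.findall(f".//x:{tag}[@class='{cls}']", NS)


matches = select_by_class(XHTML)
for element in matches:
    # Serialize each matching element back to text.
    print(ET.tostring(element, encoding="unicode"))
```

Note that ElementTree only parses well-formed XML, which is why running the page through tidy first matters here as well.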

-- 
Tong (remove underscore(s) to reply)
  http://xpt.sourceforge.net/techdocs/
  http://xpt.sourceforge.net/tools/


