I'll have a look at this.  If tagsoup is present on a Debian system it should 
be detected automatically.
If not, its a fault of the package and i'll fix it.

On 22.02.2012, at 12:24, Christian Grün wrote:

> Tagsoup needs to be embedded in your classpath -- which is the case if
> BaseX is downloaded from our homepage). If you have installed BaseX
> via the Debian package manager, you'll have to manually embed the
> tagsoup.jar in the BaseX start scripts.
> 
> Hope this helps,
> Christian
> 
>> Well all I know is that
>> http://docs.basex.org/wiki/Parsers
>> should mention what to do to read HTML, and on my machine there is
>> $ apt-cache search tagsoup-java
>> libtagsoup-java - SAX-compliant parser for real-life HTML
>> libtagsoup-java-doc - API Documentation for TagSoup
>> 
>> Mainly it is tags like <img ...> without /> that throw basex off track.
>> _______________________________________________
>> BaseX-Talk mailing list
>> [email protected]
>> https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk
> _______________________________________________
> BaseX-Talk mailing list
> [email protected]
> https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk

-- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --
Alexander Holupirek
|-- http://www.informatik.uni-konstanz.de/~holupire
|-- Database & Information Systems Group, U Konstanz
`-- Room E 221, 0049 7531 88 2188 (phone) 3577 (fax)

_______________________________________________
BaseX-Talk mailing list
[email protected]
https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk

Reply via email to