I'll give TagSoup a try; I saw that it was in there. Thanks for the heads-up!

-byron

--- Andrzej Bialecki <[EMAIL PROTECTED]> wrote:

> Byron Miller wrote:
>
> >http://people.apache.org/~andyc/neko/doc/html/changes.html
> >
> >Any chance of getting that rolled in? Has a few fixes
> >that look good.
>
> Did you try using TagSoup? Some time ago I added to parse-html the
> support for using TagSoup instead of NekoHTML (this is an option in the
> config file). I found that in many cases TagSoup gives much better
> results, especially for pages with multiple <html> or <body> elements,
> where neko would give up...
>
> --
> Best regards,
> Andrzej Bialecki     <><
>  ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com