I'm using Groovy, NekoHTML, and Xerces to parse a web page into xml.
Here's the sample code (called NewSaxParserText.groovy):

-------------------------------------------------------------
import org.cyberneko.html.parsers.SAXParser;

String url = "http://www.yahoo.com";
SAXParser parser = new SAXParser();
XmlParser xmlParser = new XmlParser(parser);
Node node = xmlParser.parse(url);
println(node)
-------------------------------------------------------------

I just updated my Xerces jar, and this code, which worked
xercesImpl-2.6.2.jar, broke when I updated to xercesImpl-2.8.1.jar. I
also tried updating to xercesImpl-2.9.0.jar, and got the same error.
Here is the stack trace:
Caught: java.lang.IncompatibleClassChangeError: Class
org.apache.xerces.parsers.AbstractSAXParser$AttributesProxy does not
implement the requested interface org.xml.sax.Attributes
        at NewSaxParserText.run(NewSaxParserText.groovy:6)
        at NewSaxParserText.main(NewSaxParserText.groovy)

How do I fix this? I looked at updating the cyberneko jar and that
doesn't help. Here are the results of my experiments of whether the
code executes or fails with various jar combinations:

<table>
<tr>
        <td><br></td><td>nekohtml-0.9.5</td><td>nekohtml-1.9.7</td>
</tr>
<tr>
        <td>xercesImpl-2.6.2</td><td>Executes</td><td>Executes</td>
</tr>
<tr>
        <td>xercesImpl-2.8.1</td><td>Fails</td><td>Fails</td>
</tr>
<tr>
        <td>xercesImpl-2.9.0</td><td>Fails</td><td>Fails</td>
</tr>
</table>

Thanks for your help!
   Best regards,
             Elizabeth

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to