org.xml.sax.SAXParseException with XmlSlurper

Andrew Myers Wed, 18 Nov 2015 17:48:50 -0800

Hi,

For a while I've been using groovy to parse some badly formed HTML viaXmlSlurper in conjunction with TagSoup, something like this:


def slurper = new XmlSlurper(new org.ccil.cowan.tagsoup.Parser())
def html = slurper.parseText(htmlText)

It works fine when I unit test it with Gradle, but I've tried to deploythis inside another webapp which runs on Lucee (http://lucee.org/) but Ithink I'm running into some kind of "Jar hell". When I try to parse thehtmlText, I get an error like this which makes me think it's not usingthe tagsoup Parser

The exception is: org.xml.sax.SAXParseException, with a stracktracestarting like this:

The element type "meta" must be terminated by the matching end-tag"</meta>". at org.apache.xerces.parsers.AbstractSAXParser.parse(UnknownSource):-1 atorg.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(UnknownSource):-1 at groovy.util.XmlSlurper.parse(XmlSlurper.java:205):205 atgroovy.util.XmlSlurper.parse(XmlSlurper.java:258):258 atgroovy.util.XmlSlurper.parseText(XmlSlurper.java:284):284 atgroovy.util.XmlSlurper$parseText.call(Unknown Source):-1 atorg.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45):45atorg.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108):108atorg.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116):116at

I'm a bit lost as to what to look for to debug this. Has anyone comeacross anything similar?


Thanks!
Andrew.

org.xml.sax.SAXParseException with XmlSlurper

Reply via email to