I am playing with a little test program here, trying to get Xerces to parse
the dmoz.com RDF feed. So far so good, except that it does not like
special characters. For example, the parser gives a fatal error after
encountering the tilde (the squiggly line) above the character n in Español.
Any thoughts on how to work past this?

Thanks
www.shrio.com

Example Feed:
----------------------------------
<Topic r:id="Top/World">
  <tag catid="16"/>
  <d:Title>World</d:Title>
  <narrow r:resource="Top/World/Español"/>
  <narrow r:resource="Top/World/Français"/>
  <narrow r:resource="Top/World/Italiano"/>
  <narrow r:resource="Top/World/Românã"/>
  <narrow r:resource="Top/World/Català"/>
  <narrow r:resource="Top/World/Farsi"/>
</Topic>

Error:
------------------
(Note: Line 694 above is Español, and 695 is Français)
[Fatal Error] structure_example.txt:695:4: The value of attribute "r:resource" must not contain the '<' character.
org.xml.sax.SAXException: Stopping after fatal error: The value of attribute "r:resource" must not contain the '<' character.
    at org.apache.xerces.framework.XMLParser.reportError(XMLParser.java:1245)
    at org.apache.xerces.framework.XMLDocumentScanner.reportFatalXMLError(XMLDocumentScanner.java:598)
    at org.apache.xerces.framework.XMLDocumentScanner.scanAttValue(XMLDocumentScanner.java:528)
    at org.apache.xerces.framework.XMLDocumentScanner.scanElement(XMLDocumentScanner.java:1769)
    at org.apache.xerces.framework.XMLDocumentScanner$ContentDispatcher.dispatch(XMLDocumentScanner.java:1182)
    at org.apache.xerces.framework.XMLDocumentScanner.parseSome(XMLDocumentScanner.java:381)
    at org.apache.xerces.framework.XMLParser.parse(XMLParser.java:1098)
    at org.apache.xerces.framework.XMLParser.parse(XMLParser.java:1139)
    at dmoz.main(dmoz.java:59)
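
Possible workaround:
------------------
One thing that might be worth trying, on the assumption (not confirmed) that
the feed is stored as ISO-8859-1 and carries no encoding declaration, so the
parser falls back to UTF-8 and misreads the accented bytes. The sketch below
decodes the bytes explicitly and hands the parser a character stream. It uses
the standard JAXP/SAX classes rather than the org.apache.xerces classes in the
trace above; the class name Latin1Parse and the method name resources are
made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, not part of Xerces or of the original program.
class Latin1Parse {

    /** Parses the stream as ISO-8859-1 XML and collects every r:resource value. */
    static List<String> resources(InputStream in) throws Exception {
        // Assumption: the bytes are ISO-8859-1. Decoding them here means the
        // parser reads characters and does not guess the encoding itself.
        Reader reader = new InputStreamReader(in, StandardCharsets.ISO_8859_1);
        List<String> found = new ArrayList<>();

        // The default factory is not namespace-aware, so the undeclared
        // "r:" prefix in this fragment is treated as part of a plain name.
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.newSAXParser().parse(new InputSource(reader), new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                String value = atts.getValue("r:resource");
                if (value != null) {
                    found.add(value);
                }
            }
        });
        return found;
    }

    public static void main(String[] args) throws Exception {
        // A cut-down version of the example feed, encoded as ISO-8859-1 bytes.
        String xml = "<Topic r:id=\"Top/World\">"
                   + "<narrow r:resource=\"Top/World/Espa\u00f1ol\"/>"
                   + "<narrow r:resource=\"Top/World/Fran\u00e7ais\"/>"
                   + "</Topic>";
        byte[] latin1 = xml.getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(resources(new ByteArrayInputStream(latin1)));
    }
}
```

If the encoding guess is right, an alternative that needs no code change would
be to put <?xml version="1.0" encoding="ISO-8859-1"?> on the first line of the
file, so the parser knows how to decode it.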


