I am attempting to process RDF that contains characters outside the
ASCII range, and am getting a SAXParseException: not well-formed
(invalid token).
Call:
from rdflib.Graph import ConjunctiveGraph  # module path as shown in the trace below
store = ConjunctiveGraph()
store.load("ToolsTestA0Removed.rdf")
I thought this might be corrected by adding the encoding declaration to
the top of the file:
<?xml version='1.0' encoding='UTF-8'?>
But this did not correct the problem.
Is there a parsing option that I've missed, or some other error I'm
making? Will utf-8 encoding work for characters like hex A0 or hex 92?
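
One way to check what is actually in the file is to look at the raw bytes. A lone 0xA0 or 0x92 byte is not a valid UTF-8 sequence, so a file declared as UTF-8 but containing those bytes would be rejected by the parser regardless of the declaration. The sketch below is only an illustration under an assumption: that the file was saved in Windows-1252, where 0xA0 is a no-break space and 0x92 is a right single quote. The helper names and the sample bytes are made up for the example and are not part of rdflib.

```python
def first_invalid_utf8(data):
    """Return (offset, byte value) of the first byte that breaks
    UTF-8 decoding, or None if the data decodes cleanly."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start, data[err.start]
    return None


def transcode_cp1252_to_utf8(data):
    """Reinterpret the bytes as Windows-1252 (an assumption about the
    source encoding) and re-encode them as UTF-8."""
    return data.decode("cp1252").encode("utf-8")


# Hypothetical sample: an XML fragment declared as UTF-8 but holding
# raw 0x92 and 0xA0 bytes, as a Windows-1252 editor might save it.
sample = b"<?xml version='1.0' encoding='UTF-8'?><a>it\x92s\xa0ok</a>"

print(first_invalid_utf8(sample))   # reports where decoding fails
fixed = transcode_cp1252_to_utf8(sample)
print(first_invalid_utf8(fixed))    # None once the bytes are real UTF-8
```

If the real file turns out to be in some other encoding, the codec name passed to decode would need to change accordingly; alternatively the XML declaration could name the actual encoding instead of UTF-8.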
Thanks in advance for help
Dave J
Trace:
Traceback (most recent call last):
  File "C:\nbo\rdf2Forms.py", line 18, in <module>
    store.load("endpoint/ToolsTestA0Removed.rdf") # Saved by makeTriples.py.
  File "build\bdist.win32\egg\rdflib\Graph.py", line 665, in load
    self.parse(source, publicID, format)
  File "build\bdist.win32\egg\rdflib\Graph.py", line 828, in parse
    context.parse(source, publicID=publicID, format=format, **args)
  File "build\bdist.win32\egg\rdflib\Graph.py", line 661, in parse
    parser.parse(source, self, **args)
  File "build\bdist.win32\egg\rdflib\syntax\parsers\RDFXMLParser.py", line 37, in parse
    self._parser.parse(source)
  File "c:\python25\lib\xml\sax\expatreader.py", line 107, in parse
    xmlreader.IncrementalParser.parse(self, source)
  File "c:\python25\lib\xml\sax\xmlreader.py", line 123, in parse
    self.feed(buffer)
  File "c:\python25\lib\xml\sax\expatreader.py", line 211, in feed
    self._err_handler.fatalError(exc)
  File "c:\python25\lib\xml\sax\handler.py", line 38, in fatalError
    raise exception
SAXParseException: file:///C|/ToolsTestA0Removed.rdf:373:684: not well-formed (invalid token)