Re: Setting a read limit when parsing a Graph

Alessandro Adamou Tue, 14 Aug 2012 05:00:50 -0700

Clearly I meant 1024 for 1 kiB (but anyway, once this works, I wouldlike to set it to 100k)

I was thinking that perhaps marks are being overridden, otherwiseXML-based parsers would fail as they could not encounter the closing tag(e.g. </rdf:RDF>).

So I was thinking, perhaps I should override the SimpleMGraph and set alimit to the triples instead?


Thank You

Alessandro


On 8/14/12 1:53 PM, Alessandro Adamou wrote:

Hi,
I need to write a function that performs lookahead of the OWL ontologyID for a Graph, therefore it has to scan the content up to a certainpoint to see if it has found an ontology IRI / version IRI pair.
I thought that setting mark() on a BufferedInputStream did the trick,something like:
MGraph graph = new SimpleMGraph();
BufferedInputStream bIn = new BufferedInputStream(content);
bIn.mark(1240); // Read up to 1k
parser.parse(graph, bIn, SupportedFormat.RDF_XML);

(parser has a Jena parser provider registered)
But apparently this is not working. Even for streams much longer than1 kiB, with the interesting triples right at the very end, thesetriples are always found.
Do the Clerezza parser override the marks on a buffered stream, ormaybe Jena is doing so? Or even better, am I doing this wrong?
Best,
-- Alessandro



--
M.Sc. Alessandro Adamou

Alma Mater Studiorum - Università di Bologna
Department of Computer Science
Mura Anteo Zamboni 7, 40127 Bologna - Italy

Semantic Technology Laboratory (STLab)
Institute for Cognitive Science and Technology (ISTC)
National Research Council (CNR)
Via Nomentana 56, 00161 Rome - Italy


"I will give you everything, just don't demand anything."
(Ettore Petrolini, 1917)

Not sent from my iSnobTechDevice

Re: Setting a read limit when parsing a Graph

Reply via email to