Hi Rajesh,

If you load the same ttl file 5 times, you will have 5 copies of the same 
triples in the database. MarkLogic simply persists what you ask it to persist. 
They also occur multiple times in the triple index (since they are linked to 
different fragments), but are deduplicated automatically by the SPARQL engine.

If you don’t want them inserted multiple times, you’ll have to find a way of 
preventing to insert them.

Perhaps SPARQL Update could be a solution?

Cheers

From: 
<[email protected]<mailto:[email protected]>>
 on behalf of Rajesh Kumar <[email protected]<mailto:[email protected]>>
Reply-To: MarkLogic Developer Discussion 
<[email protected]<mailto:[email protected]>>
Date: Monday, October 3, 2016 at 1:46 PM
To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Subject: [MarkLogic Dev General] duplicate triplestore files when loaded using 
rdf and mlcp

Hi Team,

When using sem:rdf-load and loading the turtle ( .ttl ) file , i observed 
multiple triple store files are getting created and the same with mlcp as well. 
But my Sparql query is working as expected eventhough there ae multiple 
triplestore files exist in database ( I guess that will be taken care by triple 
store.)

Load query:

import module namespace sem = "http://marklogic.com/semantics";
      at "/MarkLogic/semantics.xqy";

sem:rdf-load(
    'D:\rajesh\poc\pearson_HED\abc.ttl',
    ("turtle","override-graph=http://www.abc.org/ontologies/abc";),
    (),
    xdmp:default-permissions()
    )

Kindly let me know is there any way that we can restrict duplicate files in db. 
Tried with both graph and default options as well.

Thanks & Regards,
Rajesh
_______________________________________________
General mailing list
[email protected]
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to