On 29 Feb 2012, at 05:43, Sarven Capadisli wrote: > Hi, I was hoping if you guys could clarify some of these questions for me: > > When I import data into my TDB Triple Store using tdb.tdbloader, I get > information like the following: > > Add: 4,150,000 triples (Batch: 2,380 / Avg: 4,684) > Add: 4,200,000 triples (Batch: 29,620 / Avg: 4,732) > > What is batch exactly?
Batch is the time taken to load the last batch, that is the last 50,000 triples. > Why does it differ from one step to another? At a guess, other stuff happening on the same host? A batch might include a sync to disk too. I wouldn't have thought GC would be an issue. > Is there a way to set the batch number? > > Is there a way to configure TDB in order to perform faster importing? Loading from empty using tdbloader2 is the usual advice. Paolo has been working on a cross platform version of this. Damian
