On 29 Feb 2012, at 05:43, Sarven Capadisli wrote:

> Hi, I was hoping you guys could clarify some of these questions for me:
> 
> When I import data into my TDB Triple Store using tdb.tdbloader, I get 
> information like the following:
> 
> Add: 4,150,000 triples (Batch: 2,380 / Avg: 4,684)
> Add: 4,200,000 triples (Batch: 29,620 / Avg: 4,732)
> 
> What is batch exactly?

Batch is the load rate for the last batch, that is, triples per second over the
last 50,000 triples. Avg is the same rate measured over the whole load so far.
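
A quick sanity check on how to read the two log lines quoted above. This is only
a sketch: it tests whether Batch and Avg behave like rates (triples per second)
or like per-batch times. The regex, the function names, and the 50,000-triple
batch size inferred from the Add counters are my own assumptions for
illustration, not anything taken from the TDB source.

```python
import re

# The two progress lines quoted in the question.
LOG_LINES = [
    "Add: 4,150,000 triples (Batch: 2,380 / Avg: 4,684)",
    "Add: 4,200,000 triples (Batch: 29,620 / Avg: 4,732)",
]

# Hypothetical pattern for this one log format (an assumption, not a TDB API).
PATTERN = re.compile(r"Add: ([\d,]+) triples \(Batch: ([\d,]+) / Avg: ([\d,]+)\)")


def parse(line):
    """Return (total_triples, batch_value, avg_value) as integers."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError(f"unrecognised progress line: {line!r}")
    return tuple(int(group.replace(",", "")) for group in match.groups())


def predicted_avg_if_rate(prev, curr):
    """Predict the new Avg, assuming Batch and Avg are triples per second."""
    prev_total, _, prev_avg = prev
    curr_total, curr_batch, _ = curr
    # Elapsed seconds so far = time before this batch + time for this batch.
    elapsed = prev_total / prev_avg + (curr_total - prev_total) / curr_batch
    return curr_total / elapsed


def predicted_avg_if_time(prev, curr):
    """Predict the new Avg, assuming Batch and Avg are per-batch durations."""
    prev_total, _, prev_avg = prev
    curr_total, curr_batch, _ = curr
    batches_before = prev_total // (curr_total - prev_total)
    # Running mean of per-batch durations after one more batch.
    return (batches_before * prev_avg + curr_batch) / (batches_before + 1)


if __name__ == "__main__":
    prev, curr = (parse(line) for line in LOG_LINES)
    print("logged Avg:          ", curr[2])
    print("predicted if a rate: ", round(predicted_avg_if_rate(prev, curr), 1))
    print("predicted if a time: ", round(predicted_avg_if_time(prev, curr), 1))
```

Under the rate reading the predicted Avg comes out at roughly 4,731, within
rounding of the logged 4,732. Under the time reading it comes out at roughly
4,981, which does not match.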

> Why does it differ from one step to another?

At a guess, other activity on the same host. A batch might also include a
sync to disk. I wouldn't have thought GC would be an issue.

> Is there a way to set the batch number?
> 
> Is there a way to configure TDB in order to perform faster importing?

Loading into an empty store with tdbloader2 is the usual advice. Paolo has been
working on a cross-platform version of it.

Damian
