On 12-02-29 07:12 AM, Paolo Castagna wrote:
Sarven Capadisli wrote:
On 12-02-29 05:09 AM, Damian Steer wrote:
At a guess, other stuff happening on the same host? A batch might
include a sync to disk too. I wouldn't have thought GC would be an issue.

Not to my knowledge. I get the feeling that the disk falls asleep.
Hence, I'm investing with what I have right now.

Loading from empty using tdbloader2 is the usual advice. Paolo has
been working on a cross platform version of this.

Is it possible to use it on an existing store?

Hi Sarven,
a pure Java version of tdbloader2 (named tdbloader3) is available
as an *experimental* prototype, but it is just for bulk loads on an
initial empty store (as I think is the case for the existing
tdbloader2, right?).

Hi Paolo, thanks a lot for that info. I will give tdbloader2/3 a go on separate store.

Code here:
https://svn.apache.org/repos/asf/incubator/jena/Scratch/PC/tdbloader3/trunk/

JIRA issue here:
https://issues.apache.org/jira/browse/JENA-117

Great thank you!

How many triples is the RDF dataset you are trying to load?

I don't know the count at the moment.. but I've mentioned the size in reply to Andy's email; ~5100 RDF/XML files ~35 GB in total. Different sizes. Largest files are less than 15 MB.

-Sarven

Reply via email to