On 12-02-29 07:12 AM, Paolo Castagna wrote:
Sarven Capadisli wrote:
On 12-02-29 05:09 AM, Damian Steer wrote:
At a guess, other stuff happening on the same host? A batch might
include a sync to disk too. I wouldn't have thought GC would be an issue.
Not to my knowledge. I get the feeling that the disk falls asleep.
Hence, I'm investing with what I have right now.
Loading from empty using tdbloader2 is the usual advice. Paolo has
been working on a cross platform version of this.
Is it possible to use it on an existing store?
Hi Sarven,
a pure Java version of tdbloader2 (named tdbloader3) is available
as an *experimental* prototype, but it is just for bulk loads on an
initial empty store (as I think is the case for the existing
tdbloader2, right?).
Hi Paolo, thanks a lot for that info. I will give tdbloader2/3 a go on
separate store.
Code here:
https://svn.apache.org/repos/asf/incubator/jena/Scratch/PC/tdbloader3/trunk/
JIRA issue here:
https://issues.apache.org/jira/browse/JENA-117
Great thank you!
How many triples is the RDF dataset you are trying to load?
I don't know the count at the moment.. but I've mentioned the size in
reply to Andy's email; ~5100 RDF/XML files ~35 GB in total. Different
sizes. Largest files are less than 15 MB.
-Sarven