Hi,

I recently needed to convert 450+ XML files to triples.  Based on my
previous experience trying to import a large XML file that took 4 hours to
complete, I used SPARQLMotion to convert all 450 files in less than 5
minutes.  A HUGE improvement!  (I tried the import method and it ran 10
hours and only converted ~200 files.  Most of the files are small).

As output, I wrote a ttl file to disk (~1GB).  I then decided I wanted the
triples in TDB so I opened the TTL file and ran an insert to push the
approximately 5 million triples into TDB.  This ran for over an hour with
no sign of completing.

Next I tried to use the SM module ExportToTDB.  This ran quickly but
resulted in no triples in TDB.

Finally, I tried the SM module PerformUpdate with which I have had varying
levels of success in the past.  It appears to be following the same course
as the manual insert query in that the CPU utilization indicates something
is happening but I see no change in the TDB files on disk.

What should I expect for TDB performance?  I did not think of 5M triples as
being large and, based on published performance figures, I was expecting
<30 minutes for insertion.  I'm using TBC 6.4 Beta on Windows 10, quad-core
I7, SSD, 32GB RAM of which TBC is only using 4GB of an available Xmx=28GB.

Thanks for your input,

Tim

-- 
You received this message because you are subscribed to the Google Groups 
"TopBraid Suite Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to topbraid-users+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/topbraid-users/CAF0WbnLZCazjPiwUvkUmRHKnSQKo7_aO%2B%2BALZfxxn7moFbcdXQ%40mail.gmail.com.

Reply via email to