Questions about bulk loading .rdf from a directory path

I have a directory of .rdf files partitioned into the following graph structure.
Within /home/lod_data I have the following entries; the directories contain
.rdf files.
/home/lod_data/bbc
/home/lod_data/dbpedia
/home/lod_data/eunis
/home/lod_data/geonames
/home/lod_data/global.graph
/home/lod_data/gni
/home/lod_data/index.rdf
/home/lod_data/taxonconcept
There is a global.graph file at the top level, and a .graph file in each of
the directories for the specific subgraph,
for instance: taxonconcept/taxonconcept.ext.graph
Most of these are just one directory deep with .rdf files; one, "dbpedia", has
subdirectories for species, authors, etc.
If I run the following from isql-vt:
SQL> ld_dir_all ('/home/lod_data', '*.rdf', 'http://lsd.taxonconcept.org/dataspace');
SQL> rdf_loader_run ();
it loads all the triples into the global graph
<http://lsd.taxonconcept.org/dataspace>.
When I do a count by graph, I don't see the subgraphs:
SQL> SPARQL SELECT ?g count(*) WHERE {GRAPH ?g {?s ?p ?o} };
I had thought that this would work:
ld_dir ('/home/lod_data', '*.rdf', 'http://lsd.taxonconcept.org/dataspace');
In the past, ld_dir only worked when I loaded one graph at a time, for
instance:
ld_dir ('/home/lod_data/taxonconcept', '*.rdf',
'urn:org:linkedopenspeciesdata:dataspace:taxonconcept');
rdf_loader_run ();
In my experience, ld_dir works when the .rdf files are all within the first
directory, with no subdirectories, while ld_dir_all works with directories
that have subdirectories.
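
To make the one-graph-at-a-time pattern concrete, here is a rough sketch of
what it might look like repeated for each subdirectory. It uses only the calls
already shown in this message plus checkpoint. Only the taxonconcept graph IRI
is real; the other graph IRIs are hypothetical names that follow the same
pattern. It also assumes that the graph argument is what ends up being used for
each directory; a .graph file in a directory may take precedence over it.

```sql
-- Sketch only: register each subdirectory with its own graph IRI.
-- Graph IRIs other than the taxonconcept one are hypothetical.
ld_dir ('/home/lod_data/taxonconcept', '*.rdf',
        'urn:org:linkedopenspeciesdata:dataspace:taxonconcept');
ld_dir ('/home/lod_data/bbc', '*.rdf',
        'urn:org:linkedopenspeciesdata:dataspace:bbc');
ld_dir ('/home/lod_data/eunis', '*.rdf',
        'urn:org:linkedopenspeciesdata:dataspace:eunis');
ld_dir ('/home/lod_data/geonames', '*.rdf',
        'urn:org:linkedopenspeciesdata:dataspace:geonames');
ld_dir ('/home/lod_data/gni', '*.rdf',
        'urn:org:linkedopenspeciesdata:dataspace:gni');

-- dbpedia has subdirectories, so use the recursive variant here.
ld_dir_all ('/home/lod_data/dbpedia', '*.rdf',
            'urn:org:linkedopenspeciesdata:dataspace:dbpedia');

-- Load everything that was registered, then make the load durable.
rdf_loader_run ();
checkpoint;

-- Count triples per graph to check that each subgraph is present.
SPARQL SELECT ?g (COUNT(*) AS ?triples)
WHERE { GRAPH ?g { ?s ?p ?o } }
GROUP BY ?g;
```

This still gives separate graphs rather than one global graph that contains
them, so it only covers part of what I am after.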
I have a slightly modified version of the example dbpedia bulk_loader.isql,
changed to use a different global graph and .rdf rather than .n3, which
I will attach to this message.
My goal is to understand some of the nuances of how to bulk load .rdf
from a directory structure like the one above, so I can make it easy for
others.
Specifically, I am looking for a set of procedures that will create a global
graph containing subgraphs, based on a structure similar to what I have
described.
Thanks in Advance,
- Pete
----------------------------------------------------------------
Pete DeVries
Department of Entomology
University of Wisconsin - Madison
445 Russell Laboratories
1630 Linden Drive
Madison, WI 53706
TaxonConcept Knowledge Base <http://www.taxonconcept.org/> / GeoSpecies
Knowledge Base <http://lod.geospecies.org/>
About the GeoSpecies Knowledge Base <http://about.geospecies.org/>
------------------------------------------------------------
Attachment: bulk_loader_txn.isql
