Broken GND dataset after loading with tdb2.xloader+tdb2.tdbloader

2022-02-21 Thread Neubert, Joachim
I've reloaded the GND dataset at http://zbw.eu/beta/sparql/gnd/query with 4.5.0-SNAPSHOT. The sources were a 133G .nt.gz file, plus several small .ttl files with ontology etc. I loaded the large one with tdb2.xloader, and immediately after that the smaller ones with tdb2.tdbloader (see

Re: Broken GND dataset after loading with tdb2.xloader+tdb2.tdbloader

2022-02-21 Thread Andy Seaborne
On 21/02/2022 08:27, Neubert, Joachim wrote: I've reloaded the GND dataset at http://zbw.eu/beta/sparql/gnd/query with 4.5.0-SNAPSHOT. The sources were a 133G .nt.gz file, plus several small .ttl files with ontology etc. I loaded the large one with tdb2.xloader, and immediately after that

Re: Geo indexing Wikidata

2022-02-21 Thread Marco Neumann
I have a vague memory of running into this issue in the past, Lorenz. The wikidata RDF serialization does not conform strictly to the OGC GeoSPARQL data model nor do the geospatial access methods of the blazegraph extension attempt to exhibit any of the standards compliant features. That's

Geo indexing Wikidata

2022-02-21 Thread Lorenz Buehmann
Hi, we can use this as an complementary thread for the ongoing "loading Wikidata" threads, this time with focus on the geospatial part. Joachim already did the same for the text index and it works as expected, though still loading time could be improved. For the geospatial index things

Re: Geo indexing Wikidata

2022-02-21 Thread Andy Seaborne
On 21/02/2022 09:07, Lorenz Buehmann wrote: Any experience or comments so far? Using SubsystemLifecycle, could make the conversions by GeoSPARQLOperations.convertGeoPredicates extensible. Andy But having coordinate location (P625), located on astronomical body (P376) as

WG: Broken GND dataset after loading with tdb2.xloader+tdb2.tdbloader

2022-02-21 Thread Neubert, Joachim
This mail of yesterday didn't get through - here again. The data of the broken load is temporarily linked from http://134.245.93.72/beta/tmp. I've now invoked /opt/jena/bin/tdb2.tdbloader --loader=parallel --loc=/zbw/var/lib/fuseki/databases/temp ../var/gnd/2021-11/src/GND.utf8.ttl.gz