[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-02-07 Thread AWesterinen
AWesterinen added a comment. @AndySeaborne Agree. I was erring on the side of explaining where the SPARQL endpoint came from (not Jena TDB). TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AW

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-02-07 Thread AndySeaborne
AndySeaborne added a comment. @AWesterinen - Fuseki is part of Jena. Most of the subsystems have informal names. People refer to "Jena" or "Fuseki" interchangeably and the context is the task they are doing. Being more specific on naming didn't catch on. TASK DETAIL https://phabricator.wik

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-02-07 Thread AndySeaborne
AndySeaborne added a comment. @Thadguidry - https://lists.apache.org/thread/vso02pwg4z6qcs3r1h0mcbc86ls74bhm where --parallel (the argument on sort(1) that is set by --threads) was set to 16. It took 31h compared to 39h without --parallel on sort(1). TASK DETAIL https://phab

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-02-06 Thread AWesterinen
AWesterinen added a comment. You add Fuseki to Jena to get a SPARQL endpoint. Jena + Fuseki is reasonable to investigate as a Blazegraph Alternative. TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-02-06 Thread Thadguidry
Thadguidry added a comment. Hi @AndySeaborne What is the latest benchmarks for loading Wikidata all and truthy with Jena 4.4.0 release annd the new TDB2 xloader with "--threads" argument? I noticed the release notes said this: > == Improved bulk loader > > This release includes the

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-24 Thread MPhamWMF
MPhamWMF triaged this task as "Medium" priority. TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: MPhamWMF Cc: Osmasuominen, dcausse, Smalyshev, Aklapper, Lucas_Werkmeister_WMDE, Gehel, Andrawaag,

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-24 Thread DD063520
DD063520 added a comment. @So9q : How would you like to serve everything from one place? It is normal to have replica of data. One of the big bottlenecks is IO. Or do I understand something wrong? TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricato

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-24 Thread So9q
So9q added a comment. I read the whole thread and just want to point out that Jena supports SPARQL Update also. From what I can see, it seems to be able to replace Blazegraph. But it does not solve the issue of having multiple parallel servers all with their own snapshot of the current

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-24 Thread dcausse
dcausse added a comment. Sorry for the confusion that the rename I did of this task caused. Just to bring clarity on my reasoning as a maintainer of the wikidata query service stack as to why being specific on TDB2 might be helpful: - Some components of Jena are already being used (i.e.

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-23 Thread Osmasuominen
Osmasuominen added a comment. I have to agree with @AndySeaborne - talking about "Apache Jena with TDB2" makes as much sense as talking about "VW Beetle with an internal combustion engine". The framing makes it sound like Beetles come with all kinds of engines, though in reality they've all

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-22 Thread AndySeaborne
AndySeaborne added a comment. All - I'm sorry that this sub-task is being redirected to be about Virtuoso. This would be better moved to the Virtuoso task. Apache Jena releases a single software product. TDB is the only persistence layer for Apache Jena that comes from the Apache Jena pr

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-21 Thread TallTed
TallTed added a comment. @AndySeaborne -- It has been my understanding that Apache Jena (the framework) performs differently (which may include different speeds of various actions, which may have different limitations and/or comprise a different list) when the active "low level storage choic

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-21 Thread AndySeaborne
AndySeaborne added a subscriber: dcausse. AndySeaborne added a comment. Hi @dcausse - TDB2 on it's own doesn't provide SPARQL nor any of the other features. TDB2 is just one low level storage choice - it's not a standalone thing. I hope you find the description clearer now. The project h

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-21 Thread AndySeaborne
AndySeaborne renamed this task from "Evaluate Apache Jena TDB2" to "Evaluate Apache Jena". AndySeaborne updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AndySeaborne C

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena TDB2

2022-01-21 Thread dcausse
dcausse renamed this task from "Evaluate Apache Jena" to "Evaluate Apache Jena TDB2". dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Smalyshev, A

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-18 Thread AndySeaborne
AndySeaborne updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T299460 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AndySeaborne Cc: Smalyshev, Aklapper, Lucas_Werkmeister_WMDE, Gehel, Andrawaag, Addshore, Susannaanas, Ak

[Wikidata-bugs] [Maniphest] T299460: Evaluate Apache Jena

2022-01-18 Thread AndySeaborne
AndySeaborne created this task. AndySeaborne added projects: Wikidata-Query-Service, Epic, Wikidata, MediaWiki-Stakeholders-Group. TASK DESCRIPTION Apache Jena https://jena.apache.org/ provides SPARQL 1.1, SHACL (core and SPARQL), ShEx, and RDF-star. Jena has been reported to [https://li