[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2024-04-19 Thread Gehel
Gehel closed this task as "Resolved".
Gehel claimed this task.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Gehel
Cc: Daniel_Mietchen, Jane023, dr0ptp4kt, Fnielsen, EgonWillighagen, dcausse, 
Aklapper, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, AWesterinen, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, 
EBjune, KimKelting, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2024-04-05 Thread dcausse
dcausse moved this task from Blocked/Waiting to Needs Reporting on the 
Discovery-Search (Current work) board.
dcausse added a comment.


  Two scholia queries were rewritten:
  
  - 
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_graph_split/Federated_Queries_Examples#Property_paths
  - 
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_graph_split/Federated_Queries_Examples#Number_of_articles_with_CiTO-annotated_citations_by_year
  
  The pages also contains some documentation about to approach such rewrites.
  I'm boldly moving this ticket to our Needs Reporting (prior to be closed) 
column as I believe further explorations about how to rewrite scholia queries 
to support the split could perhaps be better handled in 
https://github.com/WDscholia/scholia.
  
  But please free to re-open this ticket if you believe it has some value.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

WORKBOARD
  https://phabricator.wikimedia.org/project/board/1227/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: Jane023, dr0ptp4kt, Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, 
Aklapper, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, AWesterinen, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, 
EBjune, KimKelting, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-12-01 Thread Gehel
Gehel edited parent tasks, added: T352538: Evaluate the impact of the graph 
split; removed: T337013: [Epic] Splitting the graph in WDQS.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Gehel
Cc: dr0ptp4kt, Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, 
Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-31 Thread dcausse
dcausse added a comment.


  @EgonWillighagen thanks for the question!
  The set of triples that will be part of the split are the triples that we 
consider //owned// by the item, in other words these are the triples listed by 
Special:EntityData using the //dump// flavor, e.g. 
https://www.wikidata.org/wiki/Special:EntityData/Q59239844.ttl?flavor=dump.
  A scholarly article item will be part of the //scholarly item// subgraph if 
it matches this constraint: `?item wdt:P31 wd:Q5633421`.
  All its corresponding triples will also be part of the split, P53121 
 is a relatively painful query that 
demonstrate what triples can be considered //owned// by an entity and thus 
moved alongside the scholarly article to the same subgraph.
  
  For instance in my query the BGP `?article wdt:P50 wd:Q1042470` matches a 
triple owned by the article and thus is queryable from the split.
  On the hand everything requiring access to the triples owned by the author 
`wd:Q1042470` is not queryable from the split and thus the BGP:
  
?article wdt:P50 ?author .
?author wdt:P213 " 0001 2124 7940"
  
  won't be possible and would require federation like:
  
# all papers by ISNI  0001 2124 7940 (Carlo Rovelli)
SELECT ?article ?articleLabel {
  ?author wdt:P213 " 0001 2124 7940"
  SERVICE  {
# Querying the scholarly article split
?article wdt:P50 ?author .
BIND(?articleLabel as ?articleLabel) .
SERVICE wikibase:label { bd:serviceParam wikibase:language 
"[AUTO_LANGUAGE],en". }
  }
}
  
  The target endpoint being the main graph and the federated one being the 
scholarly article split.
  I suppose federation can be done the other way around with:
  
# all papers by ISNI  0001 2124 7940 (Carlo Rovelli)
SELECT ?article ?articleLabel {
  SERVICE  {
# Querying the wikidata main graph split
?author wdt:P213 " 0001 2124 7940"
  }
  hint:Prior hint:runFirst true . # Tell blazegraph to first collect ?author
  ?article wdt:P50 ?author .
  SERVICE wikibase:label { bd:serviceParam wikibase:language 
"[AUTO_LANGUAGE],en". }
}
  
  Where the target endpoint is the scholarly split and the federated one the 
main wikidata graph.
  In the later example we already see that we have to help blazegraph by 
telling it what to run first (here collect the author information first).
  
  I agree that using the current wdqs endpoint federating itself can be error 
prone but it's in theory possible to use it if someone is interested in doing 
early experiments.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dr0ptp4kt, Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, 
Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-30 Thread Gehel
Gehel moved this task from Incoming to Current work on the 
Wikidata-Query-Service board.
Gehel added a project: Discovery-Search (Current work).

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

WORKBOARD
  https://phabricator.wikimedia.org/project/board/891/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Gehel
Cc: Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, 
Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-28 Thread EgonWillighagen
EgonWillighagen added a comment.


  > Note that early experiments can be done by federating wdqs with itself, 
e.g. https://w.wiki/7vE9.
  
  Thanks for the example. Before I can experiment, I need to know which item 
types end up in which SPARQL endpoint. The example query suggest the author 
information will also go into the split. I am looking forward to the first 
experimental splitted endpoint to be available.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, 
Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-27 Thread dcausse
dcausse added subscribers: EgonWillighagen, Daniel_Mietchen, Fnielsen.
dcausse added a comment.


  @Daniel_Mietchen @Fnielsen @EgonWillighagen  as discussed in our previous 
meeting here is the task to coordinate the efforts around exploring federation 
for scholia queries. The ticket description is very minimal but should evolve 
as we make progress, thanks!

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, 
Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-27 Thread Maintenance_bot
Maintenance_bot added a project: Wikidata.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Maintenance_bot
Cc: dcausse, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, 
EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, 
jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-27 Thread dcausse
dcausse added a parent task: T337013: [Epic] Splitting the graph in WDQS.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dcausse, Aklapper, AWesterinen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, 
EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, 
Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-27 Thread dcausse
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  The purpose of this ticket it to explore how federation could be used to 
rewrite scholia queries in the context of the WDQS graph split using the naive 
rule `?e wdt:P31 wd:Q13442814`.
  
  Queries to analyze/explore (please add more):
  
  - //Topic// queries: 
https://github.com/search?q=repo%3AWDscholia%2Fscholia%20path%3Atopic*.sparql=code
  
  Note that early experiments can be done by federating wdqs with itself, e.g. 
https://w.wiki/7vE9.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dcausse, Aklapper, AWesterinen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, 
EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, 
Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org