[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-05-24 Thread Gehel
Gehel closed this task as "Resolved".

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse, Gehel
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, S8321414, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Dringsim, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-05-02 Thread Maintenance_bot
Maintenance_bot removed a project: Patch-For-Review.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse, Maintenance_bot
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, S8321414, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Dringsim, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331, 
Isabelladantes1983, Themindcoder, Adamm71, Jersione, Hellket777, LisafBia6531, 
786, Biggs657, Juan90264, Alter-paule, Beast1978, Un1tY, Hook696, Kent7301, 
joker88john, CucyNoiD, Gaboe420, Giuliamocci, Cpaulf30, Af420, Bsandipan, 
Lewizho99, Maathavan, Neuronton
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-05-02 Thread gerritbot
gerritbot added a comment.


  Change #1024414 **merged** by jenkins-bot:
  
  [wikidata/query/rdf@master] ScholarlyArticleSplit: add support for stubs
  
  https://gerrit.wikimedia.org/r/1024414

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse, gerritbot
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Dringsim, Hook696, 
Kent7301, joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, 
Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-05-02 Thread gerritbot
gerritbot added a comment.


  Change #1020871 **merged** by jenkins-bot:
  
  [wikidata/query/rdf@master] Split based on subgraph rules
  
  https://gerrit.wikimedia.org/r/1020871

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse, gerritbot
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Dringsim, Hook696, 
Kent7301, joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, 
Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-25 Thread gerritbot
gerritbot added a comment.


  Change #1019052 **merged** by jenkins-bot:
  
  [wikidata/query/rdf@master] Generalize ScholarlyArticleSplitter
  
  https://gerrit.wikimedia.org/r/1019052

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse, gerritbot
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Dringsim, Hook696, 
Kent7301, joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, 
Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-25 Thread gerritbot
gerritbot added a comment.


  Change #1024414 had a related patch set uploaded (by DCausse; author: 
DCausse):
  
  [wikidata/query/rdf@master] ScholarlyArticleSplit: add support for stubs
  
  https://gerrit.wikimedia.org/r/1024414

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse, gerritbot
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Dringsim, Hook696, 
Kent7301, joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, 
Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-23 Thread dcausse
dcausse claimed this task.
dcausse moved this task from Ready for Dev -- SWE to In Progress on the 
Discovery-Search (Current work) board.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

WORKBOARD
  https://phabricator.wikimedia.org/project/board/1227/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Dringsim, Hook696, 
Kent7301, joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, 
Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, 
LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-23 Thread gerritbot


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-16 Thread dr0ptp4kt
dr0ptp4kt added a comment.


  **Running time**
  Total Uptime: 55 min
  
  This was faster than in T347989#9335980 
. Nice!
  
  **Counts**
  
  To be discussed in code review.
  
  **Samples **
  
  These look similar to about what we'd expect based on T347989#9346038 
 .
  
select "| " || concat_ws(" | ", subject, predicate, object, context) from 
dr0ptp4kt.wikibase_rdf_scholarly_split_t362060 where snapshot = '20231016' and 
wiki = 'wikidata' and
scope = 'scholarly_articles' and rand() <= (30/7643858365) distribute by 
rand() sort by rand() limit 30;
  
  {icon graduation-cap}
  
  | subject 
| predicate| 
object | 
context  |
  | 
---
 |  | 
-- | 
 |
  | 
http://www.wikidata.org/entity/statement/Q46815762-E3F8B9BE-32CC-4055-9097-0732A1D7E88E
 | http://www.w3.org/1999/02/22-rdf-syntax-ns#type  | 
http://wikiba.se/ontology#BestRank | 
http://www.wikidata.org/entity/Q46815762 |
  | http://www.wikidata.org/reference/c2c805e274b6709d71ffd08402ed14a95ddc0f48  
| http://www.wikidata.org/prop/reference/P248  | 
http://www.wikidata.org/entity/Q180686 | 
http://wikiba.se/ontology#Reference  |
  | http://www.wikidata.org/entity/Q93646519
| http://schema.org/description| 
"1985\u5E74\u306E\u8AD6\u6587"@ja  | 
http://www.wikidata.org/entity/Q93646519 |
  | http://www.wikidata.org/entity/Q82929879
| http://wikiba.se/ontology#sitelinks  | 
"0"^^http://www.w3.org/2001/XMLSchema#integer  | 
http://www.wikidata.org/entity/Q82929879 |
  | http://www.wikidata.org/reference/698fdc9c32c9033280837148dd0cc2fbb09701b6  
| http://www.wikidata.org/prop/reference/P248  | 
http://www.wikidata.org/entity/Q229883 | 
http://wikiba.se/ontology#Reference  |
  | 
http://www.wikidata.org/entity/statement/Q37398018-08548343-257C-43E8-8768-1B82B012B857
 | http://www.w3.org/ns/prov#wasDerivedFrom | 
http://www.wikidata.org/reference/1312ec06258ac7841e5e97d5b1d85cc034da666b | 
http://www.wikidata.org/entity/Q37398018 |
  | 
http://www.wikidata.org/entity/statement/Q38261165-38825DC4-B1CA-4102-8CCE-2B4713882EED
 | http://wikiba.se/ontology#rank   | 
http://wikiba.se/ontology#NormalRank   | 
http://www.wikidata.org/entity/Q38261165 |
  | 
http://www.wikidata.org/entity/statement/Q50247650-2B75A590-C865-4CD7-8E93-C5720E77B459
 | http://www.wikidata.org/prop/statement/P31   | 
http://www.wikidata.org/entity/Q13442814   | 
http://www.wikidata.org/entity/Q50247650 |
  | 
http://www.wikidata.org/entity/statement/Q56638632-3EEB814A-C402-48D4-9577-B91996287EDD
 | http://wikiba.se/ontology#rank   | 
http://wikiba.se/ontology#NormalRank   | 
http://www.wikidata.org/entity/Q56638632 |
  | 
http://www.wikidata.org/entity/statement/Q93198245-A9EF6F3A-AE60-4B68-9ADF-03861F92E7D2
 | http://www.w3.org/ns/prov#wasDerivedFrom | 
http://www.wikidata.org/reference/c40456cccbdf1b0dbf4590fad9ace45a270e3af6 | 
http://www.wikidata.org/entity/Q93198245 |
  | 
http://www.wikidata.org/entity/statement/Q35798201-73FA43B1-DE81-4AB8-84A1-435A776AFBF8
 | http://www.wikidata.org/prop/statement/P50   | 
http://www.wikidata.org/entity/Q55071316   | 
http://www.wikidata.org/entity/Q35798201 |
  | 
http://www.wikidata.org/entity/statement/Q46675214-E205C68E-FD35-4F3B-99F6-CEF31C772C1E
 | http://www.wikidata.org/prop/qualifier/P1545 | "2"   
 | 
http://www.wikidata.org/entity/Q46675214 |
  | 
http://www.wikidata.org/entity/statement/Q40608211-C59EE5EA-2F96-47C2-AE41-7EBEB83583F5
 | http://wikiba.se/ontology#rank   | 
http://wikiba.se/ontology#NormalRank   | 
http://www.wikidata.org/entity/Q40608211 |
  | 

[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-16 Thread dr0ptp4kt
dr0ptp4kt added a comment.


  I kicked off a run using the current version of the patch with the following 
command and backing table, and its status should be able to be followed here: 
https://yarn.wikimedia.org/cluster/app/application_1713178047802_16409
  
  So long as I haven't made an error somewhere in here that produces a runtime 
exception (e.g., pathing), we should be able to see after a couple hours how 
it's going.
  
spark3-submit --master yarn --driver-cores 2 --conf 
spark.sql.autoBroadcastJoinThreshold=-1 --conf 
spark.dynamicAllocation.maxExecutors=128 --conf 
spark.sql.shuffle.partitions=512 --conf spark.executor.memoryOverhead=4g 
--executor-cores 4 --executor-memory 12g --driver-memory 16g  --name 
scholarly_article_split_manual__scholarly_article_split_triples__T362060_personal_namespace
  --conf spark.yarn.maxAppAttempts=1 --class 
org.wikidata.query.rdf.spark.transform.structureddata.dumps.ScholarlyArticleSplit
 --deploy-mode cluster 
/home/dr0ptp4kt/rdf-spark-tools-0.3.138-SNAPSHOT-jar-with-dependencies-T362060.jar
 --input-table-partition-spec 
discovery.wikibase_rdf_t337013/date=20231016/wiki=wikidata 
--output-table-partition-spec 
dr0ptp4kt.wikibase_rdf_scholarly_split_T362060/snapshot=20231016/wiki=wikidata
  
  Here was the manual table creation I did while `use`ing the `dr0ptp4kt` 
namespace.
  
CREATE TABLE IF NOT EXISTS dr0ptp4kt.wikibase_rdf_scholarly_split_T362060 (
  `subject` string,
  `predicate` string,
  `object` string,
  `context` string
)
PARTITIONED BY (
`snapshot` string,
`wiki` string,
`scope` string
)
STORED AS PARQUET
LOCATION 
'hdfs://analytics-hadoop/user/dr0ptp4kt/wikibase_rdf_scholarly_split_T362060/wikidata/rdf_scholarly_split_T362060/'
;

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dr0ptp4kt
Cc: dr0ptp4kt, dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, 
Themindcoder, Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, 
Astuthiodit_1, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Hook696, Kent7301, 
joker88john, CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, Lahi, Gq86, 
Af420, Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, 
Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-15 Thread Gehel
Gehel set the point value for this task to "5".

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Gehel
Cc: dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, Themindcoder, 
Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, Astuthiodit_1, 786, 
Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, Alter-paule, 
Beast1978, ItamarWMDE, Un1tY, Akuckartz, Hook696, Kent7301, joker88john, 
CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, 
Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, 
Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-15 Thread Gehel
Gehel moved this task from Incoming to Current work on the 
Wikidata-Query-Service board.
Gehel removed a project: Wikidata-Query-Service.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

WORKBOARD
  https://phabricator.wikimedia.org/project/board/891/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Gehel
Cc: dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, Themindcoder, 
Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, Astuthiodit_1, 786, 
Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, Alter-paule, 
Beast1978, ItamarWMDE, Un1tY, Akuckartz, Hook696, Kent7301, joker88john, 
CucyNoiD, Nandana, Gaboe420, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, 
Bsandipan, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, 
Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331, AWesterinen, Namenlos314, Lucas_Werkmeister_WMDE, 
merbst, Jonas, Xmlizer, jkroll, Jdouglas, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-15 Thread Gehel
Gehel triaged this task as "High" priority.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

WORKBOARD
  https://phabricator.wikimedia.org/project/board/891/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Gehel
Cc: dcausse, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, Themindcoder, 
Adamm71, S8321414, Jersione, Hellket777, LisafBia6531, Astuthiodit_1, 
AWesterinen, 786, Biggs657, karapayneWMDE, Invadibot, maantietaja, Juan90264, 
Alter-paule, Beast1978, ItamarWMDE, Un1tY, Akuckartz, Hook696, Kent7301, 
joker88john, CucyNoiD, Nandana, Namenlos314, Gaboe420, Giuliamocci, Cpaulf30, 
Lahi, Gq86, Af420, Bsandipan, Lucas_Werkmeister_WMDE, GoranSMilovanovic, 
QZanden, EBjune, KimKelting, merbst, LawExplorer, Lewizho99, Maathavan, 
_jensen, rosalieper, Neuronton, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-11 Thread gerritbot
gerritbot added a project: Patch-For-Review.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: gerritbot
Cc: dcausse, Aklapper, luca.favorido, Danny_Benjafield_WMDE, 
Isabelladantes1983, Themindcoder, Adamm71, S8321414, Jersione, Hellket777, 
LisafBia6531, Astuthiodit_1, AWesterinen, 786, Biggs657, karapayneWMDE, 
Invadibot, maantietaja, Juan90264, Alter-paule, Beast1978, ItamarWMDE, Un1tY, 
Akuckartz, Hook696, Kent7301, joker88john, CucyNoiD, Nandana, Namenlos314, 
Gaboe420, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, Bsandipan, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, KimKelting, merbst, 
LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, 
Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-11 Thread gerritbot
gerritbot added a comment.


  Change #1019052 had a related patch set uploaded (by DCausse; author: 
DCausse):
  
  [wikidata/query/rdf@master] Generalize ScholarlyArticleSplitter
  
  https://gerrit.wikimedia.org/r/1019052

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: gerritbot
Cc: dcausse, Aklapper, luca.favorido, Danny_Benjafield_WMDE, S8321414, 
Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, 
GoranSMilovanovic, QZanden, EBjune, KimKelting, merbst, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, 
Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-08 Thread dr0ptp4kt
dr0ptp4kt added a project: Discovery-Search (Current work).

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dr0ptp4kt
Cc: dcausse, Aklapper, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, 
AWesterinen, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, 
QZanden, EBjune, KimKelting, merbst, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, 
Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-08 Thread Maintenance_bot
Maintenance_bot added a project: Wikidata.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Maintenance_bot
Cc: dcausse, Aklapper, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, 
AWesterinen, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, 
QZanden, EBjune, KimKelting, merbst, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, 
Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-08 Thread dcausse
dcausse added a parent task: T337013: [Epic] Splitting the graph in WDQS.

TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dcausse, Aklapper, AWesterinen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, 
EBjune, KimKelting, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, 
aude, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T362060: Generalize ScholarlyArticleSplitter

2024-04-08 Thread dcausse
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  The spark job ScholarlyArticleSplitter should be generalized to support the 
general case with //n// subgraphs, a wider variety of rules and stubs.
  
  AC:
  
  - support subgraph definitions as proposed in T361935 

  - support stubs 
WDQS_Split_Refinement#Add_triples_to_help_navigate_between_the_subgraphs 


TASK DETAIL
  https://phabricator.wikimedia.org/T362060

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dcausse, Aklapper, AWesterinen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, 
EBjune, KimKelting, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, 
aude, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org