dr0ptp4kt added a comment.

  @bking just wanted to express my gratitude for the support on this ticket and 
its friends T344905: Publish WDQS JNL files to dumps.wikimedia.org 
<https://phabricator.wikimedia.org/T344905> and T347647: 2023-09-18 
latest-all.ttl.gz WDQS dump `Fatal error munging RDF 
org.openrdf.rio.RDFParseException: Expected '.', found 'g'` 
<https://phabricator.wikimedia.org/T347647>. FWIW I do think it would be good 
to automate this. As a matter of getting to a functional WDQS local environment 
replete with BlazeGraph data, it would accelerate things a lot. I think my only 
reservations are that:
  
  1. It takes time to automate. Any rough guess on level of effort for that? I 
understand that'd inform relative prioritization against the large pile of 
other things.
  2. The energy savings is possibly unclear, at least under current case (but 
that's partly because it's hard to know how much energy is being expended, 
which could be guessed at from number of dump downloads; not sure how easy it 
is to get those stats; this is different from the bandwidth transfer on 
Cloudflare R2).
  
  However, I would probably err on the side of assuming that ultimately the 
automation will boost the technical communities' interest and ability to trial 
things locally (right now the barriers are somewhat prohibitive) and that the 
energy savings will roughly net out - ironically, if it attracts more people, 
they'll in the aggregate consume more energy, but they'll also be vastly more 
efficient energy-wise because they won't have to ETL, which takes a lot of 
compute resources. For potential reusers (e.g., Enterprise or other 
institutions) it might help smooth things along a bit, although this is mostly 
just my conjecture.
  
  Thinking ahead a little, we'd probably want to generalize anything so that it 
can take arbitrary `.jnl`s, for example for split graphs.

TASK DETAIL
  https://phabricator.wikimedia.org/T347605

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: bking, dr0ptp4kt
Cc: Addshore, dr0ptp4kt, Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1, 
AWesterinen, BTullis, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, 
GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, 
Manybubbles, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to