[Wikidata-bugs] [Maniphest] [Commented On] T46581: Partial dumps

2019-12-22 Thread Ladsgroup
Ladsgroup added a comment.


  I might be wrong but I feel there's a large demand for couple of types of 
dumps and there's a long tail that we can't afford to have. For example, having 
a dump of all humans (no pun intended) is very very useful (even I need it for 
one of my tools) and there might be a request for dumps that can be easily 
handled through WDQS+scraper but for example just getting list of all humans 
times out in WDQS (understandably)

TASK DETAIL
  https://phabricator.wikimedia.org/T46581

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Ladsgroup
Cc: Ladsgroup, Aklapper, Nikki, abian, Bugreporter, Lucie, PokestarFan, hoo, 
JanZerebecki, jkroll, Wikidata-bugs, Denny, Lydia_Pintscher, daniel, 
darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T46581: Partial dumps

2019-12-22 Thread daniel
daniel added a comment.


  Filtering dumps by area of interest is convenient, if a good criterion can be 
found to identify items relevant to the topic. It would probably make sense to 
also include any items directly references, to provide the immediate context of 
the items.
  ms and 
  However, for this to be useful, a great many of such specialized "area of 
interest" dumps would have to exist, with substantial overlap. If WMF can 
afford that in terms of resources, it would sure be nice to have.
  
  But perhaps there is a different way to slice this: create a stub dump that 
filters out most of the statements (and sitelinks?), providing labels, 
descriptions, aliases, plus instanceof and subclass.
  
  Another approach would be to focus on structure rather than topic: e.g. 
export all items that have a (or are the subject of a) parent axon property, 
and include only terms and maybe a very limited set op properties. Similarly, 
dumps that contain the geographical inclusion structure, or the genealogical 
structure, or historical timeline may be useful.

TASK DETAIL
  https://phabricator.wikimedia.org/T46581

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: daniel
Cc: Ladsgroup, Aklapper, Nikki, abian, Bugreporter, Lucie, PokestarFan, hoo, 
JanZerebecki, jkroll, Wikidata-bugs, Denny, Lydia_Pintscher, daniel, 
darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T46581: Partial dumps

2019-12-19 Thread Bugreporter
Bugreporter added a comment.


  https://tools.wmflabs.org/wdumps/ provided a way to generate a partial dump, 
but the dump can not be regularly generated.

TASK DETAIL
  https://phabricator.wikimedia.org/T46581

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Bugreporter
Cc: Aklapper, Nikki, abian, Bugreporter, Lucie, PokestarFan, hoo, JanZerebecki, 
jkroll, Wikidata-bugs, Denny, Lydia_Pintscher, daniel, darthmon_wmde, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs