[Wikidata-bugs] [Maniphest] [Commented On] T46581: Partial dumps
Ladsgroup added a comment. I might be wrong but I feel there's a large demand for couple of types of dumps and there's a long tail that we can't afford to have. For example, having a dump of all humans (no pun intended) is very very useful (even I need it for one of my tools) and there might be a request for dumps that can be easily handled through WDQS+scraper but for example just getting list of all humans times out in WDQS (understandably) TASK DETAIL https://phabricator.wikimedia.org/T46581 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup Cc: Ladsgroup, Aklapper, Nikki, abian, Bugreporter, Lucie, PokestarFan, hoo, JanZerebecki, jkroll, Wikidata-bugs, Denny, Lydia_Pintscher, daniel, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T46581: Partial dumps
daniel added a comment. Filtering dumps by area of interest is convenient, if a good criterion can be found to identify items relevant to the topic. It would probably make sense to also include any items directly references, to provide the immediate context of the items. ms and However, for this to be useful, a great many of such specialized "area of interest" dumps would have to exist, with substantial overlap. If WMF can afford that in terms of resources, it would sure be nice to have. But perhaps there is a different way to slice this: create a stub dump that filters out most of the statements (and sitelinks?), providing labels, descriptions, aliases, plus instanceof and subclass. Another approach would be to focus on structure rather than topic: e.g. export all items that have a (or are the subject of a) parent axon property, and include only terms and maybe a very limited set op properties. Similarly, dumps that contain the geographical inclusion structure, or the genealogical structure, or historical timeline may be useful. TASK DETAIL https://phabricator.wikimedia.org/T46581 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: daniel Cc: Ladsgroup, Aklapper, Nikki, abian, Bugreporter, Lucie, PokestarFan, hoo, JanZerebecki, jkroll, Wikidata-bugs, Denny, Lydia_Pintscher, daniel, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T46581: Partial dumps
Bugreporter added a comment. https://tools.wmflabs.org/wdumps/ provided a way to generate a partial dump, but the dump can not be regularly generated. TASK DETAIL https://phabricator.wikimedia.org/T46581 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Bugreporter Cc: Aklapper, Nikki, abian, Bugreporter, Lucie, PokestarFan, hoo, JanZerebecki, jkroll, Wikidata-bugs, Denny, Lydia_Pintscher, daniel, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs