[Wikidata-bugs] [Maniphest] [Commented On] T230856: RDF dump performance for SDC

2019-11-27 Thread ArielGlenn
ArielGlenn added a comment. In T230856#5692766, @Cparle wrote: > I'm not sure if T222497 covers this stuff and, if not, what is actionable here by the structured data team. @ArielGlenn any th

2019-11-26 Thread Cparle
Cparle added a comment. I'm not sure if T222497 covers this stuff and, if not, what is actionable here by the structured data team. @ArielGlenn any thoughts? TASK DETAIL https://phabricator.wikimedia.org/T230856 EMAIL PREFERENCES https://phabr

2019-11-04 Thread ArielGlenn
ArielGlenn added a comment. Bulk adds of depicts statements on deployment-prep will start this evening, now that the code is ready. It will run over a couple of days at least. Once complete we'll have 3k images on beta commons with captions and depicts statements in them, referencing 1k item

2019-11-01 Thread ArielGlenn
ArielGlenn added a comment. Adding items to wikidata in deployment-prep for use in depicts statements for the uploaded images in beta commons. Depicts statements early next week most likely.

2019-10-29 Thread ArielGlenn
ArielGlenn added a comment. I've started generating, uploading and captioning images in beta commons today, using the latest version of the script linked above. I'd like to add some depicts statements too. In any case, by the end of the week expect that we'll have several batches of these li

2019-09-12 Thread Gehel
Gehel added a comment. Note that T222497 needs to be resolved before we can actually have a working dump.

2019-08-23 Thread ArielGlenn
ArielGlenn added a comment. https://github.com/apergos/misc-wmf-crap/tree/master/glyph-image-generator Starting to get clever about this: ability to generate 50k small images with metadata that can be extracted for use in depicts and/or caption statements.
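
To illustrate the idea of per-image metadata that can feed caption and depicts statements, here is a minimal sketch. It is not the linked script: the function name, the record fields, and the file naming scheme are all assumptions made up for this example, and it only builds the metadata records (rendering the images themselves would need an imaging library and is left out).

```python
import unicodedata


def glyph_metadata(start, count):
    """Build `count` metadata records for named code points, from `start` upward.

    Hypothetical helper: field names and the filename pattern are
    illustrative assumptions, not taken from the glyph-image-generator script.
    """
    records = []
    cp = start
    while len(records) < count and cp <= 0x10FFFF:
        # Code points without a Unicode name (unassigned, controls, surrogates) are skipped.
        name = unicodedata.name(chr(cp), None)
        if name is not None:
            records.append({
                "filename": f"glyph_U+{cp:04X}.png",  # assumed naming scheme
                "caption": name.title(),              # text for a caption
                "depicts_label": name.lower(),        # label to look up a depicts item
            })
        cp += 1
    return records


if __name__ == "__main__":
    for record in glyph_metadata(0x41, 3):
        print(record)
```

Deriving the caption and the depicts label from the same source as the image keeps the metadata extractable without any manual annotation, which is what makes batches in the tens of thousands practical.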

2019-08-20 Thread ArielGlenn
ArielGlenn added a comment. I'm looking at deployment-db05 now, and there are 63332 rows in the revision table, with 53250 rows in the content table. I guess we need to double the number of revisions and then add the structured data for those entries. We can probably be clever about this via

2019-08-20 Thread Smalyshev
Smalyshev added a comment. Probably not a lot. Search for English labels returns

2019-08-20 Thread ArielGlenn
ArielGlenn added a comment. @Smalyshev Do you know how many entries have structured data on deployment-prep? Is that a useful testing ground right now or should we be populating the data over there first?