ArielGlenn added a subscriber: MarkTraceur.
ArielGlenn added a comment.

  From email from @MarkTraceur
  
  Database needs
  --------------
  
  - 54 million files on Commons
  - Estimated average of 10-20 statements per file
  - Estimated 1 revision per statement
  - Therefore, (very) roughly 1 billion estimated rows added to revisions table
  
  External storage needs
  ----------------------
  
  - Each file will have its own MediaInfo entity, which will be analogous to 
Wikidata items
  - So, given Wikidata has about 57 million items, the storage needs should be 
about the same
    - Obviously that would need to be additional storage, not including the 
existing Wikitext
  
  Rates
  -----
  
  - We expect multiple bots to run over Commons very shortly after release 
(within the next few months)
    - Don't anticipate these will be drastically faster than normal bot runs
    - Could see Multichill's bots for examples - I believe he's rate-limited 
them aggressively
  - There will likely be micro-contributions as well
    - Think Magnus's "Wikidata game" style, likely similar rates
    - Also sanctioned on-wiki machine-aided work (for depicts statements)
  - By the end of the calendar year, we expect at least 5 million files to have 
structured data
  - We're currently sitting in the low six figures (100-300k)

TASK DETAIL
  https://phabricator.wikimedia.org/T226093

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: ArielGlenn
Cc: MarkTraceur, ArielGlenn, Aklapper, darthmon_wmde, Legado_Shulgin, Nandana, 
JKSTNK, thifranc, AndyTan, Davinaclare77, Qtn1293, Lahi, PDrouin-WMF, Gq86, 
E1presidente, Ramsey-WMF, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic, 
Lunewa, Th3d3v1ls, Hfbn0, QZanden, Tramullas, Acer, LawExplorer, Salgo60, 
Zppix, Silverfish, _jensen, rosalieper, Susannaanas, Wong128hk, gnosygnu, 
Jane023, Wikidata-bugs, Base, matthiasmullie, aude, Ricordisamoa, Wesalius, 
Lydia_Pintscher, Fabrice_Florin, Raymond, faidon, Steinsplitter, Mbch331, 
Jay8g, fgiunchedi
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to