https://bugzilla.wikimedia.org/show_bug.cgi?id=59943

Nemo <federicol...@tiscali.it> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |ar...@wikimedia.org

--- Comment #3 from Nemo <federicol...@tiscali.it> ---
So, I finally found the file responsible of updates navigating the obscure repo
tree:
http://git.wikimedia.org/blob/operations%2Fdebs%2Fwikistats.git/HEAD/usr%2Flib%2Fwikistats%2Fupdate.php

If I understand correctly the update is run every 24h in a cron. The simplest
change I can think of is:
1) add a sleep time of 1 second between a request and the following;
2) if a table has 1000 wikis or less (or is "mediawikis"), update them all;
3) if a table has 1000 wikis or more, update only 1000-1500, in this way:
  a) start from those whose last update was earlier,
  b) first update up to 500 wikis with more than 100 articles,
  c) then update up to 1000 of the other wikis.

In this way we would update the whole Wikia table in a month (or one year once
it's completely filled) but have data 10 or so days old at most for the bigger
wikis. And the cron would always run in a reasonable time.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to