[Labs-l] Google bot

2014-10-19 Thread Magnus Manske
Hi, I saw a high load (dozens of queries [1]) hitting one of my tools (catscan2). The queries looked like they came from a template on French Wikipedia (category name different, other parameters the same). Access log shows (among other things) Google bot. When I added that to my bot exclusion

Re: [Labs-l] Google bot

2014-10-19 Thread Maximilian Doerr
You are correct. I believe I’m the one that initially brought that to Coren’s attention about that. Cyberpower678 English Wikipedia Account Creation Team Mailing List Moderator On Oct 19, 2014, at 15:50, Magnus Manske magnusman...@googlemail.com wrote: Hi, I saw a high load (dozens of

Re: [Labs-l] Google bot

2014-10-19 Thread Marc A. Pelletier
On 10/19/2014 03:50 PM, Magnus Manske wrote: I vaguely remember that indexing bots (like the Google one) were filtered out by Labs already? They were, for some time, but then I got some fairly vehement protestations that tools being unindexed by Google was a problem. -- Marc

Re: [Labs-l] Google bot

2014-10-19 Thread Maximilian Doerr
Who protested to that, and why would that be a problem? Cyberpower678 English Wikipedia Account Creation Team Mailing List Moderator -Original Message- From: labs-l-boun...@lists.wikimedia.org [mailto:labs-l-boun...@lists.wikimedia.org] On Behalf Of Marc A. Pelletier Sent: Sunday,

Re: [Labs-l] Google bot

2014-10-19 Thread Nuria Ruiz
Why would we want to restrict google indexing of the whole cluster? There are tools of many different nature deployed there, seems that indexing or not should be configured on a instance per instance basis. On Sun, Oct 19, 2014 at 4:41 PM, Maximilian Doerr maximilian.do...@gmail.com wrote:

Re: [Labs-l] Google bot

2014-10-19 Thread Maximilian Doerr
I disagree. Google bot has been nothing but a nuisance, continuously probing my tool with different queries. It has drained resources needlessly, and was quite glad that tool labs had it blocked. Cyberpower678 English Wikipedia Account Creation Team Mailing List Moderator On Oct 19, 2014,

Re: [Labs-l] Google bot

2014-10-19 Thread Gerard Meijssen
Hoi, We might not if it did not bring services down. Thanks, GerardM On 20 October 2014 03:30, Nuria Ruiz nu...@wikimedia.org wrote: Why would we want to restrict google indexing of the whole cluster? There are tools of many different nature deployed there, seems that indexing or not