https://bugzilla.wikimedia.org/show_bug.cgi?id=69277

--- Comment #86 from Dan Andreescu <dandree...@wikimedia.org> ---
Well, you can query the logs with SQL on our hadoop cluster if you'd like more
info (you have rights to stat1002.eqiad.wmnet right Sam?)  I mean, I could run
queries for you but it sounds like you just want to investigate.  For example,
to get a breakdown by status, you could do:

sta1002> hive
hive (default)> use wmf_raw;
hive (wmf_raw)>
 select uri_path,
        http_status,
        count(*)
   from webrequest
  where webrequest_source='bits'
    and year=2014
    and month=10
    and day=28
    and uri_path like '%skins/common/%'
  group by uri_path,
        http_status
 having count(*) > 20
;


You can see what else is available (like referrer) with:

hive (wmf_raw)> describe webrequest;

If you just want to poke around smaller bits of data, you can add something
like "and hour=12".  Without that, the query above could take over an hour.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to