Charlie Clark <[EMAIL PROTECTED]> wrote:
> Hi,
>
> I've got a couple of sites running with apache's combined logformat:
>
> LogFormat "%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\""
> Analog is complaining about these at the referrer:
>
> C: 66.249.66.76 - - [04/Jul/2007:21:10:50 +0200] "GET /robots.txt HTTP/1.1" 404 3871 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
> C:
>              *
>
> The lines look to have the identical format to ones from another
> server which doesn't raise the error:
>
> 74.6.66.46 - - [03/Aug/2007:17:35:19 +0200] "GET /robots.txt HTTP/1.0" 200 123 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
>
> It may just be my eyes but can anyone see what's wrong?

I can't replicate your error. Using this APACHELOGFORMAT string:
APACHELOGFORMAT (%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\")

I get this result:
S: Successful requests: 1
S: Redirected requests: 0
S: Failed requests: 1
S: Requests returning informational status code: 0
S: Status code not given: 0
S: Unwanted lines: 0
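
If you want to double-check the two lines independently of Analog, here is a rough Python sketch that matches them against a regular expression for the combined format and tallies them by status code. The regex, the function names and the category rules are my own assumptions for illustration (2xx plus 304 counted as successful, other 3xx as redirected, 4xx/5xx as failed, which is how I understand Analog's summary), not Analog's actual parser. The sample lines are the ones from your message with the mail line-wrapping undone.

```python
import re

# Hypothetical regex for Apache's combined format:
#   %h %l %u %t "%r" %>s %b "%{Referer}i" "%{User-Agent}i"
COMBINED = re.compile(
    r'^(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<bytes>\d+|-) '
    r'"(?P<referer>[^"]*)" '
    r'"(?P<agent>[^"]*)"$'
)

# The two log lines from the thread, each rejoined onto a single line.
SAMPLES = [
    '66.249.66.76 - - [04/Jul/2007:21:10:50 +0200] "GET /robots.txt HTTP/1.1" '
    '404 3871 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; '
    '+http://www.google.com/bot.html)"',
    '74.6.66.46 - - [03/Aug/2007:17:35:19 +0200] "GET /robots.txt HTTP/1.0" '
    '200 123 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; '
    'http://help.yahoo.com/help/us/ysearch/slurp)"',
]


def parse_line(line):
    """Return a dict of fields, or None if the line does not fit the format."""
    match = COMBINED.match(line.strip())
    return match.groupdict() if match else None


def classify(status):
    """Assumed grouping of status codes into summary categories."""
    code = int(status)
    if 100 <= code < 200:
        return "informational"
    if 200 <= code < 300 or code == 304:
        return "successful"
    if 300 <= code < 400:
        return "redirected"
    return "failed"


def summarize(lines):
    """Count lines per category; lines that do not parse go to 'unparsed'."""
    counts = {"successful": 0, "redirected": 0, "failed": 0,
              "informational": 0, "unparsed": 0}
    for line in lines:
        fields = parse_line(line)
        key = "unparsed" if fields is None else classify(fields["status"])
        counts[key] += 1
    return counts


if __name__ == "__main__":
    print(summarize(SAMPLES))
```

Both lines parse under this pattern, and the tally comes out as one successful and one failed request, the same as the Analog summary above. If a line in your real logfile ends up in "unparsed", that line is worth looking at byte by byte.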

How come Googlebot gets a 404 when it requests robots.txt, but Yahoo! 
Slurp gets a 200?

Aengus 


