Re: Strange Errors in logs: Suspected Web Crawler

2008-03-17 Thread James Buckingham
As I didn't get a reply to this one so I thought I'd add a bit more 
information. 

These errors seem to follow another which is worryingly similar to this:

The filename, directory name, or volume label syntax is incorrect The specific 
sequence of files included or processed is: 
X:\OurUploaderFolder\Web\groupDocumentation\2.http:\www.oursite.com\aa\assessments\page.cfm

I found a Q&A on Yahoo Slurp! here: 
http://help.yahoo.com/l/us/yahoo/search/webcrawler/slurp-10.html

which shows that Slurp! will fire out fake addresses as a way of 
establishing whether you use an error page or not. Why it uses a diretory path 
and how it knows what the directory path is though is a bit of a worry.

Checking the IIS logs there is activity from Slurp! but it doesn't always tie 
up with the times that the errors are appearing in the CF logs, hence why I'm 
not 100% sure this is web crawl activity.

I'm still not fully understanding why there is no application name, unless it's 
because there is this access to a directory path going on instead of a RUL, but 
does anybody know a) how I can assign an application to this kind of message or 
b) how I can stop the crawler from firing these messages off without actually 
stopping it from crawling the site?

Cheers,
James 

~|
Adobe® ColdFusion® 8 software 8 is the most important and dramatic release to 
date
Get the Free Trial
http://ad.doubleclick.net/clk;160198600;22374440;w

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:301410
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4


Strange Errors in logs: Suspected Web Crawler

2008-03-14 Thread James Buckingham
Morning everyone,

We've got a strange error which keeps appearing in our logs every day (usually 
2-3 but they're increasing) which looks like this:

13-Mar-2008  12:13   Error   0 
Element ROOT is undefined in APPLICATION. The specific sequence of files 
included or processed is: C:\..\display\disp_nopage.cfm, line: 23

This is coming from our global error-handling page. Note that the "Application 
Name" column is blank so it doesn't look like it's being assigned to one. If i 
try and access this page myself or force an error it appears fine and the 
debugger is showing that the page is assign to the main application.

As you can see from the address the file sits in a directory called "display". 
There is no application.cfm file in this directory so I would expect it to drop 
to the next level and get the application details from there. This would be the 
root which is where the main application.cfm is sitting.

I'm suspecting that this might be a Web Crawler (Yahoo Slurp! appears a lot in 
the IIS logs) which is testing the error pages but what puzzles me is why there 
isn't an application being assigned to it. Does anybody have any ideas? Would 
something as simple as a


 


. fix the problem?

Your help would be appreciated.

Cheers,
James 

~|
Adobe® ColdFusion® 8 software 8 is the most important and dramatic release to 
date
Get the Free Trial
http://ad.doubleclick.net/clk;160198600;22374440;w

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:301239
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4