Re: LinkWalker

2004-03-19 Thread Chris
I have this same robot on my site. Can I block this robot using .htaccess files? Chris http://www.truefootball.com http://www.worldofjerseys.com

Re: LinkWalker

2002-01-08 Thread Jesse Goerz
On Tuesday 08 January 2002 01:38, Russell Coker wrote: On Mon, 7 Jan 2002 23:31, Nathan Strom wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? It's apparently a link

Re: LinkWalker

2002-01-08 Thread Marcel Hicking
On 8 Jan 2002, at 9:56, Jesse Goerz wrote: On Tuesday 08 January 2002 01:38, Russell Coker wrote: On Mon, 7 Jan 2002 23:31, Nathan Strom wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know

Re: LinkWalker

2002-01-08 Thread Russell Coker
On Mon, 7 Jan 2002 23:31, Nathan Strom wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? It's apparently a link-validation robot operated by a company called SevenTwentyFour

Re: LinkWalker

2002-01-07 Thread Nathan Strom
[EMAIL PROTECTED] (Russell Coker) wrote in message news:[EMAIL PROTECTED]... I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? It's apparently a link-validation robot operated

Re: LinkWalker

2002-01-07 Thread Nathan Strom
[EMAIL PROTECTED] (Russell Coker) wrote in message news:[EMAIL PROTECTED]... I wasn't aware that there was any format to robots.txt, I thought that the mere presence of such a file would prevent robots from visiting. Nope; see: http://www.robotstxt.org/wc/robots.html

Re: LinkWalker

2002-01-07 Thread Chris Wagner
Bwahahaha!! Man, that is low. Advertising to sysadmins through the access logs. Sheesh. But now that you mention 7-24, I think I recognize that. I think they are a spam marketing outfit. At 02:31 PM 1/7/02 -0800, Nathan Strom wrote: Personally, I think this is a rogue organization --

Re: LinkWalker

2002-01-07 Thread Frank Louwers
site does NOT have a link to us. Likely Seven24 is trying to clutter people's logs with references as a form of advertising. ... a practice we see more and more often here as well! Even 'respectable' major ISPs are starting to do it! It's a strange world ... Frank Louwers Openminds b.v.b.a.

Re: LinkWalker

2001-12-24 Thread Russell Coker
On Mon, 24 Dec 2001 06:42, Jeremy Lunn wrote: On Sun, Dec 23, 2001 at 05:41:47PM +0100, Russell Coker wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? Surely you'd be able

Re: LinkWalker

2001-12-24 Thread Jeremy Lunn
On Mon, Dec 24, 2001 at 11:43:09AM +0100, Russell Coker wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? Surely you'd be able to disallow access to it with Apache? Yes

Re: LinkWalker

2001-12-24 Thread Jeff Waugh
Russell Coker wrote: Why don't you just update your robots.txt to explicitly specify which files you do or don't allow spiders to access? If it's a rule-abiding spider, that will be the end of it. I wasn't aware that there was any format to robots.txt, I thought that the mere

LinkWalker

2001-12-23 Thread Russell Coker
I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? I've added the following to my firewall setup to stop further attacks... # crappy LinkWalker - evil spider that downloads every file

Re: LinkWalker

2001-12-23 Thread Nick Jennings
of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? I've added the following to my firewall setup to stop further attacks... # crappy LinkWalker - evil spider that downloads every file including .tgz on # the site iptables -A INPUT -j

Re: LinkWalker

2001-12-23 Thread Russell Coker
browsing my web logs! On Sun, Dec 23, 2001 at 05:41:47PM +0100, Russell Coker wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? I've added the following to my firewall setup to stop

Re: LinkWalker

2001-12-23 Thread Nick Jennings
On Sun, Dec 23, 2001 at 09:17:54PM +0100, Russell Coker wrote: I wasn't aware that there was any format to robots.txt, I thought that the mere presence of such a file would prevent robots from visiting. Here is an example of my robots.txt User-agent: * Disallow: /webalizer/ Disallow:
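
The point about robots.txt having a format can be checked with a minimal Python sketch using the standard library's urllib.robotparser. It compares an empty file against the one complete rule quoted in the message above (Disallow: /webalizer/). The helper name `allowed` and the sample paths are made up for illustration, and Python's parser is only one interpretation of the rules; a given robot may behave differently or ignore the file altogether.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


# An empty robots.txt: the file exists but contains no rules.
EMPTY = ""

# The one complete rule visible in the quoted example (the rest is cut off).
RULES = "User-agent: *\nDisallow: /webalizer/\n"

# Hypothetical paths, for illustration only.
checks = {
    "empty file, .tgz": allowed(EMPTY, "LinkWalker", "/files/archive.tgz"),
    "rules, /webalizer/": allowed(RULES, "LinkWalker", "/webalizer/index.html"),
    "rules, .tgz": allowed(RULES, "LinkWalker", "/files/archive.tgz"),
}
for label, result in checks.items():
    print(f"{label}: {'allowed' if result else 'disallowed'}")
```

Under this parser the empty file blocks nothing, and the rule only covers paths under /webalizer/, so the .tgz files would still need their own Disallow line.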

Re: LinkWalker

2001-12-23 Thread Chris Wagner
You should be able to tell if it cares about robots.txt by looking in the logs to see if it's downloading /robots.txt. If it is then something like: User-agent: LinkWalker Disallow: / will keep it off your site. If it doesn't, then iptables will keep it away. Robots info: http://www.global
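
Both checks suggested above can be sketched in Python. This is a rough illustration only: it assumes the access log is in Apache's combined format (the thread does not say which format is in use), the sample log line and the names `fetched_robots_txt` and `robot_is_blocked` are invented for the example, and urllib.robotparser reflects how a well-behaved robot should read the rule, not what LinkWalker actually does.

```python
import re
from urllib.robotparser import RobotFileParser

# Assumed combined log format: ... "METHOD path proto" status bytes "referer" "agent"
LOG_RE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+)[^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def fetched_robots_txt(log_lines, agent_name: str) -> bool:
    """True if any log line shows `agent_name` requesting /robots.txt."""
    wanted = agent_name.lower()
    for line in log_lines:
        match = LOG_RE.search(line)
        if not match:
            continue
        if wanted in match.group("agent").lower() and match.group("path") == "/robots.txt":
            return True
    return False


def robot_is_blocked(robots_txt: str, agent_name: str, path: str = "/") -> bool:
    """True if the robots.txt text disallows `path` for `agent_name`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent_name, path)


# The rule quoted in the message above.
LINKWALKER_RULE = "User-agent: LinkWalker\nDisallow: /\n"

# Invented sample line; address and timestamp are placeholders.
SAMPLE_LOG = [
    '192.0.2.1 - - [23/Dec/2001:17:00:00 +0100] "GET /robots.txt HTTP/1.0" 200 42 "-" "LinkWalker"',
]

print("asked for robots.txt:", fetched_robots_txt(SAMPLE_LOG, "LinkWalker"))
print("blocked by rule:", robot_is_blocked(LINKWALKER_RULE, "LinkWalker", "/files/archive.tgz"))
```

If the first check never turns up a /robots.txt request, the rule is unlikely to help and a firewall or server-level block is the remaining option, as the message says.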

Re: LinkWalker

2001-12-23 Thread Jeremy Lunn
On Sun, Dec 23, 2001 at 05:41:47PM +0100, Russell Coker wrote: I have a nasty web spider with an agent name of LinkWalker downloading everything on my site (including .tgz files). Does anyone know anything about it? Surely you'd be able to disallow access to it with Apache? -- Jeremy
