On Sunday, October 28, 2001 5:17 PM, David Nesting [SMTP:[EMAIL PROTECTED]]
wrote:
> I am seeing some anomalous behavior with wget with respects to mirroring
> (-m) a site and trying to keep that mirror local to the source domain.
> There are a couple of CGI scripts that inevitably get called that end up
> issuing redirects off-site.  These redirects are followed even though
> --span-hosts is not supplied, and even if the destination domains are
> added via the --exclude-domains option.
> 

Hi,

now that's coincidence. This morning I wanted to post the very same
complaint.

IMHO, when I tell wget to mirror a site, I want it to follow links
_only_ to this site, never to anything outside.

Also, I believe a CGI script is not necessary to provoke this behaviour.
I tried 'wget http://www.intercon-spacetec.com/software/ --mirror -np',
and when wget tried to retrieve 'robots.txt' it got a '302 Moved
Temporarily' and was redirected to 'http://www.serve.com/filenf.html'.
Soon after that it happily began to mirror www.serve.com.
Unnecessary to say that I am not in the least interested in that site.

If a fix for this problem comes up, I would be most interested.

Ulrich Hagen

PS: I am running 'GNU Wget 1.7.1-pre1' on WinNT.

Reply via email to