On Sunday, October 28, 2001 5:17 PM, David Nesting [SMTP:[EMAIL PROTECTED]] wrote:
> I am seeing some anomalous behavior with wget with respects to mirroring
> (-m) a site and trying to keep that mirror local to the source domain.
> There are a couple of CGI scripts that inevitably get called that end up
> issuing redirects off-site. These redirects are followed even though
> --span-hosts is not supplied, and even if the destination domains are
> added via the --exclude-domains option.
>
Hi,

now that's a coincidence. This morning I wanted to post the very same complaint. IMHO, when I tell wget to mirror a site, I want it to follow links _only_ within this site, never to anything outside it.

Also, I believe a CGI script is not necessary to provoke this behaviour. I tried

  wget http://www.intercon-spacetec.com/software/ --mirror -np

and when wget tried to retrieve 'robots.txt' it got a '302 Moved Temporarily' and was redirected to 'http://www.serve.com/filenf.html'. Soon after that it happily began to mirror www.serve.com. Needless to say, I am not in the least interested in that site.

If a fix for this problem comes up, I would be most interested.

Ulrich Hagen

PS: I am running 'GNU Wget 1.7.1-pre1' on WinNT.
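
The behaviour requested above can be sketched as a host check applied before any redirect is followed. This is only a minimal Python illustration, not wget's actual code: the function names `same_host` and `should_follow_redirect` and the `span_hosts` parameter are hypothetical, chosen to echo the --span-hosts option.

```python
from urllib.parse import urljoin, urlsplit


def same_host(url_a, url_b):
    """Return True if both URLs name the same host (hostname is lowercased by urlsplit)."""
    return urlsplit(url_a).hostname == urlsplit(url_b).hostname


def should_follow_redirect(start_url, request_url, location, span_hosts=False):
    """Decide whether a mirroring client should follow a redirect.

    start_url   -- the URL the mirror was started from (hypothetical parameter)
    request_url -- the URL whose response carried the redirect
    location    -- the value of the Location header (may be relative)
    span_hosts  -- assumption: when True, off-site redirects are allowed
    """
    # A relative Location is resolved against the URL that was requested.
    target = urljoin(request_url, location)
    return span_hosts or same_host(start_url, target)


if __name__ == "__main__":
    start = "http://www.intercon-spacetec.com/software/"
    robots = "http://www.intercon-spacetec.com/robots.txt"
    # Off-site redirect as in the report above: should be refused.
    print(should_follow_redirect(start, robots, "http://www.serve.com/filenf.html"))
    # Redirect that stays on the original host: should be followed.
    print(should_follow_redirect(start, robots, "/software/index.html"))
```

With the URLs from the report, the redirect from robots.txt to www.serve.com would be refused under this check, while a redirect that stays on www.intercon-spacetec.com would still be followed.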