Guillaume Morin <[EMAIL PROTECTED]> writes: > If wget fetches a url which redirects to another host, wget > retrieves the file, and there's nothing that can be done to turn > that off. > > So, if you do wget -r on a machine that happens to have a redirect to > www.yahoo.com you'll wind up trying to pull down a big chunk of > yahoo.
Hmm. Are you sure? Wget 1.8.1 is trying hard to restrict following redirections by applying the same rules normally used for following links. Downloading a half of Yahoo! because someone redirects to www.yahoo.com is not intended to happen. I tried to reproduce it by creating a page that redirects to www.yahoo.com, but Wget behaved correctly: $ wget -r -l0 http://muc.arsdigita.com:2005/test.tcl --19:13:53-- http://muc.arsdigita.com:2005/test.tcl => `muc.arsdigita.com:2005/test.tcl' Resolving muc.arsdigita.com... done. Connecting to muc.arsdigita.com[212.84.246.68]:2005... connected. HTTP request sent, awaiting response... 302 Found Location: http://www.yahoo.com [following] --19:13:53-- http://www.yahoo.com/ => `www.yahoo.com/index.html' Resolving www.yahoo.com... done. Connecting to www.yahoo.com[64.58.76.223]:80... connected. HTTP request sent, awaiting response... 200 OK Length: unspecified [text/html] [ <=> ] 16,829 22.39K/s 19:13:55 (22.39 KB/s) - `www.yahoo.com/index.html' saved [16829] FINISHED --19:13:55-- Downloaded: 16,829 bytes in 1 files Guillaume, exactly how have you reproduced the problem?