From: Glenn Nieuwenhuyse > wget -T 1 -t 1 -r --reject="robots.*" [...] > > I would expect this not to download the robots.txt file, but still it > does.
Perhaps because "robots.txt" is a special case, and is not selected by following links, and so is unaffected by the --reject option. A search for "robot" in the manual should reveal this: http://www.gnu.org/software/wget/manual/wget.html robots = on/off Specify whether the norobots convention is respected by Wget, "on" by default. This switch controls both the /robots.txt and the nofollow aspect of the spec. See Robot Exclusion, for more details about this. Be sure you know what you are doing before turning this off. So, adding "-e robots=off" to your command might help. ------------------------------------------------------------------------ Steven M. Schweda [EMAIL PROTECTED] 382 South Warwick Street (+1) 651-699-9818 Saint Paul MN 55105-2547