Re: wget bug: spaces in directories mapped to %20
Zitat von Tony O'Hagan [EMAIL PROTECTED]: Original path: abc def/xyz pqr.gif After wget mirroring: abc%20def/xyz pqr.gif (broken link) wget --version is GNU Wget 1.8.2 This was a well-known error in the 1.8 versions of wget, which is already corrected in the 1.9 versions. Regards, Jochen Roderburg ZAIK/RRZK University of Cologne Robert-Koch-Str. 10 Tel.: +49-221/478-7024 D-50931 Koeln E-Mail: [EMAIL PROTECTED] Germany
wget bug: spaces in directories mapped to %20
Recently I used the following wget command under a hosted linux account: $ wget -mirror url -o mirror.log The web site contained files and virtual directories that contained spaces in the names. URL encoding translated these spaces to %20. wget correctly URL decoded the file names (creating file names containing spaces) but incorrectly failed to URL decode the directory names (creating directory paths containing %20 instead of spaces). The resulting mirror therefor contained broken links. Some hyper links were embedded inside flash graphics files so hyper link renaming was not an option. Personally, I would never put a space in a web hosted file or directory name but in this case I was migrating a web site that had been developed by someone else. I think that mirroring should work regardless in this case. Example: Original path: abc def/xyz pqr.gif After wget mirroring: abc%20def/xyz pqr.gif (broken link) wget --version is GNU Wget 1.8.2 Thanks for the invaluable wget. Tony O'Hagan. -- No virus found in this outgoing message. Checked by AVG Anti-Virus. Version: 7.0.300 / Virus Database: 265.6.13 - Release Date: 16/01/2005