On Tue, Apr 22, 2014 at 10:57 PM, Andries E. Brouwer <[email protected]> wrote: > If I ask wget to download the wikipedia page > > http://he.wikipedia.org/wiki/ש._שפרה > > then I hope for a resulting file ש._שפרה. > Instead, wget gives me ש._שפר\327%94, where the \327 > is an unpronounceable byte that cannot be typed > (This is an UTF-8 system and the filename > that wget produces is not valid UTF-8.) > > Maybe it would be better if wget by default used the original filename. > This name mangling is a vestige of old times, it seems to me. > > Andries >
This is a commonly reported grievance and as you correctly mention a vestige of old times. With UTF-8 supported filesystems, Wget should simply write the correct characters. I sincerely hope this issue is resolved as fast as possible, but I know not how to. Those who understand i18n should work on this. -- Thanking You, Darshit Shah
