On Monday, 6 February 2017 05:02:57 CET William Prescott wrote:
> Hello,
>
> I'm encountering a problem when recursively downloading from a website when
> the URL contains a tilde and the page encoding claims to be Shift JIS.
>
> I've tried both Wget 1.17.1 (from Ubuntu 16.04) and 1.19 (from source,
> with Libidn2 0.16).
> I believe my local character encoding is UTF-8.
>
> The first page will download okay, but then most pages after it will get the
> tilde converted to "%E2%80%BE" ("‾"), which, as one would expect, doesn't
> work.

Hi William,

reproducible by:

$ echo '~'|iconv -f SHIFT-JIS -t utf-8
‾
$ echo -n '~'|iconv -f SHIFT-JIS -t utf-8|od -t x1
0000000 e2 80 be

So this seems not to be a Wget issue, but maybe a general character
conversion issue. Not sure what Wget could do...

Regards, Tim
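
For illustration, here is a minimal Python sketch of how the byte 0x7E can end up as "%E2%80%BE" in a URL. It assumes a strict Shift JIS table in which the single-byte range follows JIS X 0201 Roman, so 0x5C is the yen sign and 0x7E is the overline (U+203E), which matches the iconv output above. The mapping table and the helper `decode_single_byte_sjis` are hypothetical and written by hand for this example, because codec implementations differ in how they treat this byte. This is not Wget's code path.

```python
from urllib.parse import quote

# Assumption: the two positions where JIS X 0201 Roman (the single-byte
# half of a strict Shift JIS table) differs from ASCII.
JIS_ROMAN_OVERRIDES = {0x5C: "\u00a5", 0x7E: "\u203e"}


def decode_single_byte_sjis(data: bytes) -> str:
    """Decode bytes below 0x80 with the strict table (hypothetical helper)."""
    out = []
    for b in data:
        if b >= 0x80:
            raise ValueError("this sketch only handles bytes below 0x80")
        # Use the override where one exists, otherwise the ASCII character.
        out.append(JIS_ROMAN_OVERRIDES.get(b, chr(b)))
    return "".join(out)


# A made-up URL path containing a tilde, as bytes from a Shift JIS page.
path = decode_single_byte_sjis(b"/~user/index.html")

# Percent-encode the decoded path as UTF-8, as a URL would carry it.
print(quote(path))
```

The overline is a three-byte sequence in UTF-8 (e2 80 be), so once the tilde has been decoded as U+203E, percent-encoding produces exactly the sequence reported in the original message.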
