URL: <http://savannah.gnu.org/bugs/?47689>
                 Summary: Support for UTF-16 encoding.
                 Project: GNU Wget
            Submitted by: kenorb
            Submitted on: Wed 13 Apr 2016 06:42:52 PM GMT
                Category: Localization
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name:
        Originator Email:
             Open/Closed: Open
         Discussion Lock: Any
                 Release: 1.16.3
        Operating System: Mac OS
         Reproducibility: Every Time
           Fixed Release: None
         Planned Release: None
              Regression: None
           Work Required: None
          Patch Included: None

    _______________________________________________________

Details:

The following site is encoded in UTF-16:

http://www.free-energy-info.co.uk/

W3C claims it's UTF-16LE, but that's not relevant.

By default wget doesn't recognise the source of the page: it doesn't follow
any links when used with -m or -r.

Specifying the remote encoding doesn't work either:

$ wget --remote-encoding=UTF-16 http://www.free-energy-info.co.uk/
This version does not have support for IRIs

The result is the same for any format, including when specifying `--no-iri`.

What should the fix be so that wget can parse the encoding of that site?

Related: http://stackoverflow.com/q/36605946/55075
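
One plausible reading of the symptom above, offered as an illustration and not as a statement about wget's internals: in UTF-16 every ASCII character is stored as two bytes, so a scanner that searches the raw bytes for ASCII markup such as `href=` finds nothing. The Python sketch below shows this on a made-up sample page, and shows that the links become visible once the bytes are decoded (and could then be re-encoded as UTF-8). The sample HTML, the function names, the naive `href` regex, and the little-endian fallback when no BOM is present are all assumptions made for this example.

```python
import codecs
import re


def decode_utf16(raw):
    """Decode UTF-16 bytes, honoring a BOM if present.

    Assumption: input without a BOM is treated as little-endian.
    """
    if raw.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return raw.decode("utf-16")  # the "utf-16" codec consumes the BOM
    return raw.decode("utf-16-le")


def extract_links(html):
    """Deliberately naive href matcher, for illustration only."""
    return re.findall(r'href="([^"]+)"', html)


# Hypothetical sample page, encoded as UTF-16LE with a BOM.
sample = '<html><body><a href="Chapter1.html">Chapter 1</a></body></html>'
raw = codecs.BOM_UTF16_LE + sample.encode("utf-16-le")

# A byte-oriented search for ASCII markup misses it in the raw bytes,
# because a zero byte follows every ASCII character.
found_in_raw = b"href=" in raw

# After decoding, the link is recoverable, and the text can be
# re-encoded as UTF-8 for tools that expect an ASCII-compatible encoding.
text = decode_utf16(raw)
links = extract_links(text)
utf8_bytes = text.encode("utf-8")

print(found_in_raw)
print(links)
```

The same conversion can be done outside Python with a tool such as iconv; the sketch only demonstrates why the raw bytes look link-free to a byte-oriented parser.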