URL:
  <http://savannah.gnu.org/bugs/?47689>

                 Summary: Support for UTF-16 encoding.
                 Project: GNU Wget
            Submitted by: kenorb
            Submitted on: Wed 13 Apr 2016 06:42:52 PM GMT
                Category: Localization
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name: 
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
                 Release: 1.16.3
        Operating System: Mac OS
         Reproducibility: Every Time
           Fixed Release: None
         Planned Release: None
              Regression: None
           Work Required: None
          Patch Included: None

    _______________________________________________________

Details:

The following site has UTF-16 encoding:
http://www.free-energy-info.co.uk/
W3C claim it's UTF-16LE, but it's not relevant.

By default wget doesn't recognise the source of it, because it's not following
any links when using with -m or -r.

When specifying remote-encoding, it doesn't work either:

$ wget --remote-encoding=UTF-16 http://www.free-energy-info.co.uk/
This version does not have support for IRIs

The same for any format, including when specifying `--no-iri`.

What should be the fix in order that encoding of that site can be parsed by
wget?

Related: http://stackoverflow.com/q/36605946/55075




    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?47689>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/


Reply via email to