URL:
  <http://savannah.gnu.org/bugs/?50514>

                 Summary: Convert Links touching embedded Javascript
                 Project: GNU Wget
            Submitted by: ages2500
            Submitted on: Sat 11 Mar 2017 07:34:12 PM UTC
                Category: Program Logic
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name: 
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
                 Release: 1.18
        Operating System: GNU/Linux
         Reproducibility: Every Time
           Fixed Release: None
         Planned Release: None
              Regression: No
           Work Required: None
          Patch Included: No

    _______________________________________________________

Details:

The convert links process, and maybe recursive retrieval process are touching
embedded javascript code and mangling the code up a bit. Historically, I
believe wget stayed away from touching javascript.

I attempt to grab the following page using:


wget -rkE -l inf -P wget_test -D dimensionality.com
http://www.dimensionality.com/freebeeexamples/freebieexample1-javascript.html


The original source code includes the following line:


document.write("<img src=\"rw1");


After wget converts links, the line reads as:


document.write("<img
src="http://www.dimensionality.com/freebeeexamples/\&quot;rw1&quot;);"


Ideally this line should be left as is, untouched.

Tested on wget v1.16 custom compiled from savanna source, on GNU/Linux Debian
7, and package install of wget v1.18 on Cygwin.




    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?50514>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/


Reply via email to