URL: <http://savannah.gnu.org/bugs/?50514>
Summary: Convert Links touching embedded Javascript Project: GNU Wget Submitted by: ages2500 Submitted on: Sat 11 Mar 2017 07:34:12 PM UTC Category: Program Logic Severity: 3 - Normal Priority: 5 - Normal Status: None Privacy: Public Assigned to: None Originator Name: Originator Email: Open/Closed: Open Discussion Lock: Any Release: 1.18 Operating System: GNU/Linux Reproducibility: Every Time Fixed Release: None Planned Release: None Regression: No Work Required: None Patch Included: No _______________________________________________________ Details: The convert links process, and maybe recursive retrieval process are touching embedded javascript code and mangling the code up a bit. Historically, I believe wget stayed away from touching javascript. I attempt to grab the following page using: wget -rkE -l inf -P wget_test -D dimensionality.com http://www.dimensionality.com/freebeeexamples/freebieexample1-javascript.html The original source code includes the following line: document.write("<img src=\"rw1"); After wget converts links, the line reads as: document.write("<img src="http://www.dimensionality.com/freebeeexamples/\"rw1");" Ideally this line should be left as is, untouched. Tested on wget v1.16 custom compiled from savanna source, on GNU/Linux Debian 7, and package install of wget v1.18 on Cygwin. _______________________________________________________ Reply to this item at: <http://savannah.gnu.org/bugs/?50514> _______________________________________________ Message sent via/by Savannah http://savannah.gnu.org/