I have just been able to reproduce it by running this on the same server
that hosts the website example.org:

cd /tmp
while true; do rm -rf www.example.org/ && wget --mirror --page-requisites \
  https://www.example.org 2>&1 | grep -F GB/s; done

The 'grep -F GB/s' filter shows only the improbably high speeds. It's not
unusual to see dozens or even hundreds of GB/s there, like this:

2023-11-19 19:33:57 (116 GB/s) - ‘www.example.org/wp-
json/oembed/1.0/embed?url=https:%2F%2Fwww.example.org%2F2023%2F06%2F05%2Fredacted’
saved [2362/2362]

That's 2 kB, downloaded at 116 GB/s. Obviously a timer issue. This
illustrates that it's not hard to occasionally hit a rate in the TB/s
range, which would trigger the crash.
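
As a rough sanity check on these figures, the elapsed time that a reported
rate implies can be worked out from the saved size. This is only a sketch:
it assumes decimal units (1 GB/s = 1e9 bytes/s), which may not match how
wget scales its rates, and implied_ns is a helper name made up for this
example.

```shell
# Implied transfer time in nanoseconds for a given size and reported rate.
# Assumption: decimal units (1 GB/s = 1e9 bytes/s, 1 TB/s = 1e12 bytes/s).
implied_ns() {
    # $1 = bytes saved, $2 = reported rate in bytes per second
    awk -v bytes="$1" -v rate="$2" 'BEGIN { printf "%.2f\n", bytes / rate * 1e9 }'
}

implied_ns 2362 116e9   # the 116 GB/s sample (2362 bytes)
implied_ns 2201 4e12    # the 4.00 TB/s sample further down (2201 bytes)
```

Under that assumption the first sample would have taken about 20 ns and
the second well under a nanosecond, neither of which is plausible for a
real transfer.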

With the PPA, running the same loop but grepping for TB/s took about
30 minutes, but eventually:

2023-11-19 19:34:16 (4.00 TB/s) - ‘www.example.org/wp-
json/oembed/1.0/embed?url=https:%2F%2Fwww.example.org%2F2023%2F08%2F19%2Fredacted%2F’
saved [2201/2201]
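
For long runs like this, a loop that stops by itself at the first hit may
be more convenient than watching the output. A minimal sketch, assuming a
grep that supports -m (GNU and BSD grep do); repeat_until_match is a
hypothetical helper name, not something from wget or the PPA.

```shell
# Run a command repeatedly until its combined stdout/stderr contains the
# given fixed string, then print the first matching line and return.
repeat_until_match() {
    pattern=$1
    shift
    # The pipeline's status is grep's: 0 once a matching line was seen.
    until "$@" 2>&1 | grep -F -m 1 -- "$pattern"; do
        :
    done
}

# Hypothetical usage, same crawl as the loop above but stopping at the
# first TB/s line:
#   cd /tmp
#   repeat_until_match 'TB/s' sh -c \
#     'rm -rf www.example.org/ && wget --mirror --page-requisites https://www.example.org'
```

The crawl is restarted from scratch on every pass, just as in the original
loop, so the only difference is that it ends once a TB/s rate shows up.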

This is all from crawling a WordPress site. The chance of hitting this
seems higher when crawling a dynamic site than with static files,
possibly because wget doesn't start counting bytes until it has seen the
first one, and dynamic sites tend to serve the whole response from memory
at that point. That's probably also why it tends to happen on files of
around 2 kB, which fit in one or two TCP packets.

A simple PHP script that prints a few kB is probably enough to reproduce it.

-- 
You received this bug notification because you are a member of Ubuntu
Touch seeded packages, which is subscribed to wget in Ubuntu.
https://bugs.launchpad.net/bugs/2029930

Title:
  wget crash when printing download rate

Status in wget package in Ubuntu:
  Confirmed
Status in wget source package in Noble:
  Confirmed
Status in wget package in Debian:
  Confirmed

Bug description:
  
  All supported versions of Ubuntu suffer from crashes in wget when it
  prints the download speed. I've been getting this on various servers.
  It has been fixed upstream, and the fix should probably be included in
  'updates' of all supported Ubuntu versions.

  https://git.savannah.gnu.org/git/wget.git
  Commit 04ab35666997fbb3cd5d72497415fb3dfd62dcc5

  https://lists.gnu.org/archive/html/bug-wget/2023-08/msg00001.html

  Patch attached.
