Your message dated Tue, 1 Aug 2017 09:47:59 +0200
with message-id <[email protected]>
and subject line Re: Bug#798995: [bmon] Segfault after hours of viewing
packet-shaped queues/classes on a busy 100Mbit interface
has caused the Debian Bug report #798995,
regarding [bmon] Segfault after hours of viewing packet-shaped queues/classes
on a busy 100Mbit interface
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)
--
798995: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=798995
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: bmon
Version: 1:3.8-1
Severity: normal
Leave bmon running for a day or so showing bandwidth information on a
busy interface on a server (up to date Debian Testing), and it segfaults
- it appears to be to do with frequent class additions/removals you get
with fq_codel (looks like a bad pointer in a bmon list).
This is the shaping setup on the server's external interface for reference:
=======================================================================
tc qdisc add dev eth0 parent root handle 1: prio bands 4 priomap 1 1 1 1
1 1 1 1 1 1 1 1 1 1 1 1
tc qdisc add dev eth0 parent 1:1 handle 101: fq_codel
tc qdisc add dev eth0 parent 1:2 handle 102: fq_codel
tc qdisc add dev eth0 parent 1:3 handle 103: fq_codel
tc qdisc add dev eth0 parent 1:4 handle 104: fq_codel
=======================================================================
Backtrace attached, although it doesn't look too useful.
Thanks
Program received signal SIGSEGV, Segmentation fault.
0x000055555555b683 in __group_foreach_element (g=g@entry=0x55555577d3d0,
list=list@entry=0x5555558554b0,
cb=cb@entry=0x55555555c430 <element_check_if_dead>, arg=arg@entry=0x0) at
group.c:47
47 group.c: No such file or directory.
Thread 1 (Thread 0x7ffff7fe6700 (LWP 19118)):
#0 0x000055555555b683 in __group_foreach_element (g=g@entry=0x55555577d3d0,
list=list@entry=0x5555558554b0, cb=cb@entry=0x55555555c430
<element_check_if_dead>, arg=arg@entry=0x0)
at group.c:47
e = 0x5555559b9f30
n = 0x55555595f520
#1 0x000055555555b697 in __group_foreach_element (g=g@entry=0x55555577d3d0,
list=list@entry=0x5555558546e0, cb=cb@entry=0x55555555c430
<element_check_if_dead>, arg=arg@entry=0x0)
at group.c:48
e = 0x555555855470
n = 0x5555558546b0
#2 0x000055555555b697 in __group_foreach_element (g=g@entry=0x55555577d3d0,
list=list@entry=0x55555583bc60, cb=cb@entry=0x55555555c430
<element_check_if_dead>, arg=arg@entry=0x0)
at group.c:48
e = 0x5555558546a0
n = 0x55555583bc30
#3 0x000055555555b697 in __group_foreach_element (g=g@entry=0x55555577d3d0,
list=list@entry=0x5555558392f0, cb=cb@entry=0x55555555c430
<element_check_if_dead>, arg=arg@entry=0x0)
at group.c:48
e = 0x55555583bc20
n = 0x5555558392c0
#4 0x000055555555b697 in __group_foreach_element (g=0x55555577d3d0,
list=0x55555577d3e0,
cb=cb@entry=0x55555555c430 <element_check_if_dead>, arg=arg@entry=0x0) at
group.c:48
e = 0x5555558392b0
n = 0x55555577d3b0
#5 0x000055555555b71f in group_foreach_recursive (cb=0x55555555c430
<element_check_if_dead>, arg=0x0)
at group.c:67
g = <optimized out>
n = 0x55555576d410 <output_subsys+16>
#6 0x00005555555598be in main (argc=<optimized out>, argv=<optimized out>) at
bmon.c:359
c = {tv_sec = -1, tv_usec = 999931}
e = {tv_sec = 1442175175, tv_usec = 481737}
ri = {tv_sec = 1, tv_usec = 0}
tmp = {tv_sec = 0, tv_usec = 11826}
st = <optimized out>
sleep_time = 20000
read_interval = <optimized out>
signature.asc
Description: OpenPGP digital signature
--- End Message ---
--- Begin Message ---
fixed #798995 1:4.0-4
thanks
Am 31.07.2017 um 18:36 schrieb OmegaPhil:
> On 27/07/17 10:33, Patrick Matthäi wrote:
>> Am 26.07.2017 um 17:25 schrieb OmegaPhil:
>>> On 26/07/17 13:27, Patrick Matthäi wrote:
>>>> Hello,
>>>>
>>>> could you still reproduce this issue with 4.0? I am not able to
>>>> reproduce it on some jessie and stretch servers, running > 24 hours on
>>>> busy Gbit/s interfaces.
>>>>
>>> Hmm, something is going on. On my remote server, the individual classes
>>> associated with fq_codel qdiscs are not being shown, but on my local
>>> server they are - however even then there isn't constant change (classes
>>> appearing and disappearing etc). So in both cases, the bug associated
>>> with rapid class addition/removal won't be hit (screenshots attached).
>>>
>>> I'm definitely pushing less bandwidth than I did a few years ago atm,
>>> but I think something has changed on top of that.
>>>
>>> Regardless, I'll keep bmon running on the server for 4 days and report
>>> back. I've had a segfault 'recently', but I have no idea when that was.
>> Thank you for your investigation, I will await your feedback
>
> I can confirm that bmon is still running on both machines, so I'm unable
> to recreate the situation that leads to the bug, so I guess you can
> close this.
Much thanks for your testing, closing right now :)
--
/*
Mit freundlichem Gruß / With kind regards,
Patrick Matthäi
GNU/Linux Debian Developer
Blog: http://www.linux-dev.org/
E-Mail: [email protected]
[email protected]
*/
signature.asc
Description: OpenPGP digital signature
--- End Message ---