Re: [ovs-discuss] discuss Digest, Vol 172, Issue 35

2023-10-28 Thread Steve Relf via discuss
Sounds like this bug that got me the other week

https://github.com/openvswitch/ovs/commit/106ef21860c935e5e0017a88bf42b94025c4e511

Should be fixed in 2.17.2



Sent from Outlook for Android

From: discuss  on behalf of 
ovs-discuss-requ...@openvswitch.org 
Sent: Saturday, October 28, 2023 1:00:02 PM
To: ovs-discuss@openvswitch.org 
Subject: discuss Digest, Vol 172, Issue 35

Send discuss mailing list submissions to
ovs-discuss@openvswitch.org

To subscribe or unsubscribe via the World Wide Web, visit
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss
or, via email, send a message with subject or body 'help' to
ovs-discuss-requ...@openvswitch.org

You can reach the person managing the list at
ovs-discuss-ow...@openvswitch.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of discuss digest..."


Today's Topics:

   1. OVS Crashing and restarting every 1hr (Gavin McKee)


--

Message: 1
Date: Fri, 27 Oct 2023 15:50:07 -0700
From: Gavin McKee 
To: ovs-discuss 
Subject: [ovs-discuss] OVS Crashing and restarting every 1hr
Message-ID:

Content-Type: text/plain; charset="utf-8"

Hi,

Every hour we are seeing OVS crash with the following message

ovs-vsctl --version
ovs-vsctl (Open vSwitch) 2.17.6
DB Schema 8.3.0

2023-10-27T00:33:47.163Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T01:33:47.277Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T02:33:47.391Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T03:33:47.503Z|4|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T04:33:47.613Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T05:33:47.729Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T06:33:47.841Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T07:33:47.885Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T08:33:47.981Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T09:33:48.040Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T10:33:48.112Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T11:33:48.228Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T12:33:48.249Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T13:33:48.273Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T14:33:48.315Z|5|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T15:33:48.433Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T16:33:48.488Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T17:33:48.514Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T18:33:48.630Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T19:33:48.744Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T20:33:48.857Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T21:33:48.973Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()
2023-10-27T22:33:49.093Z|00956|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
assertion offset + size <= b->size failed in ofpbuf_at_assert()


gdb shows the following

Using host li

Re: [ovs-discuss] OVS Crashing and restarting every 1hr

2023-10-28 Thread Frode Nordahl via discuss
Hello, Gavin,

This looks familiar and I wonder if it is fixed by [0]? It is also
available in 2.17.7 [1].

0:
https://github.com/openvswitch/ovs/commit/106ef21860c935e5e0017a88bf42b94025c4e511
1:
https://github.com/openvswitch/ovs/commit/111c7be3193e15e2acf8af8ceb74a1177a95806d

--
Frode Nordahl

lør. 28. okt. 2023, 00:50 skrev Gavin McKee via discuss <
ovs-discuss@openvswitch.org>:

> Hi,
>
> Every hour we are seeing OVS crash with the following message
>
> ovs-vsctl --version
> ovs-vsctl (Open vSwitch) 2.17.6
> DB Schema 8.3.0
>
> 2023-10-27T00:33:47.163Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T01:33:47.277Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T02:33:47.391Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T03:33:47.503Z|4|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T04:33:47.613Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T05:33:47.729Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T06:33:47.841Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T07:33:47.885Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T08:33:47.981Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T09:33:48.040Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T10:33:48.112Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T11:33:48.228Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T12:33:48.249Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T13:33:48.273Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T14:33:48.315Z|5|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T15:33:48.433Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T16:33:48.488Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T17:33:48.514Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T18:33:48.630Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T19:33:48.744Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T20:33:48.857Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T21:33:48.973Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
> 2023-10-27T22:33:49.093Z|00956|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>
>
> gdb shows the following
>
> Using host libthread_db library "/lib64/libthread_db.so.1".
> Core was generated by `ovs-vswitchd unix:/var/run/openvswitch/db.sock 
> -vconsole:emer -vsyslog:err -vfi'.
> Program terminated with signal SIGABRT, Aborted.
> #0  0x7f1b7e54854c in __pthread_kill_implementation () from 
> /lib64/libc.so.6
> [Current thread is 1 (Thread 0x7f1b6fbfd640 (LWP 1617989))]
> (gdb) bt
> #0  0x7f1b7e54854c in __pthread_kill_implementation () from 
> /lib64/libc.so.6
> #1  0x7f1b7e4fbce6 in raise () from /lib64/libc.so.6
> #2  0x7f1b7e4cf7f3 in abort () from /lib64/libc.so.6
> #3  0x55eb8de11d74 in ovs_abort_valist (err_no=err_no@entry=0,
> format=format@entry=0x55eb8e01ca18 "%s: assertion %s failed in %s()",
> args=args@entry=0x7f1b6fbad030) at ../lib/util.c:444
> #4  0x55eb8de1c335 in 

Re: [ovs-discuss] OVS Crashing and restarting every 1hr

2023-10-28 Thread Gavin McKee via discuss
Frode,

Thanks for the help . We will try to apply these patches .

Out of interest why would OVS crash exactly on an hourly interval ? Is
there some process that runs on an hourly interval ?

Gav

On Sat, Oct 28, 2023 at 7:56 AM Frode Nordahl 
wrote:

> Hello, Gavin,
>
> This looks familiar and I wonder if it is fixed by [0]? It is also
> available in 2.17.7 [1].
>
> 0:
> https://github.com/openvswitch/ovs/commit/106ef21860c935e5e0017a88bf42b94025c4e511
> 1:
> https://github.com/openvswitch/ovs/commit/111c7be3193e15e2acf8af8ceb74a1177a95806d
>
> --
> Frode Nordahl
>
> lør. 28. okt. 2023, 00:50 skrev Gavin McKee via discuss <
> ovs-discuss@openvswitch.org>:
>
>> Hi,
>>
>> Every hour we are seeing OVS crash with the following message
>>
>> ovs-vsctl --version
>> ovs-vsctl (Open vSwitch) 2.17.6
>> DB Schema 8.3.0
>>
>> 2023-10-27T00:33:47.163Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T01:33:47.277Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T02:33:47.391Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T03:33:47.503Z|4|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T04:33:47.613Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T05:33:47.729Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T06:33:47.841Z|6|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T07:33:47.885Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T08:33:47.981Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T09:33:48.040Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T10:33:48.112Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T11:33:48.228Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T12:33:48.249Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T13:33:48.273Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T14:33:48.315Z|5|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T15:33:48.433Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T16:33:48.488Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T17:33:48.514Z|2|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T18:33:48.630Z|3|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T19:33:48.744Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T20:33:48.857Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T21:33:48.973Z|1|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>> 2023-10-27T22:33:49.093Z|00956|util(handler17)|EMER|../include/openvswitch/ofpbuf.h:194:
>> assertion offset + size <= b->size failed in ofpbuf_at_assert()
>>
>>
>> gdb shows the following
>>
>> Using host libthread_db library "/lib64/libthread_db.so.1".
>> Core was generated by `ovs-vswitchd unix:/var/run/openvswitch/db.sock 
>> -vconsole:emer -vsyslog:err -vfi'.
>> Program terminated with signal SIGABRT, Aborted.
>> #0  0x7f1b7e54854c in __pthread_kill_implementation () from 
>> /lib64/libc.so.6
>> [Current thread is 1 (Thread 0x7f1b6fbfd640 (LWP 1617989))]
>> (gdb) bt
>> #0  0x7f1b7e54854c in __pthread_kill_implementation () from 
>> /lib64/libc.so.6
>>