Re: OpenBSD 6.7 crashes on APU2C4 with LTE modem Huawei E3372s-153 HiLink

Łukasz Lejtkowski Fri, 29 May 2020 06:07:17 -0700

LTE modem works correctly under usb 2.0(EHCI), no panics. Uptime almost 2 days. 
Something is broken in xHCI (xhci.c?).
Any patch for 6.7 soon, or is it more complicated?



> On 27 May 2020, at 18:30, Łukasz Lejtkowski <emig...@gmail.com> wrote:
> 
> Right now LTE modem working on internal USB 2.0(EHCI). External is USB 
> 3.0(xHCI). We will see… :)
> 
> root@master[~]usbdevs -v
> Controller /dev/usb0:
> addr 01: 1022:0000 AMD, xHCI root hub
>        super speed, self powered, config 1, rev 1.00
>        driver: uhub0
> Controller /dev/usb1:
> addr 01: 1022:0000 AMD, EHCI root hub
>        high speed, self powered, config 1, rev 1.00
>        driver: uhub1
> addr 02: 0438:7900 Advanced Micro Devices, Hub
>        high speed, self powered, config 1, rev 0.18
>        driver: uhub2
> addr 03: 12d1:14dc HUAWEI_MOBILE, HUAWEI_MOBILE
>        high speed, self powered, config 1, rev 1.02
>        driver: cdce0
>        driver: umass0
> root@master[~]
> 
> <IMG_0278.JPG>
>> On 25 May 2020, at 13:19, Martin Pieuchot <m...@openbsd.org 
>> <mailto:m...@openbsd.org>> wrote:
>> 
>> On 25/05/20(Mon) 12:56, Gerhard Roth wrote:
>>> On 5/22/20 9:05 PM, Mark Kettenis wrote:
>>>>> From: Łukasz Lejtkowski <emig...@gmail.com <mailto:emig...@gmail.com>>
>>>>> Date: Fri, 22 May 2020 20:51:57 +0200
>>>>> 
>>>>> Probably power supply 12 V is broken. Showing 16,87 V(Fluke 179) -
>>>>> too high. Should be 12,25-12,50 V. I replaced to the new one.
>>>> 
>>>> That might be why the device stops responding.  The fact that cleaning
>>>> up from a failed USB transaction leads to this panic is a bug though.
>>>> 
>>>> And somebody just posted a very similar panic with ure(4).  Something
>>>> in the network stack is holding a mutex when it shouldn't.
>>> 
>>> I think that holding the mutex is ok. The bug is calling the stop
>>> routine in case of errors.
>>> 
>>> This is what common foo_start() does:
>>> 
>>>     m_head = ifq_deq_begin(&ifp->if_snd);
>>>     if (foo_encap(sc, m_head, 0)) {
>>>             ifq_deq_rollback(&ifp->if_snd, m_head);
>>>             ...
>>>             return;
>>>     }
>>>     ifq_deq_commit(&ifp->if_snd, m_head);
>>> 
>>> Here, ifq_deq_begin() grabs a mutex and it is held while
>>> calling foo_encap().
>>> 
>>> For USB network interfaces foo_encap() mostly does this:
>>> 
>>>     err = usbd_transfer(sc->sc_xfer);
>>>     if (err != USBD_IN_PROGRESS) {
>>>             foo_stop(sc);
>>>             return EIO;
>>>     }
>>> 
>>> And foo_stop() calls usbd_abort_pipe() -> xhci_command_submit(),
>>> which might sleep.
>>> 
>>> How to fix? We could do the foo_encap() after the ifq_deq_commit(),
>>> possibly dropping the current mbuf if encap fails (who cares
>>> for the packets after foo_stop() anyway).
>> 
>> That's the approach taken by drivers using ifq_dequeue(9) instead of
>> ifq_deq_begin/commit().
>> 
>>> Or change all the drivers to follow the path that if_aue.c takes:
>>> 
>>>     err = usbd_transfer(c->aue_xfer);
>>>     if (err != USBD_IN_PROGRESS) {
>>>             ...
>>>             /* Stop the interface from process context. */
>>>             usb_add_task(sc->aue_udev, &sc->aue_stop_task);
>>>             return (EIO);
>>>     }
>> 
>> That's just trading the current problem for another one with higher
>> complexity.
>> 
>>> Any ideas, what's better? Or alternative proposals?
>> 
>> Using ifq_dequeue(9) would have the advantage of unifying the code base.
>> It introduces a behavior change.  A simpler fix would be to call
>> foo_stop() in the error path after ifq_deq_rollback().
>

Re: OpenBSD 6.7 crashes on APU2C4 with LTE modem Huawei E3372s-153 HiLink

Reply via email to