[Xenomai-core] Re: RT-Socket-CAN bus error rate and latencies

Wolfgang Grandegger Wed, 21 Mar 2007 12:30:05 -0800

Oliver Hartkopp wrote:

Wolfgang Grandegger wrote:
Wolfgang Grandegger wrote:
But flooding can still occur and weare thinking about a better way of downscaling or temporarily disablingthem. Socket-CAN currently restarts the controller after 200 bus errors.My preferred solution for RT-Socket-CAN currently is to stop the CANcontroller after a kernel configurable amount of successive bus errors.More clever ideas and comments are welcome?
What do you think about the following method?
   config XENO_DRIVERS_CAN_SJA1000_BUS_ERR_LIMIT
        depends on XENO_DRIVERS_CAN_SJA1000
        int "Maximum number of successive bus errors"
        range 0 255
        default 20
        help

        CAN bus errors are very useful for analyzing electrical problems
         but they can come at a very high rate resulting in interrupt
         flooding with bad impact on system performance and real-time
         behavior. This option, if greater than 0, will limit the amount
         of successive bus error interrupts. If the limit is reached, an
         error message with "can_id = CAN_ERR_BUSERR_FLOOD" is sent. The
         bus error counter gets reset on restart of the device and on any
         successful message transmission or reception. Be aware that bus
         error interrupts are only enabled if at least one socket is
         listening on bus errors.
Hi Wolfgang,
what would be the wanted behaviour, after the discussed problem of buserror flooding occurred?

Well, I think the bus error rate should be downscaled without loosingvital information concerning the cause of the problem and it shouldrequire as little user intervention as possible. Treating it like a buserror as currently done in Socket-CAN is a bit to strong in my mind.

Can the Controller be assumed to be 'slightly dead', or what? Is thereany chance that the bus heals by itself (=> no more bus errors) and canbe used in a normal way? Or is a user interaction recommended or _required_?

Yes, if you plug the cable, the bus errors might go away and the TX doneinterrupt will arrive or you get a bus-off (I have seen both).

Indeed the slow down of bus errors is a reasonable approach, but yoursuggested method leaves too many questions open for the user :-/


What questions?

I would tend to reduce the notifications to the user by creating a timerat the first bus error interrupt. The first BE irq would lead to aCAN_ERR_BUSERROR and after a (configurable) time (e.g.250ms) the nextinformation about bus errors is allowed to be passed to the user. Afterthis time period is over a new CAN_ERR_BUSERROR may be passed to theuser containing the count of occurred bus errors somewhere in thedata[]-section of the Error Frame. When a normal RX/TX-interruptindicates a 'working' CAN again, the timer would be terminated.
Instead of a fix configurable time we could also think about a dynamicbehaviour (e.g. with increasing periods).
What do you think about this?

The question is if one bus-error does provide enough information on thecause of the electrical problem or if a sequence is better. Furthermore,I personally regard the use of timers as to heavy. But the solution isfeasible, of course. Any other opinions?


Thanks.

Wolfgang.

_______________________________________________
Xenomai-core mailing list
Xenomai-core@gna.org
https://mail.gna.org/listinfo/xenomai-core

[Xenomai-core] Re: RT-Socket-CAN bus error rate and latencies

Reply via email to