On Thu, 13 Aug 2020 03:03:46 +0000
Jiafei Pan <jiafei....@nxp.com> wrote:

> Any comments? Thanks.
>
> @Steven Rostedt, I thinks irq off checking is necessary especially

> This is probably more for Thomas Gleixner.
Thanks Steven.
@Thomas Gleixner, would you please review the patch? thanks.
Jiafei.
> for Preempt-RT kernel, because some context may be changed from irq 
> off to irq on when enable Preempt RT, I once met a issue that hrtimer 
> soft irq is lost when enabled Preempt RT, finally I found 
> napi_schedule_irqoff is called in hardware interrupt handler, there 
> maybe no issue for non RT kernel, but for Preempt RT, interrupt is 
> threaded, so irq is on in interrupt handler, the result is 
> __raise_softirq_irqoff is called in irq on context, so that per-CPU 
> softirq masking is corrupted because of the process of updating of 
> soft irq masking is interrupted and not a atomic operation , and then 
> caused hrtimer soft irq is lost. So I think adding irq status checking 
> in __raise_softirq_irqoff can report such issue directly and help us 
> to find the root cause of such issue.
>
> I know that there may be performance impaction to add extra checking 
> here, if it is the case, how about to include it in some debug 
> configuration items? Such as CONFIG_DEBUG_PREEMPT or other debug 
> items?
>


> Best Regards,
> Jiafei.
>
> -----Original Message-----
> From: Jiafei Pan <jiafei....@nxp.com>
> Sent: Thursday, August 6, 2020 12:07 PM
> To: pet...@infradead.org; mi...@kernel.org; t...@linutronix.de; 
> rost...@goodmis.org; romain.per...@gmail.com; w...@kernel.org
> Cc: linux-kernel@vger.kernel.org; linux-rt-us...@vger.kernel.org; 
> Jiafei Pan <jiafei....@nxp.com>; Leo Li <leoyang...@nxp.com>; Vladimir 
> Oltean <vladimir.olt...@nxp.com>; Jiafei Pan <jiafei....@nxp.com>
> Subject: [PATCH] softirq: add irq off checking for 
> __raise_softirq_irqoff
>
> __raise_softirq_irqoff will update per-CPU mask of pending softirqs, it need 
> to be called in irq disabled context in order to keep it atomic operation, 
> otherwise it will be interrupted by hardware interrupt, and per-CPU softirqs 
> pending mask will be corrupted, the result is there will be unexpected issue, 
> for example hrtimer soft irq will be losed and soft hrtimer will never be 
> expire and handled.

Please wrap your change logs.

>
> Adding irqs disabled checking here to provide warning in irqs enabled context.
>
> Signed-off-by: Jiafei Pan <jiafei....@nxp.com>
> ---
>  kernel/softirq.c | 5 +++++
>  1 file changed, 5 insertions(+)
>
> diff --git a/kernel/softirq.c b/kernel/softirq.c index 
> bf88d7f62433..11f61e54a3ae 100644
> --- a/kernel/softirq.c
> +++ b/kernel/softirq.c
> @@ -481,6 +481,11 @@ void raise_softirq(unsigned int nr)
>
>  void __raise_softirq_irqoff(unsigned int nr)  {
> +     /* This function can only be called in irq disabled context,
> +      * otherwise or_softirq_pending will be interrupted by hardware
> +      * interrupt, so that there will be unexpected issue.
> +      */
> +     WARN_ON_ONCE(!irqs_disabled());

Perhaps: lockdep_assert_irqs_disabled() is more appropriate, and doesn't add 
extra overhead on production systems.

-- Steve


>       trace_softirq_raise(nr);
>       or_softirq_pending(1UL << nr);
>  }
> --
> 2.17.1

Reply via email to