subject:"\[PATCH V4\] notifier\/panic\: Introduce panic_notifier

[PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-08 Thread Guilherme G. Piccoli

The kernel notifier infrastructure allows function callbacks to be
added in multiple lists, which are then called in the proper time,
like in a reboot or panic event. The panic_notifier_list specifically
contains the callbacks that are executed during a panic event. As any
other notifier list, the panic one has no filtering and all functions
previously registered are executed.

The kdump infrastructure, on the other hand, enables users to set
a crash kernel that is kexec'ed in a panic event, and vmcore/logs
are collected in such crash kernel. When kdump is set, by default
the panic notifiers are ignored - the kexec jumps to the crash kernel
before the list is checked and callbacks executed.

There are some cases though in which kdump users might want to
allow panic notifier callbacks to execute _before_ the kexec to
the crash kernel, for a variety of reasons - for example, users
may think kexec is very prone to fail and want to give a chance
to kmsg dumpers to run (and save logs using pstore), or maybe
some panic notifier is required to properly quiesce some hardware
that must be used to the crash kernel. For these cases, we have
the kernel parameter "crash_kexec_post_notifiers".

But there's a problem: currently it's an "all-or-nothing" situation,
the kdump user choice is either to execute all panic notifiers or
none of them. Given that panic notifiers may increase the risk of a
kdump failure, this is a tough decision and may affect the debug of
hard to reproduce bugs, if for some reason the user choice is to
enable panic notifiers, but kdump then fails.

So, this patch aims to ease this decision: we hereby introduce a filter
for the panic notifier list, in which users may select specifically
which callbacks they wish to run, allowing a safer kdump. The allowlist
should be provided using the parameter "panic_notifier_filter=a,b,..."
where a, b are valid callback names. Invalid symbols are discarded.

Currently up to 16 symbols may be passed in this list, we consider
that this numbers allows enough flexibility (and no matter what
architecture is used, at most 30 panic callbacks are registered).
In an experiment using a qemu x86 virtual machine, by default only
six callbacks are registered in the panic notifier list.
Once a valid callback name is provided in the list, such function
is allowed to be registered/unregistered in the panic_notifier_list;
all other panic callbacks are ignored. Notice that this filter is
only for the panic notifiers and has no effect in the other notifiers.

Signed-off-by: Guilherme G. Piccoli 
---



V4:

* Add some more clean-up suggestion from Andy (thanks).


V3:

* Implemented Alan's suggestion (thanks!), simplifying the check code
in the notifiers register/unregister functions. Notice that the
suggestion was missing a negative in the check function, I even
renamed it now, to be more clear:
s/is_panic_notifier_filtered/should_register_panic_notifier

The condition is !(A && B && C), and C is the check function, that
returns true when a symbol *is found* in the notifier filter; hence
we need to invert that here, as you can see in the code.


* Implemented Andy's suggestion (thanks!), to reduce the code in
the parameter parsing loop. Notice that strsep() modifies the buffer,
so not sure if it was a typo but the correct code here is:

single_param = strsep(&full_param_buffer, ",");


* "Bumped" the log output of the parsing function: users should be
warned in the errors (invalid symbol or exceeded entries) and
informed (pr_info) in case the parsing succeeded - I think pr_debug
was useless there.

Cheers,

Guilherme



 .../admin-guide/kernel-parameters.txt | 14 +-
 include/linux/panic_notifier.h| 10 +
 kernel/notifier.c | 44 +--
 kernel/panic.c| 39 
 4 files changed, 102 insertions(+), 5 deletions(-)

diff --git a/Documentation/admin-guide/kernel-parameters.txt 
b/Documentation/admin-guide/kernel-parameters.txt
index 2fba82431efb..2dc4e98823ae 100644
--- a/Documentation/admin-guide/kernel-parameters.txt
+++ b/Documentation/admin-guide/kernel-parameters.txt
@@ -3727,13 +3727,25 @@
panic_on_warn   panic() instead of WARN().  Useful to cause kdump
on a WARN().
 
+   panic_notifier_filter=[function-list]
+   Limit the functions registered by the panic notifier
+   infrastructure. This allowlist is composed by function
+   names, comma separated (invalid symbols are filtered
+   out). Such functionality is useful for kdump users
+   that set "crash_kexec_post_notifiers" in order to
+   execute panic notifiers, but at the same time wish to
+   have just a subset of notifiers, not all of them. The
+   list of functions is limited to 16 entries currently.
+
crash_k

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-14 Thread Guilherme G. Piccoli

Hey folks, sorry for the ping.
But is there any extra reviews? All comments are much appreciated!

Dave, what do you think about the patch? I remember we talked about it
in [0], seems you considered that a good idea right?

Thanks in advance,


Guilherme


[0]
https://lore.kernel.org/lkml/yckaz79zg5hde...@dhcp-128-65.nay.redhat.com/

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-16 Thread Baoquan He

On 01/08/22 at 12:34pm, Guilherme G. Piccoli wrote:
.. 
> So, this patch aims to ease this decision: we hereby introduce a filter
> for the panic notifier list, in which users may select specifically
> which callbacks they wish to run, allowing a safer kdump. The allowlist
> should be provided using the parameter "panic_notifier_filter=a,b,..."
> where a, b are valid callback names. Invalid symbols are discarded.
> 
> Currently up to 16 symbols may be passed in this list, we consider
> that this numbers allows enough flexibility (and no matter what
> architecture is used, at most 30 panic callbacks are registered).
> In an experiment using a qemu x86 virtual machine, by default only
> six callbacks are registered in the panic notifier list.
> Once a valid callback name is provided in the list, such function
> is allowed to be registered/unregistered in the panic_notifier_list;
> all other panic callbacks are ignored. Notice that this filter is
> only for the panic notifiers and has no effect in the other notifiers.
> 
> Signed-off-by: Guilherme G. Piccoli 

This patch looks good to me, thx.

Acked-by: Baoquan He 


___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-17 Thread Guilherme G. Piccoli

On 16/01/2022 10:11, Baoquan He wrote:
> [...]
> This patch looks good to me, thx.
> 
> Acked-by: Baoquan He 
> 

Thanks a lot Baoquan He !

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-18 Thread Guilherme G. Piccoli

On 17/01/2022 23:31, Dave Young wrote:
> [...]
> Hi Guilherme,  thank you for making it a formal patch!  Yes, it is a
> nice improvement.
> I'm sorry I did not get time to review the code,  I will leave it to
> other people to review :)

Hi Dave, no worries - thanks for your opinion, much appreciated!
Cheers,


Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-20 Thread Petr Mladek

Adding some more people into Cc. Some modified the logic in the past.
Some are familiar with some interesting areas where the panic
notfiers are used.

On Sat 2022-01-08 12:34:51, Guilherme G. Piccoli wrote:
> The kernel notifier infrastructure allows function callbacks to be
> added in multiple lists, which are then called in the proper time,
> like in a reboot or panic event. The panic_notifier_list specifically
> contains the callbacks that are executed during a panic event. As any
> other notifier list, the panic one has no filtering and all functions
> previously registered are executed.
> 
> The kdump infrastructure, on the other hand, enables users to set
> a crash kernel that is kexec'ed in a panic event, and vmcore/logs
> are collected in such crash kernel. When kdump is set, by default
> the panic notifiers are ignored - the kexec jumps to the crash kernel
> before the list is checked and callbacks executed.
> 
> There are some cases though in which kdump users might want to
> allow panic notifier callbacks to execute _before_ the kexec to
> the crash kernel, for a variety of reasons - for example, users
> may think kexec is very prone to fail and want to give a chance
> to kmsg dumpers to run (and save logs using pstore),

Yes, this seems to be original intention for the
"crash_kexec_post_notifiers" option, see the commit
f06e5153f4ae2e2f3b0300f ("kernel/panic.c: add
"crash_kexec_post_notifiers" option for kdump after panic_notifiers")


> some panic notifier is required to properly quiesce some hardware
> that must be used to the crash kernel.

Do you have any example, please? The above mentioned commit
says "crash_kexec_post_notifiers" actually increases risk
of kdump failure.

Note that kmsg_dump() is called after the notifiers only because
some are printing more information, see the commit
6723734cdff15211bb78a ("panic: call panic handlers before kmsg_dump").
They might still increase the change that kmsg_dump() will never
be called.


> But there's a problem: currently it's an "all-or-nothing" situation,
> the kdump user choice is either to execute all panic notifiers or
> none of them. Given that panic notifiers may increase the risk of a
> kdump failure, this is a tough decision and may affect the debug of
> hard to reproduce bugs, if for some reason the user choice is to
> enable panic notifiers, but kdump then fails.
>
> So, this patch aims to ease this decision: we hereby introduce a filter
> for the panic notifier list, in which users may select specifically
> which callbacks they wish to run, allowing a safer kdump. The allowlist
> should be provided using the parameter "panic_notifier_filter=a,b,..."
> where a, b are valid callback names. Invalid symbols are discarded.

I am afraid that this is almost unusable solution:

   + requires deep knowledge of what each notifier does
   + might need debugging what notifier causes problems
   + the list might need to be updated when new notifiers are added
   + function names are implementation detail and might change
   + requires kallsyms


It is only workaround for a real problem. The problem is that
"panic_notifier_list" is used for many purposes that break
each other.

I checked some notifiers and found few groups:

   + disable watchdogs:
  + hung_task_panic()
  + rcu_panic()

   + dump information:
  + kernel_offset_notifier()
  + trace_panic_handler() (duplicate of panic_print=0x10)

   + inform hypervisor
  + xen_panic_event()
  + pvpanic_panic_notify()
  + hyperv_panic_event()

   + misc cleanup / flush / blinking
  + panic_event()   in ipmi_msghandler.c
  + panic_happened()   in heartbeat.c
  + led_trigger_panic_notifier()


IMHO, the right solution is to split the callbacks into 2 or more
notifier list. Then we might rework panic() to do:

void panic(void)
{
[...]

/* stop watchdogs + extra info */
atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 0, 
buf);
atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
panic_print_sys_info();

/* crash_kexec + kmsg_dump in configurable order */
if (!_crash_kexec_post_kmsg_dump) {
__crash_kexec(NULL);
smp_send_stop();
} else {
crash_smp_send_stop();
}

kmsg_dump();
if (_crash_kexec_post_kmsg_dump)
__crash_kexec(NULL);

/* infinite loop or reboot */
atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, buf);
atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);

console_flush_on_panic(CONSOLE_FLUSH_PENDING);

if (panic_timeout >= 0) {
timeout();
emergency_restart();
}

for (i = 0; ; i += PANIC_TIMER_STEP) {
if (i >= i_next) {
i += panic_blink(state ^= 1);
i_next = i + 3600 / PANIC_BLINK_SPD;

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-21 Thread Guilherme G. Piccoli

Hi Petr, thanks for the great response, and for CCing more (potentially)
interested parties! Some comments inline below; also, I'm CCing Michael
Kelley as well.


On 20/01/2022 12:14, Petr Mladek wrote:
> Adding some more people into Cc. Some modified the logic in the past.
> Some are familiar with some interesting areas where the panic
> notfiers are used.
> 
> On Sat 2022-01-08 12:34:51, Guilherme G. Piccoli wrote:
>> [...]
>> There are some cases though in which kdump users might want to
>> allow panic notifier callbacks to execute _before_ the kexec to
>> the crash kernel, for a variety of reasons - for example, users
>> may think kexec is very prone to fail and want to give a chance
>> to kmsg dumpers to run (and save logs using pstore),
> 
> Yes, this seems to be original intention for the
> "crash_kexec_post_notifiers" option, see the commit
> f06e5153f4ae2e2f3b0300f ("kernel/panic.c: add
> "crash_kexec_post_notifiers" option for kdump after panic_notifiers")
> 
>> some panic notifier is required to properly quiesce some hardware
>> that must be used to the crash kernel.
> 
> Do you have any example, please? The above mentioned commit
> says "crash_kexec_post_notifiers" actually increases risk
> of kdump failure.
> 
> Note that kmsg_dump() is called after the notifiers only because
> some are printing more information, see the commit
> 6723734cdff15211bb78a ("panic: call panic handlers before kmsg_dump").
> They might still increase the change that kmsg_dump() will never
> be called.
> 

Sure! I guess Michael Kelley's response here is the perfect example:
https://lore.kernel.org/lkml/mwhpr21mb1593a32a3433f5f262796fcfd7...@mwhpr21mb1593.namprd21.prod.outlook.com/

In my understanding, he is referring the function hyperv_panic_event().
But I also found another 2 examples in a quick look: bcm_vk_on_panic()
and brcmstb_pm_panic_notify().


>> [...]
>> So, this patch aims to ease this decision: we hereby introduce a filter
>> for the panic notifier list, in which users may select specifically
>> which callbacks they wish to run, allowing a safer kdump. The allowlist
>> should be provided using the parameter "panic_notifier_filter=a,b,..."
>> where a, b are valid callback names. Invalid symbols are discarded.
> 
> I am afraid that this is almost unusable solution:
> 
>+ requires deep knowledge of what each notifier does
>+ might need debugging what notifier causes problems
>+ the list might need to be updated when new notifiers are added
>+ function names are implementation detail and might change
>+ requires kallsyms
> 
> 
> It is only workaround for a real problem. The problem is that
> "panic_notifier_list" is used for many purposes that break
> each other.
> 
> I checked some notifiers and found few groups:
> 
>+ disable watchdogs:
>   + hung_task_panic()
>   + rcu_panic()
> 
>+ dump information:
>   + kernel_offset_notifier()
>   + trace_panic_handler() (duplicate of panic_print=0x10)
> 
>+ inform hypervisor
>   + xen_panic_event()
>   + pvpanic_panic_notify()
>   + hyperv_panic_event()
> 
>+ misc cleanup / flush / blinking
>   + panic_event()   in ipmi_msghandler.c
>   + panic_happened()   in heartbeat.c
>   + led_trigger_panic_notifier()
> 
> 
> IMHO, the right solution is to split the callbacks into 2 or more
> notifier list. Then we might rework panic() to do:
> 
> void panic(void)
> {
>   [...]
> 
>   /* stop watchdogs + extra info */
>   atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 0, 
> buf);
>   atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
>   panic_print_sys_info();
> 
>   /* crash_kexec + kmsg_dump in configurable order */
>   if (!_crash_kexec_post_kmsg_dump) {
>   __crash_kexec(NULL);
>   smp_send_stop();
>   } else {
>   crash_smp_send_stop();
>   }
> 
>   kmsg_dump();
>   if (_crash_kexec_post_kmsg_dump)
>   __crash_kexec(NULL);
> 
>   /* infinite loop or reboot */
>   atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, buf);
>   atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);
> 
>   console_flush_on_panic(CONSOLE_FLUSH_PENDING);
> [...] 
> Two notifier lists might be enough in the above scenario. I would call
> them:
> 
>   panic_pre_dump_notifier_list
>   panic_post_dump_notifier_list
> 
> 
> It is a real solution that will help everyone. It is more complicated now
> but it will makes things much easier in the long term. And it might be done
> step by step:
> 
>  1. introduce the two notifier lists
>  2. convert all users: one by one
>  3. remove the original notifier list when there is no user

That's a great idea! I'm into it, if we have a consensus. The thing that
scares me most here is that this is a big change and consumes time to
implement - I'd not risk such time if somebody is really against that.
So, let's

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-22 Thread Baoquan He

On 01/21/22 at 05:31pm, Guilherme G. Piccoli wrote:
..
> > IMHO, the right solution is to split the callbacks into 2 or more
> > notifier list. Then we might rework panic() to do:
> > 
> > void panic(void)
> > {
> > [...]
> > 
> > /* stop watchdogs + extra info */
> > atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 0, 
> > buf);
> > atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > panic_print_sys_info();
> > 
> > /* crash_kexec + kmsg_dump in configurable order */
> > if (!_crash_kexec_post_kmsg_dump) {
> > __crash_kexec(NULL);
> > smp_send_stop();
> > } else {
> > crash_smp_send_stop();
> > }
> > 
> > kmsg_dump();
> > if (_crash_kexec_post_kmsg_dump)
> > __crash_kexec(NULL);
> > 
> > /* infinite loop or reboot */
> > atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, buf);
> > atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);
> > 
> > console_flush_on_panic(CONSOLE_FLUSH_PENDING);
> > [...] 
> > Two notifier lists might be enough in the above scenario. I would call
> > them:
> > 
> > panic_pre_dump_notifier_list
> > panic_post_dump_notifier_list
> > 
> > 
> > It is a real solution that will help everyone. It is more complicated now
> > but it will makes things much easier in the long term. And it might be done
> > step by step:
> > 
> >  1. introduce the two notifier lists
> >  2. convert all users: one by one
> >  3. remove the original notifier list when there is no user
> 
> That's a great idea! I'm into it, if we have a consensus. The thing that
> scares me most here is that this is a big change and consumes time to
> implement - I'd not risk such time if somebody is really against that.
> So, let's see more opinions, maybe the kdump maintainers have good input.

I am fine with it. As long as thing is made clear, glad to see code is
refactored to be more understandable and improved. Earlier, during several
rounds of discussion between you and Petr, seveal pitfalls have been
pointed out and avoided.

Meanwhile, I would suggest Masa and HATAYAMA to help give input about
panic_notifier usage and refactory. AFAIK, they contributed code and use
panic_notifier in their product or environment a lot, that will be very
helpful to get the first hand information from them.

Hi Masa, HATAYANA,

Any comment on this? (Please ignore this if it's not in your care.)


___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-23 Thread Masami Hiramatsu

On Sat, 22 Jan 2022 18:55:14 +0800
Baoquan He  wrote:

> On 01/21/22 at 05:31pm, Guilherme G. Piccoli wrote:
> ..
> > > IMHO, the right solution is to split the callbacks into 2 or more
> > > notifier list. Then we might rework panic() to do:
> > > 
> > > void panic(void)
> > > {
> > >   [...]
> > > 
> > >   /* stop watchdogs + extra info */
> > >   atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 0, 
> > > buf);
> > >   atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > >   panic_print_sys_info();
> > > 
> > >   /* crash_kexec + kmsg_dump in configurable order */
> > >   if (!_crash_kexec_post_kmsg_dump) {
> > >   __crash_kexec(NULL);
> > >   smp_send_stop();
> > >   } else {
> > >   crash_smp_send_stop();
> > >   }
> > > 
> > >   kmsg_dump();
> > >   if (_crash_kexec_post_kmsg_dump)
> > >   __crash_kexec(NULL);
> > > 
> > >   /* infinite loop or reboot */
> > >   atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, buf);
> > >   atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);
> > > 
> > >   console_flush_on_panic(CONSOLE_FLUSH_PENDING);
> > > [...] 
> > > Two notifier lists might be enough in the above scenario. I would call
> > > them:
> > > 
> > >   panic_pre_dump_notifier_list
> > >   panic_post_dump_notifier_list
> > > 
> > > 
> > > It is a real solution that will help everyone. It is more complicated now
> > > but it will makes things much easier in the long term. And it might be 
> > > done
> > > step by step:
> > > 
> > >  1. introduce the two notifier lists
> > >  2. convert all users: one by one
> > >  3. remove the original notifier list when there is no user
> > 
> > That's a great idea! I'm into it, if we have a consensus. The thing that
> > scares me most here is that this is a big change and consumes time to
> > implement - I'd not risk such time if somebody is really against that.
> > So, let's see more opinions, maybe the kdump maintainers have good input.
> 
> I am fine with it. As long as thing is made clear, glad to see code is
> refactored to be more understandable and improved. Earlier, during several
> rounds of discussion between you and Petr, seveal pitfalls have been
> pointed out and avoided.
> 
> Meanwhile, I would suggest Masa and HATAYAMA to help give input about
> panic_notifier usage and refactory. AFAIK, they contributed code and use
> panic_notifier in their product or environment a lot, that will be very
> helpful to get the first hand information from them.
> 
> Hi Masa, HATAYANA,
> 
> Any comment on this? (Please ignore this if it's not in your care.)

No, that looks good idea to me. BTW, the 'dump' in the new notifieers
means both kmsg_dump and crash dump, right?

Thank you,

-- 
Masami Hiramatsu 

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-24 Thread d.hatay...@fujitsu.com



> On 01/21/22 at 05:31pm, Guilherme G. Piccoli wrote:
> ..
> > > IMHO, the right solution is to split the callbacks into 2 or more
> > > notifier list. Then we might rework panic() to do:
> > >
> > > void panic(void)
> > > {
> > > [...]
> > >
> > > /* stop watchdogs + extra info */
> > > atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 0, 
> > > buf);
> > > atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > > panic_print_sys_info();
> > >
> > > /* crash_kexec + kmsg_dump in configurable order */
> > > if (!_crash_kexec_post_kmsg_dump) {
> > > __crash_kexec(NULL);
> > > smp_send_stop();
> > > } else {
> > > crash_smp_send_stop();
> > > }
> > >
> > > kmsg_dump();
> > > if (_crash_kexec_post_kmsg_dump)
> > > __crash_kexec(NULL);
> > >
> > > /* infinite loop or reboot */
> > > atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, buf);
> > > atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);
> > >
> > > console_flush_on_panic(CONSOLE_FLUSH_PENDING);
> > > [...]
> > > Two notifier lists might be enough in the above scenario. I would call
> > > them:
> > >
> > > panic_pre_dump_notifier_list
> > > panic_post_dump_notifier_list
> > >
> > >
> > > It is a real solution that will help everyone. It is more complicated now
> > > but it will makes things much easier in the long term. And it might be 
> > > done
> > > step by step:
> > >
> > >  1. introduce the two notifier lists
> > >  2. convert all users: one by one
> > >  3. remove the original notifier list when there is no user
> >
> > That's a great idea! I'm into it, if we have a consensus. The thing that
> > scares me most here is that this is a big change and consumes time to
> > implement - I'd not risk such time if somebody is really against that.
> > So, let's see more opinions, maybe the kdump maintainers have good input.
> 
> I am fine with it. As long as thing is made clear, glad to see code is
> refactored to be more understandable and improved. Earlier, during several
> rounds of discussion between you and Petr, seveal pitfalls have been
> pointed out and avoided.
> 
> Meanwhile, I would suggest Masa and HATAYAMA to help give input about
> panic_notifier usage and refactory. AFAIK, they contributed code and use
> panic_notifier in their product or environment a lot, that will be very
> helpful to get the first hand information from them.
> 
> Hi Masa, HATAYANA,
> 
> Any comment on this? (Please ignore this if it's not in your care.)
> 

Thanks for CCing to me. I like this patch set. I have same motivation.

For example, when I used crash_kexec_post_notifiers, I sometimes ran into
deadlock in printk's exclusion logic during the call of panic notifiers since
kaslr outputs kernel offset at panic by dump_kernel_offset() via panic notifers
(although this might never happen now thanks to lockless implementation).

The problem is that in the current design, we have to run all the
tasks registered, although most of them are actually unnecessary for
other users' requirements. Each user wants to call only their own handlers
in order to keep kdump as reliable as possible.

I've just started reading this patch set and have no other comments for now.

Thanks.
HATAYAMA, Daisuke

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-24 Thread Baoquan He

On 01/23/22 at 10:07pm, Masami Hiramatsu wrote:
> On Sat, 22 Jan 2022 18:55:14 +0800
> Baoquan He  wrote:
> 
> > On 01/21/22 at 05:31pm, Guilherme G. Piccoli wrote:
> > ..
> > > > IMHO, the right solution is to split the callbacks into 2 or more
> > > > notifier list. Then we might rework panic() to do:
> > > > 
> > > > void panic(void)
> > > > {
> > > > [...]
> > > > 
> > > > /* stop watchdogs + extra info */
> > > > 
> > > > atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 0, 
> > > > buf);
> > > > atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > > > panic_print_sys_info();
> > > > 
> > > > /* crash_kexec + kmsg_dump in configurable order */
> > > > if (!_crash_kexec_post_kmsg_dump) {
> > > > __crash_kexec(NULL);
> > > > smp_send_stop();
> > > > } else {
> > > > crash_smp_send_stop();
> > > > }
> > > > 
> > > > kmsg_dump();
> > > > if (_crash_kexec_post_kmsg_dump)
> > > > __crash_kexec(NULL);
> > > > 
> > > > /* infinite loop or reboot */
> > > > atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, 
> > > > buf);
> > > > atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);
> > > > 
> > > > console_flush_on_panic(CONSOLE_FLUSH_PENDING);
> > > > [...] 
> > > > Two notifier lists might be enough in the above scenario. I would call
> > > > them:
> > > > 
> > > > panic_pre_dump_notifier_list
> > > > panic_post_dump_notifier_list
> > > > 
> > > > 
> > > > It is a real solution that will help everyone. It is more complicated 
> > > > now
> > > > but it will makes things much easier in the long term. And it might be 
> > > > done
> > > > step by step:
> > > > 
> > > >  1. introduce the two notifier lists
> > > >  2. convert all users: one by one
> > > >  3. remove the original notifier list when there is no user
> > > 
> > > That's a great idea! I'm into it, if we have a consensus. The thing that
> > > scares me most here is that this is a big change and consumes time to
> > > implement - I'd not risk such time if somebody is really against that.
> > > So, let's see more opinions, maybe the kdump maintainers have good input.
> > 
> > I am fine with it. As long as thing is made clear, glad to see code is
> > refactored to be more understandable and improved. Earlier, during several
> > rounds of discussion between you and Petr, seveal pitfalls have been
> > pointed out and avoided.
> > 
> > Meanwhile, I would suggest Masa and HATAYAMA to help give input about
> > panic_notifier usage and refactory. AFAIK, they contributed code and use
> > panic_notifier in their product or environment a lot, that will be very
> > helpful to get the first hand information from them.
> > 
> > Hi Masa, HATAYANA,
> > 
> > Any comment on this? (Please ignore this if it's not in your care.)
> 
> No, that looks good idea to me. BTW, the 'dump' in the new notifieers
> means both kmsg_dump and crash dump, right?

Thanks for quick response, Masa.

I guess it's crash dump, namely kdump.

About pre_dump, if the dump is crash dump, hope those pre_dump notifiers
will be executed under conditional check, e.g only if 
'crash_kexec_post_notifiers'
is specified in kernel cmdline. 


___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-24 Thread Baoquan He

On 01/24/22 at 11:43am, d.hatay...@fujitsu.com wrote:
> 
> > On 01/21/22 at 05:31pm, Guilherme G. Piccoli wrote:
> > ..
> > > > IMHO, the right solution is to split the callbacks into 2 or more
> > > > notifier list. Then we might rework panic() to do:
> > > >
> > > > void panic(void)
> > > > {
> > > > [...]
> > > >
> > > > /* stop watchdogs + extra info */
> > > > atomic_notifier_call_chain(&panic_disable_watchdogs_notifier_list, 
> > > > 0, buf);
> > > > atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > > > panic_print_sys_info();
> > > >
> > > > /* crash_kexec + kmsg_dump in configurable order */
> > > > if (!_crash_kexec_post_kmsg_dump) {
> > > > __crash_kexec(NULL);
> > > > smp_send_stop();
> > > > } else {
> > > > crash_smp_send_stop();
> > > > }
> > > >
> > > > kmsg_dump();
> > > > if (_crash_kexec_post_kmsg_dump)
> > > > __crash_kexec(NULL);
> > > >
> > > > /* infinite loop or reboot */
> > > > atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 0, buf);
> > > > atomic_notifier_call_chain(&panic_rest_notifier_list, 0, buf);
> > > >
> > > > console_flush_on_panic(CONSOLE_FLUSH_PENDING);
> > > > [...]
> > > > Two notifier lists might be enough in the above scenario. I would call
> > > > them:
> > > >
> > > > panic_pre_dump_notifier_list
> > > > panic_post_dump_notifier_list
> > > >
> > > >
> > > > It is a real solution that will help everyone. It is more complicated 
> > > > now
> > > > but it will makes things much easier in the long term. And it might be 
> > > > done
> > > > step by step:
> > > >
> > > >  1. introduce the two notifier lists
> > > >  2. convert all users: one by one
> > > >  3. remove the original notifier list when there is no user
> > >
> > > That's a great idea! I'm into it, if we have a consensus. The thing that
> > > scares me most here is that this is a big change and consumes time to
> > > implement - I'd not risk such time if somebody is really against that.
> > > So, let's see more opinions, maybe the kdump maintainers have good input.
> > 
> > I am fine with it. As long as thing is made clear, glad to see code is
> > refactored to be more understandable and improved. Earlier, during several
> > rounds of discussion between you and Petr, seveal pitfalls have been
> > pointed out and avoided.
> > 
> > Meanwhile, I would suggest Masa and HATAYAMA to help give input about
> > panic_notifier usage and refactory. AFAIK, they contributed code and use
> > panic_notifier in their product or environment a lot, that will be very
> > helpful to get the first hand information from them.
> > 
> > Hi Masa, HATAYANA,
> > 
> > Any comment on this? (Please ignore this if it's not in your care.)
> > 
> 
> Thanks for CCing to me. I like this patch set. I have same motivation.
> 
> For example, when I used crash_kexec_post_notifiers, I sometimes ran into
> deadlock in printk's exclusion logic during the call of panic notifiers since
> kaslr outputs kernel offset at panic by dump_kernel_offset() via panic 
> notifers
> (although this might never happen now thanks to lockless implementation).
> 
> The problem is that in the current design, we have to run all the
> tasks registered, although most of them are actually unnecessary for
> other users' requirements. Each user wants to call only their own handlers
> in order to keep kdump as reliable as possible.

If I unerstand you correclty, you are expressing your favour to the
panic_notifier filter this patch adds. I personally like it very much
either because I know users only expect to run those one or several
handlers they added or cared about, from discussing reported cases
realted to them, just as you said.

Now comments to patch lean to split and classify the current panic
notifiers list into two or several sub-lists and execute them in
different order. I think this improvement will benefit people who
defaults to execute all panic notifiers, while the panic notifier
fileter is also very helpful if can be added.

Hope I got you right, HATAYAMA. And thanks a lot for your quick
response.


___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-24 Thread Guilherme G. Piccoli

On 24/01/2022 10:59, Baoquan He wrote:
> [...]
> About pre_dump, if the dump is crash dump, hope those pre_dump notifiers
> will be executed under conditional check, e.g only if 
> 'crash_kexec_post_notifiers'
> is specified in kernel cmdline. 

Hi Baoquan, based on Petr's suggestion, I think pre_dump would be
responsible for really *non-intrusive/non-risky* tasks and should be
always executed in the panic path (before kdump), regardless of
"crash_kexec_post_notifiers".

The idea is that the majority of the notifiers would be executed in the
post_dump portion, and for that, we have the
"crash_kexec_post_notifiers" conditional. I also suggest we have
blacklist options (based on function names) for both notifiers, in order
to make kdump issues debug easier.

Do you agree with that? Feel free to comment with suggestions!
Cheers,

Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-25 Thread d.hatay...@fujitsu.com

> @@ -146,6 +157,34 @@ void nmi_panic(struct pt_regs *regs, const char *msg)
>  }
>  EXPORT_SYMBOL(nmi_panic);
> 
> +static int __init panic_notifier_filter_setup(char *buf)
> +{
> +   char *func;
> +   unsigned long addr;
> +
> +   while ((func = strsep(&buf, ","))) {
> +   addr = kallsyms_lookup_name(func);
> +   if (!addr) {
> +   pr_warn("panic_notifier_filter: invalid symbol %s\n", 
> func);
> +   continue;
> +   }

Could you remove this check?

panic_notifier_list is exported to kernel modules and this check
prevents such users from using this feature.

Thanks.
HATAYAMA, Daisuke

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-25 Thread Guilherme G. Piccoli

On 25/01/2022 08:50, d.hatay...@fujitsu.com wrote:
>> +   while ((func = strsep(&buf, ","))) {
>> +   addr = kallsyms_lookup_name(func);
>> +   if (!addr) {
>> +   pr_warn("panic_notifier_filter: invalid symbol 
>> %s\n", func);
>> +   continue;
>> +   }
> 
> Could you remove this check?
> 
> panic_notifier_list is exported to kernel modules and this check
> prevents such users from using this feature.
> 
> Thanks.
> HATAYAMA, Daisuke

Hi, thanks for the review. First of all, notice that it's very likely
this patch isn't gonna get merged this way, we are considering a
refactor that included 2 panic notifiers: one a bit earlier (pre_dump),
that includes functions less risky, as watchdog unloaders, kernel offset
dump, etc, and the second panic notifier (post_dump) will keep the
majority of callbacks, and can be conditionally executed on kdump
through the usage of "crash_kexec_post_notifiers".

Anyway, I'm curious with your code review - how can we use this filter
with modules, if the filter setup is invoked as early_param(), before
modules load? In that case, module functions won't have a valid address,
correct? So, in that moment, this lookup fails, we cannot record an
unloaded module address in such list. Please, correct me if I'm wrong.

Cheers,

Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-25 Thread d.hatay...@fujitsu.com

> Hi, thanks for the review. First of all, notice that it's very likely
> this patch isn't gonna get merged this way, we are considering a
> refactor that included 2 panic notifiers: one a bit earlier (pre_dump),
> that includes functions less risky, as watchdog unloaders, kernel offset
> dump, etc, and the second panic notifier (post_dump) will keep the
> majority of callbacks, and can be conditionally executed on kdump
> through the usage of "crash_kexec_post_notifiers".

But the pre_dump cannot avoid calling multiple unnecessary handlers, right?
It's more risky than the previous idea...

> Anyway, I'm curious with your code review - how can we use this filter
> with modules, if the filter setup is invoked as early_param(), before
> modules load? In that case, module functions won't have a valid address,
> correct? So, in that moment, this lookup fails, we cannot record an
> unloaded module address in such list. Please, correct me if I'm wrong.

For example, how about simply maintaining function symbol names in the list
as string, not address.

Thanks.
HATAYAMA, Daisuke

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-25 Thread Baoquan He

On 01/24/22 at 11:48am, Guilherme G. Piccoli wrote:
> On 24/01/2022 10:59, Baoquan He wrote:
> > [...]
> > About pre_dump, if the dump is crash dump, hope those pre_dump notifiers
> > will be executed under conditional check, e.g only if 
> > 'crash_kexec_post_notifiers'
> > is specified in kernel cmdline. 
> 
> Hi Baoquan, based on Petr's suggestion, I think pre_dump would be
> responsible for really *non-intrusive/non-risky* tasks and should be
> always executed in the panic path (before kdump), regardless of
> "crash_kexec_post_notifiers".
> 
> The idea is that the majority of the notifiers would be executed in the
> post_dump portion, and for that, we have the
> "crash_kexec_post_notifiers" conditional. I also suggest we have
> blacklist options (based on function names) for both notifiers, in order
> to make kdump issues debug easier.
> 
> Do you agree with that? Feel free to comment with suggestions!
> Cheers,

I would say "please NO" cautiously.

As Petr said, kdump mostly works only if people configure it correctly.
That's because we try best to switch to kdump kernel from the fragile
panicked kernel immediately. When we try to add anthing before the switching,
please consider carefully and ask if that adding is mandatory, otherwise
switching into kdump kernel may fail. If the answer is yes, the adding
is needed and welcomed. Othewise, any unnecessary action, including any
"non-intrusive/non-risky" tasks, would be unwelcomed.

Surely, we don't oppose the "non-intrusive/non-risky" or completely
"intrusive/risky" action adding before kdump kernel switching, with a
conditional limitation. When we handle customers' kdump support, we
explicitly declare we only support normal and default kdump operation.
If any action which is done before switching into kdump kernel is specified,
e.g panic_notifier, panic_print, they need take care of their own kdump
failure.

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-26 Thread d.hatay...@fujitsu.com

> On 01/24/22 at 11:48am, Guilherme G. Piccoli wrote:
> > On 24/01/2022 10:59, Baoquan He wrote:
> > > [...]
> > > About pre_dump, if the dump is crash dump, hope those pre_dump notifiers
> > > will be executed under conditional check, e.g only if 
> > > 'crash_kexec_post_notifiers'
> > > is specified in kernel cmdline.
> >
> > Hi Baoquan, based on Petr's suggestion, I think pre_dump would be
> > responsible for really *non-intrusive/non-risky* tasks and should be
> > always executed in the panic path (before kdump), regardless of
> > "crash_kexec_post_notifiers".
> >
> > The idea is that the majority of the notifiers would be executed in the
> > post_dump portion, and for that, we have the
> > "crash_kexec_post_notifiers" conditional. I also suggest we have
> > blacklist options (based on function names) for both notifiers, in order
> > to make kdump issues debug easier.
> >
> > Do you agree with that? Feel free to comment with suggestions!
> > Cheers,
>
> I would say "please NO" cautiously.
>
> As Petr said, kdump mostly works only if people configure it correctly.
> That's because we try best to switch to kdump kernel from the fragile
> panicked kernel immediately. When we try to add anthing before the switching,
> please consider carefully and ask if that adding is mandatory, otherwise
> switching into kdump kernel may fail. If the answer is yes, the adding
> is needed and welcomed. Othewise, any unnecessary action, including any
> "non-intrusive/non-risky" tasks, would be unwelcomed.
>
> Surely, we don't oppose the "non-intrusive/non-risky" or completely
> "intrusive/risky" action adding before kdump kernel switching, with a
> conditional limitation. When we handle customers' kdump support, we
> explicitly declare we only support normal and default kdump operation.
> If any action which is done before switching into kdump kernel is specified,
> e.g panic_notifier, panic_print, they need take care of their own kdump
> failure.

Sorry for bringing back the past discussion, but how about
reconsidering the following design?

- kdump-specific notifier list within crash_kexec()

I don't think that the current implementation of
crash_kexec_post_notifiers is ideal, where panic() and panic notifier
are used, which contains the code that is unnecessary for kdump, and
it unnecessarily decreases kdump's reliability. The presence of kdump
code in panic() also conversely makes panic()'s code unnecessarily
complicated.

I use crash_kexec_post_notifiers with the understanding that it
affects kdump's reliability to some degree, but at the same time I
want it to be as reliable as possible.

I think introducing kdump-specific notifier list will resolve the
issues caused by dependencies to panic()'s code.

In addition, as I already said in another mail, I want to avoid
invoking any other handlers passed by users other than me, in order to
keep kdump's reliability. For such purpose, I think some feature like
Guilherme's panic notifier filter is needed.

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-26 Thread Petr Mladek

On Wed 2022-01-26 11:10:39, Baoquan He wrote:
> On 01/24/22 at 11:48am, Guilherme G. Piccoli wrote:
> > On 24/01/2022 10:59, Baoquan He wrote:
> > > [...]
> > > About pre_dump, if the dump is crash dump, hope those pre_dump notifiers
> > > will be executed under conditional check, e.g only if 
> > > 'crash_kexec_post_notifiers'
> > > is specified in kernel cmdline. 
> > 
> > Hi Baoquan, based on Petr's suggestion, I think pre_dump would be
> > responsible for really *non-intrusive/non-risky* tasks and should be
> > always executed in the panic path (before kdump), regardless of
> > "crash_kexec_post_notifiers".
> > 
> > The idea is that the majority of the notifiers would be executed in the
> > post_dump portion, and for that, we have the
> > "crash_kexec_post_notifiers" conditional. I also suggest we have
> > blacklist options (based on function names) for both notifiers, in order
> > to make kdump issues debug easier.
> > 
> > Do you agree with that? Feel free to comment with suggestions!
> > Cheers,
> 
> I would say "please NO" cautiously.
> 
> As Petr said, kdump mostly works only if people configure it correctly.
> That's because we try best to switch to kdump kernel from the fragile
> panicked kernel immediately. When we try to add anthing before the switching,
> please consider carefully and ask if that adding is mandatory, otherwise
> switching into kdump kernel may fail. If the answer is yes, the adding
> is needed and welcomed. Othewise, any unnecessary action, including any
> "non-intrusive/non-risky" tasks, would be unwelcomed.

I still do not have the complete picture. But it seems that some
actions make always sense even for kdump:

+ Super safe operations that might disable churn from broken
  system. For examle, disabling watchdogs by setting a single
  variable, see rcu_panic() notifier

+ Actions needed that allow to kexec the crash kernel a safe
  way under some hypervisor, see

https://lore.kernel.org/r/mwhpr21mb15933573f5c81c5250bf6a1cd7...@mwhpr21mb1593.namprd21.prod.outlook.com

> Surely, we don't oppose the "non-intrusive/non-risky" or completely
> "intrusive/risky" action adding before kdump kernel switching, with a
> conditional limitation. When we handle customers' kdump support, we
> explicitly declare we only support normal and default kdump operation.
> If any action which is done before switching into kdump kernel is specified,
> e.g panic_notifier, panic_print, they need take care of their own kdump
> failure.

All this actually started because of kmsg_dump. It might make sense to
allow both kmsg_dump and kdump together. The messages stored by
kmsg_dump might be better than nothing when kdump fails.

It actually seems to be the main motivation to introduce
"crash_kexec_post_notifier" parameter, see the commit
f06e5153f4ae2e2f3b03 ("kernel/panic.c: add "crash_kexec_post_notifiers"
option for kdump after panic_notifers").

And this patch introduces panic_notifier_filter that tries to select
notifiers that are helpful and harmful. IMHO, it is almost unusable.
It seems that even kernel developers do not understand what exactly
some notifiers do and why they are needed. Usually only the author
and people familiar with the subsystem have some idea. It makes
it pretty hard for anyone to create a reasonable filter.

I am pretty sure that we could do better. I propose to add more
notifier lists that will be called at various places with reasonable
rules and restrictions. Then the susbsystem maintainers could decide
where exactly a given action must be done.

The result might be that we will need only few options that will
enable/disable some well defined optional features.

Best Regards,
Petr

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-27 Thread Guilherme G. Piccoli

On 25/01/2022 10:06, d.hatay...@fujitsu.com wrote:
> 
> But the pre_dump cannot avoid calling multiple unnecessary handlers, right?
> It's more risky than the previous idea...
> 

I think we could have 2 kernel parameters then:

crash_kernel_disable_pre_notitifers (of course we can think in some
better name here heh)

crash_kernel_enable_post_notifiers (which is the same as the current
"crash_kernel_post_notifiers", we can keep it)

The point being (if I understand correctly): some callbacks are really
simple and don't introduce risks for kdump, like the RCU; a bunch of
them just set one variable. Those could be enable by default, before the
kdump.

The majority would fit in the 2nd group, meaning they are not enabled by
default, requiring some parameter for that.

Petr, let me know if that makes sense and is aligned with your suggestion.

> For example, how about simply maintaining function symbol names in the list
> as string, not address.
> 

I considered that before, it was my first idea but it's not great due to
memory allocation. We'd need to use memblock to allocate a struct to
hold function names, and the comparison on register time is slower, I
guess... so it's much easier to pre-allocate some handlers and only
track the addresses of the function. I personally do not see much use in
this filter for module callbacks, but if that's a use case, we can think
on how to do that. But notice that the current implementation of the
filter wont hold if we end-up following the suggestions in this thread,
not sure even if we're gonna have a filter...

Cheers,

Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-28 Thread Petr Mladek

On Thu 2022-01-27 14:16:20, Guilherme G. Piccoli wrote:
> On 25/01/2022 10:06, d.hatay...@fujitsu.com wrote:
> > 
> > But the pre_dump cannot avoid calling multiple unnecessary handlers, right?
> > It's more risky than the previous idea...
> > 
> 
> I think we could have 2 kernel parameters then:
> 
> crash_kernel_disable_pre_notitifers (of course we can think in some
> better name here heh)
> 
> crash_kernel_enable_post_notifiers (which is the same as the current
> "crash_kernel_post_notifiers", we can keep it)
> 
> The point being (if I understand correctly): some callbacks are really
> simple and don't introduce risks for kdump, like the RCU; a bunch of
> them just set one variable. Those could be enable by default, before the
> kdump.
> 
> The majority would fit in the 2nd group, meaning they are not enabled by
> default, requiring some parameter for that.
> 
> Petr, let me know if that makes sense and is aligned with your suggestion.

First, I am sorry for the very long mail. But the problem is really
complicated. I did my best to describe it a clean way.

I have discussed these problems with a colleague and he had some good
points. And my view evolved even further.

There are two groups of people interested in panic() behavior:

1. Users wants to get as reliable as possible: kdump, kmsg_dump,
   console log, useful last message on screen, reboot, hypervisor
   notification.

   Different users have different priorities according to the use case.

2. Subsystem maintainers and developers that need to do something
   special in panic(). They have to deal with the user requirements
   and bug reports.

   Most operations in panic() have unclear results because the system
   is in unclear state. Maintainers and developers wants to make their
   life easier. They do not want to deal with problems caused by
   others. So that they want to disable others or run as early as
   possible.

   It is nicely visible. kdump maintainer is afraid of calling
   anything before kdump. Many people support the idea of filtering
   because it moves problems to the user side.

I see two basic problems here: ordering and reliability:

1. Ordering problems are partly solved by configuration and partly by
   definition. I mean that:

  + kdump, kmsg_dump, panic_print_sys_info() are optional
  + console output is the best effort; more risky in final flush
  + reboot, infinite loop are the very last step

   IMHO, the ordering should be pretty clear:

  + panic_print_sys_info(), kmsg_dump(), kdump(), console flush, reboot

   Why?

  + panic_print_sys_info(), kmsg_dump(), kdump() are all optional
   and disabled by default
  + Users might want panic_print_sys_info() in kmsg_dump() and kdump()
  + Users might prefer kmsg_dump() over kdump()
  + kdump() is the last operation when enabled

   Where are panic notifiers in this picture?
   Where are CPUs stopped?

2. Reliability is the best effort and any subsystem should do its
   best.

   Users need to be aware (documentation, warning) that:

  + kmsg_dump() is less reliable when panic_print_sys_info() is enabled
  + kdump() is less reliable when panic_print_sys_info() and/or
kmsg_dump() is enabled.

   Where are panic notifiers in this picture?
   How stopped CPUs affect reliability?

Regarding panic notifiers. They look like a problematic black box:

+ ordering against other operations is not clear
+ are not safe enough
+ various users require some and do not need others
+ some are too complex so that only few people know what
  they do

So far, we have two proposals how to handle panic notifiers:

1. Allow to filter them with parameter:

 + pros:
+ it allows users to customize and work around problems

 + cons:
+ ordering is still not clear

+ user has to know what he does; note that sometimes only
  few people know what their notifier does

+ hard to use in the long term; callbacks names are
  implementation detail; new notifiers are added

+ lower motivation to solve problems; easy to wave them with
  "just disable it when it does not work for you..."

2. Split notifiers into more lists:

+ pros:
+ might solve ordering problems

+ subsystem maintainers could find the proper and more safe
  location

+ cons:
+ subsystem maintainers tend to think that their problem is
  the most important one; they will tend to do the operation
  as early as possible; so that even dangerous operations
  might be done early  => the original problem is still there

+ it might not motivate developers enough to make the notifiers as
  safe as possible

+ some might still need to be optional; for example, it should
  be possible to disable hypervisor notifier when it breaks
  kdump

Regarding stopped CPUs, it looks like a good idea most of the

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-01-30 Thread Baoquan He

On 01/26/22 at 02:20pm, Petr Mladek wrote:
> On Wed 2022-01-26 11:10:39, Baoquan He wrote:
> > On 01/24/22 at 11:48am, Guilherme G. Piccoli wrote:
> > > On 24/01/2022 10:59, Baoquan He wrote:
> > > > [...]
> > > > About pre_dump, if the dump is crash dump, hope those pre_dump notifiers
> > > > will be executed under conditional check, e.g only if 
> > > > 'crash_kexec_post_notifiers'
> > > > is specified in kernel cmdline. 
> > > 
> > > Hi Baoquan, based on Petr's suggestion, I think pre_dump would be
> > > responsible for really *non-intrusive/non-risky* tasks and should be
> > > always executed in the panic path (before kdump), regardless of
> > > "crash_kexec_post_notifiers".
> > > 
> > > The idea is that the majority of the notifiers would be executed in the
> > > post_dump portion, and for that, we have the
> > > "crash_kexec_post_notifiers" conditional. I also suggest we have
> > > blacklist options (based on function names) for both notifiers, in order
> > > to make kdump issues debug easier.
> > > 
> > > Do you agree with that? Feel free to comment with suggestions!
> > > Cheers,
> > 
> > I would say "please NO" cautiously.
> > 
> > As Petr said, kdump mostly works only if people configure it correctly.
> > That's because we try best to switch to kdump kernel from the fragile
> > panicked kernel immediately. When we try to add anthing before the 
> > switching,
> > please consider carefully and ask if that adding is mandatory, otherwise
> > switching into kdump kernel may fail. If the answer is yes, the adding
> > is needed and welcomed. Othewise, any unnecessary action, including any
> > "non-intrusive/non-risky" tasks, would be unwelcomed.
> 
> I still do not have the complete picture. But it seems that some
> actions make always sense even for kdump:
> 
> + Super safe operations that might disable churn from broken
>   system. For examle, disabling watchdogs by setting a single
>   variable, see rcu_panic() notifier
> 
> + Actions needed that allow to kexec the crash kernel a safe
>   way under some hypervisor, see
>   
> https://lore.kernel.org/r/mwhpr21mb15933573f5c81c5250bf6a1cd7...@mwhpr21mb1593.namprd21.prod.outlook.com

Yes, I agree with this after going through threads of discussion again.
There is much space we can do something for panic_notifier, and it might
be a good time to do now with these discussion and some clarification.

> 
> 
> > Surely, we don't oppose the "non-intrusive/non-risky" or completely
> > "intrusive/risky" action adding before kdump kernel switching, with a
> > conditional limitation. When we handle customers' kdump support, we
> > explicitly declare we only support normal and default kdump operation.
> > If any action which is done before switching into kdump kernel is specified,
> > e.g panic_notifier, panic_print, they need take care of their own kdump
> > failure.
> 
> All this actually started because of kmsg_dump. It might make sense to
> allow both kmsg_dump and kdump together. The messages stored by
> kmsg_dump might be better than nothing when kdump fails.

I think this can be done later, after panics notifiers are combed and
tidied up.

> 
> It actually seems to be the main motivation to introduce
> "crash_kexec_post_notifier" parameter, see the commit
> f06e5153f4ae2e2f3b03 ("kernel/panic.c: add "crash_kexec_post_notifiers"
> option for kdump after panic_notifers").

>From discussion with Hitachi and FJ engineers, they use
crash_kexec_post_notifiers when 1st kernel panicked and kdump kernel is
not so stable to function. In that case, the captured information 
with best effort after panic in 1st kernel can help analyze what
happened in 1st kernel, and also might give hint on why kdump kernel
is unstbale. But they will not add crash_kexec_post_notifiers in kernel
cmdline by default. The unstable kdump kernel rarely happened, need be
debugged and investigated. 

> 
> And this patch introduces panic_notifier_filter that tries to select
> notifiers that are helpful and harmful. IMHO, it is almost unusable.
> It seems that even kernel developers do not understand what exactly
> some notifiers do and why they are needed. Usually only the author
> and people familiar with the subsystem have some idea. It makes
> it pretty hard for anyone to create a reasonable filter.

Then people can select the notifiers they know to execute. E.g for
Hyper-V, they can run the notifiers related. And as HATAYAMA mentioned,
they have the same expection. This kind of filter is not for people to
blindly pick and run, but for the professional.

I think the chaos is from not being monitored entirely. People can
freely register one when they need. The priority, impaction to the
entirety is not considered by subsystem developer.

> 
> I am pretty sure that we could do better. I propose to add more
> notifier lists that will be called at various places with reasonable
> rules and restrictions. Then the susbsystem maintainers could decide
> where exactly a given ac

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-02-08 Thread Guilherme G. Piccoli

On 28/01/2022 10:38, Petr Mladek wrote:
> [...] On Thu 2022-01-27 14:16:20, Guilherme G. Piccoli wrote:
> First, I am sorry for the very long mail. But the problem is really
> complicated. I did my best to describe it a clean way.
> 
> I have discussed these problems with a colleague and he had some good
> points. And my view evolved even further.

Thanks Petr for the very comprehensive and detailed email - this helps a
lot in shaping the future of panic notifier(s)!


> [...] 
> I think about the following solution:
> 
> + split the notifiers into three lists:
> 
>   + info: stop watchdogs, provide extra info
>   + hypervisor: poke hypervisor
>   + reboot: actions needed only when crash dump did not happen
> 
> + allow to call hypervisor notifiers before or after kdump
> 
> + stop CPUs before kdump when either hypervisor notifiers or
>   kmsg_dump is enabled
> 
> Note that it still allows to call kdump as the first action when
> hypervisor notifiers are called after kdump and no kmsg dumper
> is registered.
> 
> 
> void panic(void)
> {
>   [...]
> 
>   if (crash_kexec_post_hypervisor || panic_print || enabled_kmsg_dump()) {
>   /*
>* Stop CPUs when some extra action is required before
>* crash dump. We will need architecture dependent extra
>* works in addition to stopping other CPUs.
>*/
>crash_smp_send_stop();
>cpus_stopped = true;
>   }
> 
>   if (crash_kexec_post_hypervisor) {
> /* Tell hypervisor about the panic */
> atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> 0, buf);
>   }
> 
>   if (enabled_kmsg_dump) {
> /*
>  * Print extra info by notifiers.
>  * Prevent rumors, for example, by stopping watchdogs.
>  */
> atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
>   }
> 
>   /* Optional extra info */
>   panic_printk_sys_info();
> 
>   /* No dumper by default */
>   kmsg_dump();
> 
>   /* Used only when crash kernel loaded */
>   __crash_kexec(NULL);
> 
>   if (!cpus_stopped) {
>   /*
>* Note smp_send_stop is the usual smp shutdown function, which
>* unfortunately means it may not be hardened to work in a
>* panic situation.
>*/
>   smp_send_stop();
>   }
> 
>   if (!crash_kexec_post_hypervisor) {
> /* Tell hypervisor about the panic */
> atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> 0, buf);
>   }
> 
>   if (!enabled_kmsg_dump) {
> /*
>  * Print extra info by notifiers.
>  * Prevent rumors, for example, by stopping watchdogs.
>  */
> atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
>   }
> 
>   /*
>* Help to reboot a safe way.
>*/
>   atomic_notifier_call_chain(&panic_reboot_notifier_list, 0, buf);
> 
>   [...]
> }
> 
> Any opinion?
> Do the notifier list names make sense?
> 

This was exposed very clearly, thanks. I agree with you, it's a good
approach, and we can evolve that during the implementation phase, like
"function A is not good in the hypervisor list because of this and
that", so we move it to the reboot list. Also, name of the lists is not
so relevant, might evolve in the implementation phase - I personally
liked them, specially the "info" and "hypervisor" ones (reboot seems
good but not great heh).

So, what are the opinions from kdump maintainers about this idea?
Baoquan / Vivek / Dave, does it make sense to you? Do you have any
suggestions/concerns to add on top of Petr draft?

I prefer this refactor than the filter, certainly. If nobody else
working on that, I can try implementing that - it's very interesting.
The only thing I'd like to have first is an ACK from the kdump
maintainers about the general idea.

Cheers,


Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-02-08 Thread b...@redhat.com

On 02/08/22 at 03:51pm, Guilherme G. Piccoli wrote:
> On 28/01/2022 10:38, Petr Mladek wrote:
> > [...] On Thu 2022-01-27 14:16:20, Guilherme G. Piccoli wrote:
> > First, I am sorry for the very long mail. But the problem is really
> > complicated. I did my best to describe it a clean way.
> > 
> > I have discussed these problems with a colleague and he had some good
> > points. And my view evolved even further.
> 
> Thanks Petr for the very comprehensive and detailed email - this helps a
> lot in shaping the future of panic notifier(s)!
> 
> 
> > [...] 
> > I think about the following solution:
> > 
> > + split the notifiers into three lists:
> > 
> > + info: stop watchdogs, provide extra info
> > + hypervisor: poke hypervisor
> > + reboot: actions needed only when crash dump did not happen
> > 
> > + allow to call hypervisor notifiers before or after kdump
> > 
> > + stop CPUs before kdump when either hypervisor notifiers or
> >   kmsg_dump is enabled
> > 
> > Note that it still allows to call kdump as the first action when
> > hypervisor notifiers are called after kdump and no kmsg dumper
> > is registered.
> > 
> > 
> > void panic(void)
> > {
> > [...]
> > 
> > if (crash_kexec_post_hypervisor || panic_print || enabled_kmsg_dump()) {
> > /*
> >  * Stop CPUs when some extra action is required before
> >  * crash dump. We will need architecture dependent extra
> >  * works in addition to stopping other CPUs.
> >  */
> >  crash_smp_send_stop();
> >  cpus_stopped = true;
> > }
> > 
> > if (crash_kexec_post_hypervisor) {
> >   /* Tell hypervisor about the panic */
> >   atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> > 0, buf);
> > }
> > 
> > if (enabled_kmsg_dump) {
> >   /*
> >* Print extra info by notifiers.
> >* Prevent rumors, for example, by stopping watchdogs.
> >*/
> >   atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > }
> > 
> > /* Optional extra info */
> > panic_printk_sys_info();
> > 
> > /* No dumper by default */
> > kmsg_dump();
> > 
> > /* Used only when crash kernel loaded */
> > __crash_kexec(NULL);
> > 
> > if (!cpus_stopped) {
> > /*
> >  * Note smp_send_stop is the usual smp shutdown function, which
> >  * unfortunately means it may not be hardened to work in a
> >  * panic situation.
> >  */
> > smp_send_stop();
> > }
> > 
> > if (!crash_kexec_post_hypervisor) {
> >   /* Tell hypervisor about the panic */
> >   atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> > 0, buf);
> > }
> > 
> > if (!enabled_kmsg_dump) {
> >   /*
> >* Print extra info by notifiers.
> >* Prevent rumors, for example, by stopping watchdogs.
> >*/
> >   atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > }
> > 
> > /*
> >  * Help to reboot a safe way.
> >  */
> > atomic_notifier_call_chain(&panic_reboot_notifier_list, 0, buf);
> > 
> > [...]
> > }
> > 
> > Any opinion?
> > Do the notifier list names make sense?
> > 
> 
> This was exposed very clearly, thanks. I agree with you, it's a good
> approach, and we can evolve that during the implementation phase, like
> "function A is not good in the hypervisor list because of this and
> that", so we move it to the reboot list. Also, name of the lists is not
> so relevant, might evolve in the implementation phase - I personally
> liked them, specially the "info" and "hypervisor" ones (reboot seems
> good but not great heh).
> 
> So, what are the opinions from kdump maintainers about this idea?
> Baoquan / Vivek / Dave, does it make sense to you? Do you have any
> suggestions/concerns to add on top of Petr draft?

Yeah, it's reasonable. As I replied to Michael in another thread, I
think splitting the current notifier list is a good idea. At least the
code to archieve hyper-V's goal with panic_notifier is a little odd and
should be taken out and execute w/o conditional before kdump, and maybe
some others Petr has combed out.

For those which will be switched on with the need of adding panic_notifier
or panic_print into cmdline, the heavy users like HATAYAMA and Masa can
help check.

For Petr's draft code, does it mean hyper-V need another knob to trigger
the needed notifiers? Will you go with the draft direclty? Hyper-V now
runs panic notifiers by default, just a reminder.

> 
> I prefer this refactor than the filter, certainly. If nobody else
> working on that, I can try implementing that - it's very interesting.
> The only thing I'd like to have first is an ACK from the kdump
> maintainers about the general idea.
> 
> Cheers,
> 
> 
> Guilherme
> 


___

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-02-10 Thread Guilherme G. Piccoli

On 08/02/2022 21:31, b...@redhat.com wrote:
> [...]
>> So, what are the opinions from kdump maintainers about this idea?
>> Baoquan / Vivek / Dave, does it make sense to you? Do you have any
>> suggestions/concerns to add on top of Petr draft?
> 
> Yeah, it's reasonable. As I replied to Michael in another thread, I
> think splitting the current notifier list is a good idea. At least the
> code to archieve hyper-V's goal with panic_notifier is a little odd and
> should be taken out and execute w/o conditional before kdump, and maybe
> some others Petr has combed out.
> 
> For those which will be switched on with the need of adding panic_notifier
> or panic_print into cmdline, the heavy users like HATAYAMA and Masa can
> help check.
> 
> For Petr's draft code, does it mean hyper-V need another knob to trigger
> the needed notifiers? Will you go with the draft direclty? Hyper-V now
> runs panic notifiers by default, just a reminder.
> 

Hi Baoquan, thanks for your comments.

I'll need to study the Hyper-V code and how it's done today - I guess
most part of this implementation will be studying the notifiers we have
currently, split them among the 3 new notifiers and comb them into
patches, so they can be reviewed for all relevant maintainers (who know
the code we are changing).

I'm not sure if I go directly with the draft, likely it'll have some
changes, but the draft should be the skeleton of the new implementation.
Specially if you/other kdump maintainers agree it's a good idea =)

Cheers,

Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

RE: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-02-10 Thread Michael Kelley (LINUX)

From: Guilherme G. Piccoli  Sent: Thursday, February 10, 
2022 8:40 AM
> 
> On 08/02/2022 21:31, b...@redhat.com wrote:
> > [...]
> >> So, what are the opinions from kdump maintainers about this idea?
> >> Baoquan / Vivek / Dave, does it make sense to you? Do you have any
> >> suggestions/concerns to add on top of Petr draft?
> >
> > Yeah, it's reasonable. As I replied to Michael in another thread, I
> > think splitting the current notifier list is a good idea. At least the
> > code to archieve hyper-V's goal with panic_notifier is a little odd and
> > should be taken out and execute w/o conditional before kdump, and maybe
> > some others Petr has combed out.
> >
> > For those which will be switched on with the need of adding panic_notifier
> > or panic_print into cmdline, the heavy users like HATAYAMA and Masa can
> > help check.
> >
> > For Petr's draft code, does it mean hyper-V need another knob to trigger
> > the needed notifiers? Will you go with the draft direclty? Hyper-V now
> > runs panic notifiers by default, just a reminder.
> >
> 
> Hi Baoquan, thanks for your comments.
> 
> I'll need to study the Hyper-V code and how it's done today -

Let me know if you need any assistance or explanation as you look
at the Hyper-V code.

Michael Kelley
Principal SW Engineer
Linux Systems Group
Microsoft Corporation

> I guess
> most part of this implementation will be studying the notifiers we have
> currently, split them among the 3 new notifiers and comb them into
> patches, so they can be reviewed for all relevant maintainers (who know
> the code we are changing).
> 
> I'm not sure if I go directly with the draft, likely it'll have some
> changes, but the draft should be the skeleton of the new implementation.
> Specially if you/other kdump maintainers agree it's a good idea =)
> 
> Cheers,
> 
> 
> Guilherme
___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-02-10 Thread Guilherme G. Piccoli

On 10/02/2022 14:26, Michael Kelley (LINUX) wrote:
> From: Guilherme G. Piccoli  Sent: Thursday, February 10, 
> 2022 8:40 AM
> [...]>> I'll need to study the Hyper-V code and how it's done today -
> 
> Let me know if you need any assistance or explanation as you look
> at the Hyper-V code.
>
Perfect Michael, thanks a lot! Much appreciated =)

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-06 Thread Guilherme G. Piccoli

On 28/01/2022 10:38, Petr Mladek wrote:
> [...] 
> I think about the following solution:
> 
> + split the notifiers into three lists:
> 
>   + info: stop watchdogs, provide extra info
>   + hypervisor: poke hypervisor
>   + reboot: actions needed only when crash dump did not happen
> 
> + allow to call hypervisor notifiers before or after kdump
> 
> + stop CPUs before kdump when either hypervisor notifiers or
>   kmsg_dump is enabled
> 
> Note that it still allows to call kdump as the first action when
> hypervisor notifiers are called after kdump and no kmsg dumper
> is registered.
> 
> 
> void panic(void)
> {
>   [...]
> 
>   if (crash_kexec_post_hypervisor || panic_print || enabled_kmsg_dump()) {
>   /*
>* Stop CPUs when some extra action is required before
>* crash dump. We will need architecture dependent extra
>* works in addition to stopping other CPUs.
>*/
>crash_smp_send_stop();
>cpus_stopped = true;
>   }
> 
>   if (crash_kexec_post_hypervisor) {
> /* Tell hypervisor about the panic */
> atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> 0, buf);
>   }
> 
>   if (enabled_kmsg_dump) {
> /*
>  * Print extra info by notifiers.
>  * Prevent rumors, for example, by stopping watchdogs.
>  */
> atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
>   }
> 
>   /* Optional extra info */
>   panic_printk_sys_info();
> 
>   /* No dumper by default */
>   kmsg_dump();
> 
>   /* Used only when crash kernel loaded */
>   __crash_kexec(NULL);
> 
>   if (!cpus_stopped) {
>   /*
>* Note smp_send_stop is the usual smp shutdown function, which
>* unfortunately means it may not be hardened to work in a
>* panic situation.
>*/
>   smp_send_stop();
>   }
> 
>   if (!crash_kexec_post_hypervisor) {
> /* Tell hypervisor about the panic */
> atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> 0, buf);
>   }
> 
>   if (!enabled_kmsg_dump) {
> /*
>  * Print extra info by notifiers.
>  * Prevent rumors, for example, by stopping watchdogs.
>  */
> atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
>   }
> 
>   /*
>* Help to reboot a safe way.
>*/
>   atomic_notifier_call_chain(&panic_reboot_notifier_list, 0, buf);
> 
>   [...]
> }
> 
> Any opinion?
> Do the notifier list names make sense?
> 
> Best Regards,
> Petr


Hi folks, I'm working on this now, and while looking into it I've
noticed that we have the concept of "priority" in the notifiers list.
Basically, you can order the calls the way it fits best, priority is an
integer and must the set in the moment of registration, it's up to the
users of the notifiers to set it and enforce the ordering.

So what I'm thinking is: currently, only 3 or 4 panic notifiers make use
of that. What if, since we're re-working this, we add a priority for
*all* notifiers and enforce its usage? This way we guarantee
consistency, it'd make debug easier and maybe even more important:
having all the notifiers and their priorities in a list present in the
header file would be great documentation about all the existing
notifiers and how they are called - today this information is quite
obscure and requires lots of code grepping!

Let me know your thoughts Petr / Baoquan - it would add slightly more
code / complexity, but in my opinion the payback is very good.
Cheers,


Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-06 Thread b...@redhat.com

On 03/06/22 at 11:21am, Guilherme G. Piccoli wrote:
> On 28/01/2022 10:38, Petr Mladek wrote:
> > [...] 
> > I think about the following solution:
> > 
> > + split the notifiers into three lists:
> > 
> > + info: stop watchdogs, provide extra info
> > + hypervisor: poke hypervisor
> > + reboot: actions needed only when crash dump did not happen
> > 
> > + allow to call hypervisor notifiers before or after kdump
> > 
> > + stop CPUs before kdump when either hypervisor notifiers or
> >   kmsg_dump is enabled
> > 
> > Note that it still allows to call kdump as the first action when
> > hypervisor notifiers are called after kdump and no kmsg dumper
> > is registered.
> > 
> > 
> > void panic(void)
> > {
> > [...]
> > 
> > if (crash_kexec_post_hypervisor || panic_print || enabled_kmsg_dump()) {
> > /*
> >  * Stop CPUs when some extra action is required before
> >  * crash dump. We will need architecture dependent extra
> >  * works in addition to stopping other CPUs.
> >  */
> >  crash_smp_send_stop();
> >  cpus_stopped = true;
> > }
> > 
> > if (crash_kexec_post_hypervisor) {
> >   /* Tell hypervisor about the panic */
> >   atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> > 0, buf);
> > }
> > 
> > if (enabled_kmsg_dump) {
> >   /*
> >* Print extra info by notifiers.
> >* Prevent rumors, for example, by stopping watchdogs.
> >*/
> >   atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > }
> > 
> > /* Optional extra info */
> > panic_printk_sys_info();
> > 
> > /* No dumper by default */
> > kmsg_dump();
> > 
> > /* Used only when crash kernel loaded */
> > __crash_kexec(NULL);
> > 
> > if (!cpus_stopped) {
> > /*
> >  * Note smp_send_stop is the usual smp shutdown function, which
> >  * unfortunately means it may not be hardened to work in a
> >  * panic situation.
> >  */
> > smp_send_stop();
> > }
> > 
> > if (!crash_kexec_post_hypervisor) {
> >   /* Tell hypervisor about the panic */
> >   atomic_notifier_call_chain(&panic_hypervisor_notifier_list, 
> > 0, buf);
> > }
> > 
> > if (!enabled_kmsg_dump) {
> >   /*
> >* Print extra info by notifiers.
> >* Prevent rumors, for example, by stopping watchdogs.
> >*/
> >   atomic_notifier_call_chain(&panic_info_notifier_list, 0, buf);
> > }
> > 
> > /*
> >  * Help to reboot a safe way.
> >  */
> > atomic_notifier_call_chain(&panic_reboot_notifier_list, 0, buf);
> > 
> > [...]
> > }
> > 
> > Any opinion?
> > Do the notifier list names make sense?
> > 
> > Best Regards,
> > Petr
> 
> 
> Hi folks, I'm working on this now, and while looking into it I've
> noticed that we have the concept of "priority" in the notifiers list.
> Basically, you can order the calls the way it fits best, priority is an
> integer and must the set in the moment of registration, it's up to the
> users of the notifiers to set it and enforce the ordering.
> 
> So what I'm thinking is: currently, only 3 or 4 panic notifiers make use
> of that. What if, since we're re-working this, we add a priority for
> *all* notifiers and enforce its usage? This way we guarantee
> consistency, it'd make debug easier and maybe even more important:
> having all the notifiers and their priorities in a list present in the
> header file would be great documentation about all the existing
> notifiers and how they are called - today this information is quite
> obscure and requires lots of code grepping!
> 
> Let me know your thoughts Petr / Baoquan - it would add slightly more
> code / complexity, but in my opinion the payback is very good.
> Cheers,

The ideal situation is each panic notifier has an order or index to
indicate its priority. Wondering how to make it. What I think of is
copying initcall. We have several priorities, at the same priority,
execution sequence is not important. Not sure if I get your point.

~~~
#define core_initcall(fn)   __define_initcall(fn, 1)
#define core_initcall_sync(fn)  __define_initcall(fn, 1s)
..
#define late_initcall(fn)   __define_initcall(fn, 7)
#define late_initcall_sync(fn)  __define_initcall(fn, 7s)


___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-07 Thread Guilherme G. Piccoli

On 07/03/2022 00:42, b...@redhat.com wrote:
> [...]
>> Let me know your thoughts Petr / Baoquan - it would add slightly more
>> code / complexity, but in my opinion the payback is very good.
>> Cheers,
> 
> The ideal situation is each panic notifier has an order or index to
> indicate its priority. Wondering how to make it. What I think of is
> copying initcall. We have several priorities, at the same priority,
> execution sequence is not important. Not sure if I get your point.
> 
> ~~~
> #define core_initcall(fn)   __define_initcall(fn, 1)
> #define core_initcall_sync(fn)  __define_initcall(fn, 1s)
> ..
> #define late_initcall(fn)   __define_initcall(fn, 7)
> #define late_initcall_sync(fn)  __define_initcall(fn, 7s)
> 

Hi Baoquan, thanks for you consideration! In fact, the notifiers
infrastructure already have a mechanism of ordering, my idea is to make
use of that. It's not that different from the initcall system...

For instance, the code in the notifier register function checks for the
priority field in the notifier block:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/kernel/notifier.c#n31
.

For example, the Xen panic notifier is one of the few blocks that make
use of that, currently:

static struct notifier_block xen_panic_block = {
.notifier_call = xen_panic_event,
.priority = INT_MIN
};

(see
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/arch/x86/xen/enlighten.c#n286)

In this case, xen_panic_event() will be the latest one to run. My idea
is to make use of that in *all* panic notifiers, having a table/list on
panic_notifier.h (defines or enum, I'll think about that when writing
this part) so all notifiers are documented and the ordering is clear and
enforced.

Makes sense to you?
Cheers,

Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-07 Thread b...@redhat.com

On 03/07/22 at 10:11am, Guilherme G. Piccoli wrote:
> On 07/03/2022 00:42, b...@redhat.com wrote:
> > [...]
> >> Let me know your thoughts Petr / Baoquan - it would add slightly more
> >> code / complexity, but in my opinion the payback is very good.
> >> Cheers,
> > 
> > The ideal situation is each panic notifier has an order or index to
> > indicate its priority. Wondering how to make it. What I think of is
> > copying initcall. We have several priorities, at the same priority,
> > execution sequence is not important. Not sure if I get your point.
> > 
> > ~~~
> > #define core_initcall(fn)   __define_initcall(fn, 1)
> > #define core_initcall_sync(fn)  __define_initcall(fn, 1s)
> > ..
> > #define late_initcall(fn)   __define_initcall(fn, 7)
> > #define late_initcall_sync(fn)  __define_initcall(fn, 7s)
> > 
> 
> Hi Baoquan, thanks for you consideration! In fact, the notifiers
> infrastructure already have a mechanism of ordering, my idea is to make
> use of that. It's not that different from the initcall system...

Ah, sorry, I even didn't notice that. That's awesome if we can make use
of that. While I still have concerns:

1) about those we have decided to take out from panic notifier list and
put before kdump, e.g the Hypver-V notifier, how will we do with it? Are
we going to handle them as we have discussed?

2) Combing and settling priority for all existing panic notifier looks
great, even though it will take some effort. How about the later newly
added one? How can we guarantee that those new notifiers are getting
appropriate priority to mark their order? Sometime we even don't know
a new panic notifier is added since code change may be made in any
component or driver.

> 
> For instance, the code in the notifier register function checks for the
> priority field in the notifier block:
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/kernel/notifier.c#n31
> .
> 
> For example, the Xen panic notifier is one of the few blocks that make
> use of that, currently:
> 
> static struct notifier_block xen_panic_block = {
>   .notifier_call = xen_panic_event,
>   .priority = INT_MIN
> };
> 
> (see
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/arch/x86/xen/enlighten.c#n286)
> 
> In this case, xen_panic_event() will be the latest one to run. My idea
> is to make use of that in *all* panic notifiers, having a table/list on
> panic_notifier.h (defines or enum, I'll think about that when writing
> this part) so all notifiers are documented and the ordering is clear and
> enforced.
> 
> Makes sense to you?
> Cheers,
> 
> 
> Guilherme
> 


___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-07 Thread Guilherme G. Piccoli

On 07/03/2022 11:04, b...@redhat.com wrote:
> [...]
> Ah, sorry, I even didn't notice that. That's awesome if we can make use
> of that. While I still have concerns:
> 

Thanks, nice that you liked the idea.

> 1) about those we have decided to take out from panic notifier list and
> put before kdump, e.g the Hypver-V notifier, how will we do with it? Are
> we going to handle them as we have discussed?
> 

While implementing that I will think of something, but if
understood/remember correctly Hyper-V gonna be one of the first to run
in the first notifier list proposed by Petr - so we might still use
ordering by priority there, having Hyper-V being the first heh

> 2) Combing and settling priority for all existing panic notifier looks
> great, even though it will take some effort. How about the later newly
> added one? How can we guarantee that those new notifiers are getting
> appropriate priority to mark their order? Sometime we even don't know
> a new panic notifier is added since code change may be made in any
> component or driver.
> 

This is a great point! How to do it? One idea is to have a special
registering function for panic notifiers that checks for priority field
missing, and good documentation is a good idea as well, always.

But if you / others have other suggestions, let me know - appreciate that.
Cheers,


Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-08 Thread Petr Mladek

On Mon 2022-03-07 11:25:30, Guilherme G. Piccoli wrote:
> On 07/03/2022 11:04, b...@redhat.com wrote:
> > [...]
> > Ah, sorry, I even didn't notice that. That's awesome if we can make use
> > of that. While I still have concerns:
> > 
> 
> Thanks, nice that you liked the idea.
> 
> > 1) about those we have decided to take out from panic notifier list and
> > put before kdump, e.g the Hypver-V notifier, how will we do with it? Are
> > we going to handle them as we have discussed?
> > 
> 
> While implementing that I will think of something, but if
> understood/remember correctly Hyper-V gonna be one of the first to run
> in the first notifier list proposed by Petr - so we might still use
> ordering by priority there, having Hyper-V being the first heh

My understanding is that the problem is not a priority but an ordering
against other operations.

Namely, Hyper-V must be called even before crash dump. Some others before
kmsg_dump(). And the rest only when the crash dump is not called at all.


> > 2) Combing and settling priority for all existing panic notifier looks
> > great, even though it will take some effort. How about the later newly
> > added one? How can we guarantee that those new notifiers are getting
> > appropriate priority to mark their order? Sometime we even don't know
> > a new panic notifier is added since code change may be made in any
> > component or driver.
> > 
> 
> This is a great point! How to do it? One idea is to have a special
> registering function for panic notifiers that checks for priority field
> missing, and good documentation is a good idea as well, always.
> 
> But if you / others have other suggestions, let me know - appreciate that.
> Cheers,

Honestly, I am not that keen about enforcing the priorities.

It would make sense only when the ordering is really important.
Then there should be some rules. It should be obvious why
something has to be done earlier than something else.

>From my POV, it is just another complexity. Someone will need to
assign the priority to the existing notifiers. People will wonder
about it for newly added notifiers.

Reproducibility seems to be the only motivation. Is it really a
problem? Do the notifiers affect each other?

Also the notifiers are typically registered by some early boot
code. It will define some ordering out of box.

Best Regards,
Petr

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

2022-03-08 Thread Guilherme G. Piccoli

On 08/03/2022 09:54, Petr Mladek wrote:
> [...]
> Honestly, I am not that keen about enforcing the priorities.
> 
> It would make sense only when the ordering is really important.
> Then there should be some rules. It should be obvious why
> something has to be done earlier than something else.
> 
> From my POV, it is just another complexity. Someone will need to
> assign the priority to the existing notifiers. People will wonder
> about it for newly added notifiers.
> 
> Reproducibility seems to be the only motivation. Is it really a
> problem? Do the notifiers affect each other?
> 
> Also the notifiers are typically registered by some early boot
> code. It will define some ordering out of box.
> 
> Best Regards,
> Petr

OK Petr, makes sense. I feel that one the pros of the approach would be
to document all the notifiers in the header, but I guess nothing
prevents me to do that heh
I don't need enforcing the priorities for this.

I liked the idea of having everything "reproducible" as you said, but we
could face the problem of how to order some independent notifiers...
would be something "ad-hoc".

I'll progress here without priorities for now, if somebody see an
advantage for that, we can use it.

Thanks for your points,

Guilherme

___
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

[PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

RE: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

Re: [PATCH V4] notifier/panic: Introduce panic_notifier_filter

34 matches

Site Navigation

Mail list logo

Footer information