On 7/16/19 5:00 AM, Xiaoming Ni wrote:
> On 2019/7/15 13:38, Vasily Averin wrote:
>> On 7/14/19 5:45 AM, Xiaoming Ni wrote:
>>> On 2019/7/12 22:07, gre...@linuxfoundation.org wrote:
>>>> On Fri, Jul 12, 2019 at 09:11:57PM +0800, Xiaoming Ni wrote:
>>>>> On 2019/7/11 21:57, Vasily Averin wrote:
>>>>>> On 7/11/19 4:55 AM, Nixiaoming wrote:
>>>>>>> On Wed, July 10, 2019 1:49 PM Vasily Averin wrote:
>>>>>>>> On 7/10/19 6:09 AM, Xiaoming Ni wrote:
>>>>>>>>> Registering the same notifier to a hook repeatedly can cause the hook
>>>>>>>>> list to form a ring or lose other members of the list.
>>>>>>>>
>>>>>>>> I think is not enough to _prevent_ 2nd register attempt,
>>>>>>>> it's enough to detect just attempt and generate warning to mark host 
>>>>>>>> in bad state.
>>>>>>>>
>>>>>>>
>>>>>>> Duplicate registration is prevented in my patch, not just "mark host in 
>>>>>>> bad state"
>>>>>>>
>>>>>>> Duplicate registration is checked and exited in 
>>>>>>> notifier_chain_cond_register()
>>>>>>>
>>>>>>> Duplicate registration was checked in notifier_chain_register() but 
>>>>>>> only 
>>>>>>> the alarm was triggered without exiting. added by commit 
>>>>>>> 831246570d34692e 
>>>>>>> ("kernel/notifier.c: double register detection")
>>>>>>>
>>>>>>> My patch is like a combination of 831246570d34692e and 
>>>>>>> notifier_chain_cond_register(),
>>>>>>>  which triggers an alarm and exits when a duplicate registration is 
>>>>>>> detected.
>>>>>>>
>>>>>>>> Unexpected 2nd register of the same hook most likely will lead to 2nd 
>>>>>>>> unregister,
>>>>>>>> and it can lead to host crash in any time: 
>>>>>>>> you can unregister notifier on first attempt it can be too early, it 
>>>>>>>> can be still in use.
>>>>>>>> on the other hand you can never call 2nd unregister at all.
>>>>>>>
>>>>>>> Since the member was not added to the linked list at the time of the 
>>>>>>> second registration, 
>>>>>>> no linked list ring was formed. 
>>>>>>> The member is released on the first unregistration and -ENOENT on the 
>>>>>>> second unregistration.
>>>>>>> After patching, the fault has been alleviated
>>>>>>
>>>>>> You are wrong here.
>>>>>> 2nd notifier's registration is a pure bug, this should never happen.
>>>>>> If you know the way to reproduce this situation -- you need to fix it. 
>>>>>>
>>>>>> 2nd registration can happen in 2 cases:
>>>>>> 1) missed rollback, when someone forget to call unregister after 
>>>>>> successfull registration, 
>>>>>> and then tried to call register again. It can lead to crash for example 
>>>>>> when according module will be unloaded.
>>>>>> 2) some subsystem is registered twice, for example from  different 
>>>>>> namespaces.
>>>>>> in this case unregister called during sybsystem cleanup in first 
>>>>>> namespace will incorrectly remove notifier used 
>>>>>> in second namespace, it also can lead to unexpacted behaviour.
>>>>>>
>>>>> So in these two cases, is it more reasonable to trigger BUG() directly 
>>>>> when checking for duplicate registration ?
>>>>> But why does current notifier_chain_register() just trigger WARN() 
>>>>> without exiting ?
>>>>> notifier_chain_cond_register() direct exit without triggering WARN() ?
>>>>
>>>> It should recover from this, if it can be detected.  The main point is
>>>> that not all apis have to be this "robust" when used within the kernel
>>>> as we do allow for the callers to know what they are doing :)
>>>>
>>> In the notifier_chain_register(), the condition ( (*nl) == n) is the same 
>>> registration of the same hook.
>>>  We can intercept this situation and avoid forming a linked list ring to 
>>> make the API more rob
>>
>> Once again -- yes, you CAN prevent list corruption, but you CANNOT recover 
>> the host and return it back to safe state.
>> If double register event was detected -- it means you have bug in kernel.
>>
>> Yes, you can add BUG here and crash the host immediately, but I prefer to 
>> use warning in such situations.
>>
>>>> If this does not cause any additional problems or slow downs, it's
>>>> probably fine to add.
>>>>
>>> Notifier_chain_register() is not a system hotspot function.
>>> At the same time, there is already a WARN_ONCE judgment. There is no new 
>>> judgment in the new patch.
>>> It only changes the processing under the condition of (*nl) == n, which 
>>> will not cause performance problems.
>>> At the same time, avoiding the formation of a link ring can make the system 
>>> more robust.
>>
>> I disagree, 
>> yes, node will have correct list, but anyway node will work wrong and can 
>> crash the host in any time.
> 
> Sorry, my description is not accurate.
> 
> My patch feature does not prevent users from repeatedly registering hooks.
> But avoiding the chain ring caused by the user repeatedly registering the hook
> 
> There are no modules for duplicate registration hooks in the current system.
> But considering that not all modules are in the kernel source tree,
> In order to improve the robustness of the kernel API, we should avoid the 
> linked list ring caused by repeated registration.
> Or in order to improve the efficiency of problem location, when the duplicate 
> registration is checked, the system crashes directly.

Detect of duplicate registration means an unrecoverable error,
from this point of view it makes sense to replace WARN_ONCE by BUG_ON.
 
> On the other hand, the difference between notifier_chain_register() and 
> notifier_chain_cond_register() for duplicate registrations is confusing:
> Blocking the formation of the linked list ring in 
> notifier_chain_cond_register()
> There is no interception of the linked list ring in 
> notifier_chain_register(), just an alarm.
> Give me the illusion: Isn't notifier_chain_register() allowed to create a 
> linked list ring?

I'm not sure that I understood your question correctly but will try to answer.
As far as I see all callers of notifier_chain_cond_register checks return 
value, expect possible failure and handle it somehow.
On the other hand callers of notifier_chain_register() in many cases do not 
check return value and always expect success.
The goal of original WARN_ONCE -- to detect possible misuse of notifiers and it 
seems for me it correctly handles this task.

Reply via email to