Greg Kroah-Hartman <gre...@linuxfoundation.org> writes:

> On Tue, Nov 17, 2015 at 01:04:19AM +0100, Nicolai Stange wrote:
>> Similar problems exist at all call sites of kset_register(), that is
>> in drivers/base, fs/ext4 and in fs/ocfs2.
>
> Yes, but those calls all succeed, so this isn't a problem in the "real"
> world :)
>

No, it is not an issue at runtime. It's more about puzzling code readers
like me.

>> The deeper cause behind this are the current semantics of the kset
>> initialization, creation and registration functions which differ from the
>> ones of their corresponding kobject counterparts.
>> Namely,
>> - there is no _exported_ kset initialization function,
>> - the (not exported) kset_create() creates a halfway initialized kset
>>   object,
>> - and the kset_register() finishes a kset object's initialization but
>>   expects its name to already have been set at its entry.
>> 
>> To fix this:
>> - Export kset_init() and let it mimic the semantics of kobject_init().
>>   Especially it takes and sets a kobj_type which makes the kset object
>>   kset_put()able upon return.
>> - Let the internal kset_create() do the complete initialization by means
>>   of kset_init().
>> - Remove any kset initialization from kset_register(): it expects a
>>   readily initialized kset object now. Furthermore, let kset_register()
>>   take and set the kset object's name. This resembles the behaviour of
>>   kobject_add(). In analogy to the latter, require the caller to call
>>   kset_put() or kset_unregister() unconditionally, even on failure.
>> 
>> At the call sites of kset_register():
>> - Replace the open coded kset object initialization by a call to
>>   kset_init().
>> - Remove the explicit name setting and let the kset_register() call do
>>   that work.
>> - Replace any kfree()s by kset_put()s where appropriate.
>> 
>> Signed-off-by: Nicolai Stange <nicsta...@gmail.com>
>> ---
>> To the maintainers of ext4 and ocfs2: although several subsystems are
>> touched, the changes come in one single patch in order to keep them 
>> bisectable.
>> 
>> 
>>  drivers/base/bus.c         | 14 ++-----
>>  drivers/base/class.c       | 39 ++++++++++++++-----
>>  fs/ext4/sysfs.c            | 13 +++----
>>  fs/ocfs2/cluster/masklog.c | 13 ++++---
>>  include/linux/kobject.h    |  6 ++-
>>  lib/kobject.c              | 94 
>> ++++++++++++++++++++++++++++------------------
>>  6 files changed, 110 insertions(+), 69 deletions(-)
>> 
>> diff --git a/drivers/base/bus.c b/drivers/base/bus.c
>> index 5005924..a81227c 100644
>> --- a/drivers/base/bus.c
>> +++ b/drivers/base/bus.c
>> @@ -900,15 +900,11 @@ int bus_register(struct bus_type *bus)
>>  
>>      BLOCKING_INIT_NOTIFIER_HEAD(&priv->bus_notifier);
>>  
>> -    retval = kobject_set_name(&priv->subsys.kobj, "%s", bus->name);
>> -    if (retval)
>> -            goto out;
>> -
>> -    priv->subsys.kobj.kset = bus_kset;
>> -    priv->subsys.kobj.ktype = &bus_ktype;
>>      priv->drivers_autoprobe = 1;
>>  
>> -    retval = kset_register(&priv->subsys);
>> +    kset_init(&priv->subsys, &bus_ktype, NULL);
>> +    priv->subsys.kobj.kset = bus_kset;
>> +    retval = kset_register(&priv->subsys, bus->name, NULL);
>>      if (retval)
>>              goto out;
>>  
>> @@ -955,10 +951,8 @@ bus_drivers_fail:
>>  bus_devices_fail:
>>      bus_remove_file(bus, &bus_attr_uevent);
>>  bus_uevent_fail:
>> -    kset_unregister(&bus->p->subsys);
>>  out:
>> -    kfree(bus->p);
>> -    bus->p = NULL;
>> +    kset_unregister(&bus->p->subsys);
>>      return retval;
>>  }
>>  EXPORT_SYMBOL_GPL(bus_register);
>> diff --git a/drivers/base/class.c b/drivers/base/class.c
>> index 71059e3..94a5b040 100644
>> --- a/drivers/base/class.c
>> +++ b/drivers/base/class.c
>> @@ -19,6 +19,7 @@
>>  #include <linux/slab.h>
>>  #include <linux/genhd.h>
>>  #include <linux/mutex.h>
>> +#include <linux/printk.h>
>>  #include "base.h"
>>  
>>  #define to_class_attr(_attr) container_of(_attr, struct class_attribute, 
>> attr)
>> @@ -82,6 +83,24 @@ static struct kobj_type class_ktype = {
>>      .child_ns_type  = class_child_ns_type,
>>  };
>>  
>> +static void glue_dirs_release_dummy(struct kobject *kobj)
>> +{
>> +    /*
>> +     * The glue_dirs kset member of struct subsys_private is never
>> +     * registered and thus, never unregistered.
>> +     * This release function is a dummy to make kset_init() happy.
>> +     */
>> +    pr_err(
>> +    "class (%p): unexpected kset_put() on glue_dirs, something is broken.",
>> +            container_of(kobj, struct subsys_private,
>> +                            glue_dirs.kobj)->class);
>> +    dump_stack();
>
> How can this ever happen?
>

Actually it can't.

To quote you from Documentation/kobject.txt:

  Do not try to get rid of this warning by providing an "empty" release
  function; you will be mocked mercilessly by the kobject maintainer if
  you attempt this.

Well, the glue_dirs_release_dummy() is quite empty, but my reasoning was
like this:
1. It is a good thing to initialize glue_dirs by kset_init().
2. In the current patch's version, kset_init() does not allow for NULL
   ktype's, just as kobject_init() doesn't. Thus, I need some dummy
   kobj_ktype for the glue_dirs().
3. From the usually harsh reactions against BUG(), I conclude that
   crashing the kernel due to logic errors is not a good thing to do.
   -> If all of the above is true, it certainly would be better to
   provide some sort of release() for the kobj_ktype. 

I do agree, that a simple WARN() would have been better than the above
dummy's implementation.

On the other hand, I'm not exactly sure whether requiring a non-NULL
ktype at kset_init() is a good idea at all. Given that there is
currently only one single use of kset_init() not providing a non-trivial
kobj_ktype and that kobject_init() does require them all to be non-NULL,
I let kset_init() require this, too.

Summarizing, the options are either of
- do not use kset_init() for the glue_dirs initialization, but revert to
  the open coding.
- make kset_init() accept NULL ktype's.
- introduce another initialization function, maybe
  "kset_init_internal()" which accepts NULL for the ktype. I had to
  export this internal function though.
- keep the above dummy release function, but make it even more clear
  that it is a dummy. Furthermore, use WARN() instead of
  pr_err()/dump_stack().

Just tell me which of the options to pick and I will do that.


>> +}
>> +
>> +static struct kobj_type glue_dirs_ktype = {
>> +    .release = glue_dirs_release_dummy,
>> +};
>> +
>>  /* Hotplug events for classes go to the class subsys */
>>  static struct kset *class_kset;
>>  
>> @@ -175,18 +194,14 @@ int __class_register(struct class *cls, struct 
>> lock_class_key *key)
>>              return -ENOMEM;
>>      klist_init(&cp->klist_devices, klist_class_dev_get, 
>> klist_class_dev_put);
>>      INIT_LIST_HEAD(&cp->interfaces);
>> -    kset_init(&cp->glue_dirs);
>> +    kset_init(&cp->glue_dirs, &glue_dirs_ktype, NULL);
>>      __mutex_init(&cp->mutex, "subsys mutex", key);
>> -    error = kobject_set_name(&cp->subsys.kobj, "%s", cls->name);
>> -    if (error) {
>> -            kfree(cp);
>> -            return error;
>> -    }
>>  
>>      /* set the default /sys/dev directory for devices of this class */
>>      if (!cls->dev_kobj)
>>              cls->dev_kobj = sysfs_dev_char_kobj;
>>  
>> +    kset_init(&cp->subsys, &class_ktype, NULL);
>>  #if defined(CONFIG_BLOCK)
>>      /* let the block class directory show up in the root of sysfs */
>>      if (!sysfs_deprecated || cls != &block_class)
>> @@ -194,13 +209,19 @@ int __class_register(struct class *cls, struct 
>> lock_class_key *key)
>>  #else
>>      cp->subsys.kobj.kset = class_kset;
>>  #endif
>> -    cp->subsys.kobj.ktype = &class_ktype;
>>      cp->class = cls;
>>      cls->p = cp;
>>  
>> -    error = kset_register(&cp->subsys);
>> +    error = kset_register(&cp->subsys, cls->name, NULL);
>>      if (error) {
>> -            kfree(cp);
>> +            /*
>> +             * class->release() would be called by cp->subsys'
>> +             * release function. Prevent this from happening in
>> +             * the error case by zeroing cp->class out.
>> +             */
>> +            cp->class = NULL;
>> +            cls->p = NULL;
>> +            kset_put(&cp->subsys);
>
> That seems pretty messy, why can't we allow the release to be called?  I
> don't think this is really correct :(

I don't know whether it is actually correct, but before the introduction
of this patch, the class->release() would not have been called on error
either. I did not want to change that behaviour and thus, I have put
this "messy" and admittedly ugly clear-the-class-pointer code into the
error handling above.

I will check whether calling class->release() on error would be a
problem for the callers of class_register() (there are 64 of them). If
it turns out not to be one, I could simply remove the above class
pointer purging.

However, for bisectability, I suggest to send this as a separate patch,
if any: this would change class_register()'s behaviour and is quite
unrelated to the kset stuff.

If you do not agree, please tell me.

> But overall I like the patch, nice job.  Any way to fix this last bit
> up?

Thank you very much for your feedback. I will wait for your input on the
two questions above regarding
- the dummy implementation of the glue_dirs' ktype and
- the unmessing of class_register() in a separate patch
and resend this patch accordingly then.

Thank you,

Nicolai Stange
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to