Re: 2.6.22.14 oops msg with commvault galaxy ?
Le vendredi 14 décembre 2007 à 09:28 -0800, Greg KH a écrit : > On Fri, Dec 14, 2007 at 10:37:39PM +0530, Dhaval Giani wrote: > > On Fri, Dec 14, 2007 at 08:26:42AM -0800, Greg KH wrote: > > > On Thu, Dec 13, 2007 at 09:21:26PM +0100, Ingo Molnar wrote: > > > > > > > > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > > > > > > > This one also fails to apply properly at the exact same place > > > > > > > > has Ingo's previously posted patch. Would need to backport his > > > > > > > > one. > > > > > > > > > > > > > > It depends on a completely reworked sysfs logic, I don't think it > > > > > > > makes any sense to backport that. > > > > > > > > > > > > well, if it fixes a live bug in a still supported stable kernel > > > > > > release... > > > > > > > > > > > > Vincent, could you try to just get rid of all actual uses of > > > > > > se->attr.owner, within fs/sysfs/*.c? Something like the patch > > > > > > below. > > > > > > (totally untested - might be fatally broken as well) > > > > > > > > > > How can you think that this is not needed? You can not remove it with > > > > > sysfs you are patching. Hope this explains it: > > > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced > > > > > > > > yeah - as i said it might be fatally broken (in fact it is). Do we > > > > understand why Vincent got the crashes with vanilla 2.6.22.14 ? > > > > > > No, and I can't seem to duplicate them here at all. > > > > > > Does anyone have a test case for this that I can work on trying to > > > duplicate? > > > > > > > If you apply CFS without my fix, and try to constantly check cpu_shares > > for a user who is logging and logging out, you should hit it. (That's > > what I was doing). > > Hm, how about a "vanilla 2.6.22.14 kernel _without_ any patches". > That's what I am most worried about :) Since I was getting the problem with both vanilla & CFS patched kernels and that, sadly, I don't have the time to do git bisect at the moment I decided to go ahead and prepare a full migration to 2.6.23 (I was hoping to skip directly to 2.6.24 but...). I can confirm at the moment that 2.6.23 works properly with Galaxy (just has 2.6.20 & 2.6.21 used to...). Thnx very much everyone for the help but sadly this bug will have to remain unresolved. > thanks, - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Fri, Dec 14, 2007 at 10:37:39PM +0530, Dhaval Giani wrote: > On Fri, Dec 14, 2007 at 08:26:42AM -0800, Greg KH wrote: > > On Thu, Dec 13, 2007 at 09:21:26PM +0100, Ingo Molnar wrote: > > > > > > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > > > > > This one also fails to apply properly at the exact same place > > > > > > > has Ingo's previously posted patch. Would need to backport his > > > > > > > one. > > > > > > > > > > > > It depends on a completely reworked sysfs logic, I don't think it > > > > > > makes any sense to backport that. > > > > > > > > > > well, if it fixes a live bug in a still supported stable kernel > > > > > release... > > > > > > > > > > Vincent, could you try to just get rid of all actual uses of > > > > > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > > > > > (totally untested - might be fatally broken as well) > > > > > > > > How can you think that this is not needed? You can not remove it with > > > > sysfs you are patching. Hope this explains it: > > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced > > > > > > yeah - as i said it might be fatally broken (in fact it is). Do we > > > understand why Vincent got the crashes with vanilla 2.6.22.14 ? > > > > No, and I can't seem to duplicate them here at all. > > > > Does anyone have a test case for this that I can work on trying to > > duplicate? > > > > If you apply CFS without my fix, and try to constantly check cpu_shares > for a user who is logging and logging out, you should hit it. (That's > what I was doing). Hm, how about a "vanilla 2.6.22.14 kernel _without_ any patches". That's what I am most worried about :) thanks, greg k-h -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Fri, Dec 14, 2007 at 08:26:42AM -0800, Greg KH wrote: > On Thu, Dec 13, 2007 at 09:21:26PM +0100, Ingo Molnar wrote: > > > > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > > > This one also fails to apply properly at the exact same place > > > > > > has Ingo's previously posted patch. Would need to backport his > > > > > > one. > > > > > > > > > > It depends on a completely reworked sysfs logic, I don't think it > > > > > makes any sense to backport that. > > > > > > > > well, if it fixes a live bug in a still supported stable kernel > > > > release... > > > > > > > > Vincent, could you try to just get rid of all actual uses of > > > > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > > > > (totally untested - might be fatally broken as well) > > > > > > How can you think that this is not needed? You can not remove it with > > > sysfs you are patching. Hope this explains it: > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced > > > > yeah - as i said it might be fatally broken (in fact it is). Do we > > understand why Vincent got the crashes with vanilla 2.6.22.14 ? > > No, and I can't seem to duplicate them here at all. > > Does anyone have a test case for this that I can work on trying to > duplicate? > If you apply CFS without my fix, and try to constantly check cpu_shares for a user who is logging and logging out, you should hit it. (That's what I was doing). -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, Dec 13, 2007 at 09:21:26PM +0100, Ingo Molnar wrote: > > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > This one also fails to apply properly at the exact same place > > > > > has Ingo's previously posted patch. Would need to backport his > > > > > one. > > > > > > > > It depends on a completely reworked sysfs logic, I don't think it > > > > makes any sense to backport that. > > > > > > well, if it fixes a live bug in a still supported stable kernel > > > release... > > > > > > Vincent, could you try to just get rid of all actual uses of > > > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > > > (totally untested - might be fatally broken as well) > > > > How can you think that this is not needed? You can not remove it with > > sysfs you are patching. Hope this explains it: > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced > > yeah - as i said it might be fatally broken (in fact it is). Do we > understand why Vincent got the crashes with vanilla 2.6.22.14 ? No, and I can't seem to duplicate them here at all. Does anyone have a test case for this that I can work on trying to duplicate? thanks, greg k-h -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, Dec 13, 2007 at 09:21:26PM +0100, Ingo Molnar wrote: > > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > This one also fails to apply properly at the exact same place > > > > > has Ingo's previously posted patch. Would need to backport his > > > > > one. > > > > > > > > It depends on a completely reworked sysfs logic, I don't think it > > > > makes any sense to backport that. > > > > > > well, if it fixes a live bug in a still supported stable kernel > > > release... > > > > > > Vincent, could you try to just get rid of all actual uses of > > > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > > > (totally untested - might be fatally broken as well) > > > > How can you think that this is not needed? You can not remove it with > > sysfs you are patching. Hope this explains it: > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced > > yeah - as i said it might be fatally broken (in fact it is). Do we > understand why Vincent got the crashes with vanilla 2.6.22.14 ? > My guess is some variables have probably been left uninitialized. I am a bit too scared to look into sysfs parts of the code now. -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
* Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > This one also fails to apply properly at the exact same place > > > > has Ingo's previously posted patch. Would need to backport his > > > > one. > > > > > > It depends on a completely reworked sysfs logic, I don't think it > > > makes any sense to backport that. > > > > well, if it fixes a live bug in a still supported stable kernel > > release... > > > > Vincent, could you try to just get rid of all actual uses of > > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > > (totally untested - might be fatally broken as well) > > How can you think that this is not needed? You can not remove it with > sysfs you are patching. Hope this explains it: > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced yeah - as i said it might be fatally broken (in fact it is). Do we understand why Vincent got the crashes with vanilla 2.6.22.14 ? Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, Dec 13, 2007 at 05:50:59PM +0100, Ingo Molnar wrote: > > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > > + sa->attr.owner = NULL; > > > > > > sa->attr.name = name; > > > > > > > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > > > > initialize the owner field to NULL automatically? > > > > > > > > Attibutes do not have an owner anymore: > > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=7b595756ec1f49e0049a9e01a1298d53a7faaa15 > > > > > > This one also fails to apply properly at the exact same place has > > > Ingo's previously posted patch. Would need to backport his one. > > > > It depends on a completely reworked sysfs logic, I don't think it > > makes any sense to backport that. > > well, if it fixes a live bug in a still supported stable kernel > release... > > Vincent, could you try to just get rid of all actual uses of > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > (totally untested - might be fatally broken as well) > hmm. I am not too sure if it is a good idea. I think it will break a lot of drivers. But I will just wait for the sysfs experts to speak up thre. -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, 2007-12-13 at 17:50 +0100, Ingo Molnar wrote: > * Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > > > + sa->attr.owner = NULL; > > > > > > sa->attr.name = name; > > > > > > > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > > > > initialize the owner field to NULL automatically? > > > > > > > > Attibutes do not have an owner anymore: > > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=7b595756ec1f49e0049a9e01a1298d53a7faaa15 > > > > > > This one also fails to apply properly at the exact same place has > > > Ingo's previously posted patch. Would need to backport his one. > > > > It depends on a completely reworked sysfs logic, I don't think it > > makes any sense to backport that. > > well, if it fixes a live bug in a still supported stable kernel > release... > > Vincent, could you try to just get rid of all actual uses of > se->attr.owner, within fs/sysfs/*.c? Something like the patch below. > (totally untested - might be fatally broken as well) How can you think that this is not needed? You can not remove it with sysfs you are patching. Hope this explains it: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0ab66088c855eca68513bdd7442a426c4b374ced Kay -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
* Kay Sievers <[EMAIL PROTECTED]> wrote: > > > > > + sa->attr.owner = NULL; > > > > > sa->attr.name = name; > > > > > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > > > initialize the owner field to NULL automatically? > > > > > > Attibutes do not have an owner anymore: > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=7b595756ec1f49e0049a9e01a1298d53a7faaa15 > > > > This one also fails to apply properly at the exact same place has > > Ingo's previously posted patch. Would need to backport his one. > > It depends on a completely reworked sysfs logic, I don't think it > makes any sense to backport that. well, if it fixes a live bug in a still supported stable kernel release... Vincent, could you try to just get rid of all actual uses of se->attr.owner, within fs/sysfs/*.c? Something like the patch below. (totally untested - might be fatally broken as well) Ingo --- fs/sysfs/bin.c |7 --- fs/sysfs/file.c | 13 - 2 files changed, 20 deletions(-) Index: linux-cfs-2.6.22.13.q/fs/sysfs/bin.c === --- linux-cfs-2.6.22.13.q.orig/fs/sysfs/bin.c +++ linux-cfs-2.6.22.13.q/fs/sysfs/bin.c @@ -125,11 +125,6 @@ static int open(struct inode * inode, st if (!kobj || !attr) goto Done; - /* Grab the module reference for this attribute if we have one */ - error = -ENODEV; - if (!try_module_get(attr->attr.owner)) - goto Done; - error = -EACCES; if ((file->f_mode & FMODE_WRITE) && !(attr->write || attr->mmap)) goto Error; @@ -145,7 +140,6 @@ static int open(struct inode * inode, st goto Done; Error: - module_put(attr->attr.owner); Done: if (error) kobject_put(kobj); @@ -159,7 +153,6 @@ static int release(struct inode * inode, u8 * buffer = file->private_data; kobject_put(kobj); - module_put(attr->attr.owner); kfree(buffer); return 0; } Index: linux-cfs-2.6.22.13.q/fs/sysfs/file.c === --- linux-cfs-2.6.22.13.q.orig/fs/sysfs/file.c +++ linux-cfs-2.6.22.13.q/fs/sysfs/file.c @@ -257,12 +257,6 @@ static int sysfs_open_file(struct inode if (!kobj || !attr) goto Einval; - /* Grab the module reference for this attribute if we have one */ - if (!try_module_get(attr->owner)) { - error = -ENODEV; - goto Done; - } - /* if the kobject has no ktype, then we assume that it is a subsystem * itself, and use ops for it. */ @@ -332,7 +326,6 @@ static int sysfs_open_file(struct inode goto Done; Eaccess: error = -EACCES; - module_put(attr->owner); Done: if (error) kobject_put(kobj); @@ -343,14 +336,12 @@ static int sysfs_release(struct inode * { struct kobject * kobj = to_kobj(filp->f_path.dentry->d_parent); struct attribute * attr = to_attr(filp->f_path.dentry); - struct module * owner = attr->owner; struct sysfs_buffer * buffer = filp->private_data; if (buffer) remove_from_collection(buffer, inode); kobject_put(kobj); /* After this point, attr should not be accessed. */ - module_put(owner); if (buffer) { if (buffer->page) @@ -615,7 +606,6 @@ static void sysfs_schedule_callback_work (ss->func)(ss->data); kobject_put(ss->kobj); - module_put(ss->owner); kfree(ss); } @@ -644,11 +634,8 @@ int sysfs_schedule_callback(struct kobje { struct sysfs_schedule_callback_struct *ss; - if (!try_module_get(owner)) - return -ENODEV; ss = kmalloc(sizeof(*ss), GFP_KERNEL); if (!ss) { - module_put(owner); return -ENOMEM; } kobject_get(kobj); -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, 2007-12-13 at 15:02 +, Vincent Fortier wrote: > On Thu, 2007-12-13 at 08:59 -0500, Kay Sievers wrote: > > On Thu, 2007-12-13 at 13:55 +0100, Ingo Molnar wrote: > > > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > > > > > Could you please see if the following patch removes the oops due > > to > > > > CFS sysfs files? (There might still be the other oops due to the > > > > floppy sysfs files) > > > > > > > > Ingo, could you please add this patch in your CFS backport to > > 2.6.22 > > > > and older kernels? > > > > > > sure - i've updated the backport patches with this fix. > > > > > > > static void user_attr_init(struct subsys_attribute *sa, char > > *name, int mode) > > > > { > > > > + sa->attr.owner = NULL; > > > > sa->attr.name = name; > > > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > > initialize the owner field to NULL automatically? > > > > Attibutes do not have an owner anymore: > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=7b595756ec1f49e0049a9e01a1298d53a7faaa15 > > This one also fails to apply properly at the exact same place has Ingo's > previously posted patch. Would need to backport his one. It depends on a completely reworked sysfs logic, I don't think it makes any sense to backport that. Kay -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, 2007-12-13 at 08:59 -0500, Kay Sievers wrote: > On Thu, 2007-12-13 at 13:55 +0100, Ingo Molnar wrote: > > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > > > Could you please see if the following patch removes the oops due > to > > > CFS sysfs files? (There might still be the other oops due to the > > > floppy sysfs files) > > > > > > Ingo, could you please add this patch in your CFS backport to > 2.6.22 > > > and older kernels? > > > > sure - i've updated the backport patches with this fix. > > > > > static void user_attr_init(struct subsys_attribute *sa, char > *name, int mode) > > > { > > > + sa->attr.owner = NULL; > > > sa->attr.name = name; > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > initialize the owner field to NULL automatically? > > Attibutes do not have an owner anymore: > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=7b595756ec1f49e0049a9e01a1298d53a7faaa15 This one also fails to apply properly at the exact same place has Ingo's previously posted patch. Would need to backport his one. > Kay - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, Dec 13, 2007 at 01:24:26PM +, Vincent Fortier wrote: > On Thu, 2007-12-13 at 18:32 +0530, Dhaval Giani wrote: > > On Thu, Dec 13, 2007 at 01:55:09PM +0100, Ingo Molnar wrote: > > > > > > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > > > > > Could you please see if the following patch removes the oops due to > > > > CFS sysfs files? (There might still be the other oops due to the > > > > floppy sysfs files) > > > > > > > > Ingo, could you please add this patch in your CFS backport to 2.6.22 > > > > and older kernels? > > > > > > sure - i've updated the backport patches with this fix. > > > > > > > Thanks! > > CFS v24 now does not apply correctly on a 2.6.22.15-rc1 here: Could you try on 2.6.22.13/14, while we wait for Ingo ;). -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, 2007-12-13 at 13:55 +0100, Ingo Molnar wrote: > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > Could you please see if the following patch removes the oops due to > > CFS sysfs files? (There might still be the other oops due to the > > floppy sysfs files) > > > > Ingo, could you please add this patch in your CFS backport to 2.6.22 > > and older kernels? > > sure - i've updated the backport patches with this fix. > > > static void user_attr_init(struct subsys_attribute *sa, char *name, int > > mode) > > { > > + sa->attr.owner = NULL; > > sa->attr.name = name; > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > initialize the owner field to NULL automatically? Attibutes do not have an owner anymore: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=7b595756ec1f49e0049a9e01a1298d53a7faaa15 Kay -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, 2007-12-13 at 08:12 -0500, Ingo Molnar wrote: > > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > > > static void user_attr_init(struct subsys_attribute *sa, char > *name, int mode) > > > > { > > > > + sa->attr.owner = NULL; > > > > sa->attr.name = name; > > > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > > initialize the owner field to NULL automatically? > > > > > > > Going through git log, it seems that commit > > 7b595756ec1f49e0049a9e01a1298d53a7faaa15 deemed attribute->owner as > > unnecessary. I guess that answers the question. > > thx. The only open question seems to be: Vincent had sysfs crashes > without the CFS patchset as well. > > Wouldnt it be prudent to backport the core bits of the above commit > (attached below), to make sure the owner field is never utilized. > (because it seems it's so easy and common to not maintain it properly) > > Vincent, does the patch below resolve the non-CFS crashes? I was about to test but it does not apply on a 2.6.22: [EMAIL PROTECTED] linux-2.6.22.15-rc1-patched]# patch -p1 < ../make_sure_owner_field_is_never_utilized.patch patching file fs/sysfs/bin.c Hunk #1 FAILED at 175. Hunk #2 FAILED at 198. Hunk #3 FAILED at 207. 3 out of 3 hunks FAILED -- saving rejects to file fs/sysfs/bin.c.rej patching file fs/sysfs/file.c Hunk #1 FAILED at 241. Hunk #2 FAILED at 250. Hunk #3 FAILED at 266. Hunk #4 FAILED at 274. Hunk #5 FAILED at 283. Hunk #6 FAILED at 292. Hunk #7 FAILED at 304. Hunk #8 FAILED at 312. 8 out of 8 hunks FAILED -- saving rejects to file fs/sysfs/file.c.rej I was about to backport it but I find it's not that trivial... Help would be appreciated. - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, 2007-12-13 at 18:32 +0530, Dhaval Giani wrote: > On Thu, Dec 13, 2007 at 01:55:09PM +0100, Ingo Molnar wrote: > > > > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > > > Could you please see if the following patch removes the oops due to > > > CFS sysfs files? (There might still be the other oops due to the > > > floppy sysfs files) > > > > > > Ingo, could you please add this patch in your CFS backport to 2.6.22 > > > and older kernels? > > > > sure - i've updated the backport patches with this fix. > > > > Thanks! CFS v24 now does not apply correctly on a 2.6.22.15-rc1 here: --- 31,43 #include #include #include + #include #include #include #include #include + #include #include #include due to [patch 31/36] Revert "Fix SMP poweroff hangs which removes: -#include About to build/test this morning. thnx. - vin > > > > static void user_attr_init(struct subsys_attribute *sa, char *name, int > > > mode) > > > { > > > + sa->attr.owner = NULL; > > > sa->attr.name = name; > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > initialize the owner field to NULL automatically? > > > > Going through git log, it seems that commit > 7b595756ec1f49e0049a9e01a1298d53a7faaa15 deemed attribute->owner as > unnecessary. I guess that answers the question. > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
* Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > static void user_attr_init(struct subsys_attribute *sa, char *name, int > > > mode) > > > { > > > + sa->attr.owner = NULL; > > > sa->attr.name = name; > > > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > > initialize the owner field to NULL automatically? > > > > Going through git log, it seems that commit > 7b595756ec1f49e0049a9e01a1298d53a7faaa15 deemed attribute->owner as > unnecessary. I guess that answers the question. thx. The only open question seems to be: Vincent had sysfs crashes without the CFS patchset as well. Wouldnt it be prudent to backport the core bits of the above commit (attached below), to make sure the owner field is never utilized. (because it seems it's so easy and common to not maintain it properly) Vincent, does the patch below resolve the non-CFS crashes? Ingo diff --git a/fs/sysfs/bin.c b/fs/sysfs/bin.c index 618b8ae..3c5574a 100644 --- a/fs/sysfs/bin.c +++ b/fs/sysfs/bin.c @@ -175,25 +175,20 @@ static int open(struct inode * inode, struct file * file) if (!sysfs_get_active(attr_sd)) return -ENODEV; - /* Grab the module reference for this attribute */ - error = -ENODEV; - if (!try_module_get(attr->attr.owner)) - goto err_sput; - error = -EACCES; if ((file->f_mode & FMODE_WRITE) && !(attr->write || attr->mmap)) - goto err_mput; + goto err_out; if ((file->f_mode & FMODE_READ) && !(attr->read || attr->mmap)) - goto err_mput; + goto err_out; error = -ENOMEM; bb = kzalloc(sizeof(*bb), GFP_KERNEL); if (!bb) - goto err_mput; + goto err_out; bb->buffer = kmalloc(PAGE_SIZE, GFP_KERNEL); if (!bb->buffer) - goto err_mput; + goto err_out; mutex_init(&bb->mutex); file->private_data = bb; @@ -203,9 +198,7 @@ static int open(struct inode * inode, struct file * file) sysfs_get(attr_sd); return 0; - err_mput: - module_put(attr->attr.owner); - err_sput: + err_out: sysfs_put_active(attr_sd); kfree(bb); return error; @@ -214,13 +207,11 @@ static int open(struct inode * inode, struct file * file) static int release(struct inode * inode, struct file * file) { struct sysfs_dirent *attr_sd = file->f_path.dentry->d_fsdata; - struct bin_attribute *attr = attr_sd->s_elem.bin_attr.bin_attr; struct bin_buffer *bb = file->private_data; if (bb->mmapped) sysfs_put_active_two(attr_sd); sysfs_put(attr_sd); - module_put(attr->attr.owner); kfree(bb->buffer); kfree(bb); return 0; diff --git a/fs/sysfs/file.c b/fs/sysfs/file.c index d673d9b..a84b734 100644 --- a/fs/sysfs/file.c +++ b/fs/sysfs/file.c @@ -241,7 +241,6 @@ sysfs_write_file(struct file *file, const char __user *buf, size_t count, loff_t static int sysfs_open_file(struct inode *inode, struct file *file) { struct sysfs_dirent *attr_sd = file->f_path.dentry->d_fsdata; - struct attribute *attr = attr_sd->s_elem.attr.attr; struct kobject *kobj = attr_sd->s_parent->s_elem.dir.kobj; struct sysfs_buffer * buffer; struct sysfs_ops * ops = NULL; @@ -251,11 +250,6 @@ static int sysfs_open_file(struct inode *inode, struct file *file) if (!sysfs_get_active_two(attr_sd)) return -ENODEV; - /* Grab the module reference for this attribute */ - error = -ENODEV; - if (!try_module_get(attr->owner)) - goto err_sput; - /* if the kobject has no ktype, then we assume that it is a subsystem * itself, and use ops for it. */ @@ -272,7 +266,7 @@ static int sysfs_open_file(struct inode *inode, struct file *file) * or the subsystem have no operations. */ if (!ops) - goto err_mput; + goto err_out; /* File needs write support. * The inode's perms must say it's ok, @@ -280,7 +274,7 @@ static int sysfs_open_file(struct inode *inode, struct file *file) */ if (file->f_mode & FMODE_WRITE) { if (!(inode->i_mode & S_IWUGO) || !ops->store) - goto err_mput; + goto err_out; } /* File needs read support. @@ -289,7 +283,7 @@ static int sysfs_open_file(struct inode *inode, struct file *file) */ if (file->f_mode & FMODE_READ) { if (!(inode->i_mode & S_IRUGO) || !ops->show) - goto err_mput; + goto err_out; } /* No error? Great, allocate a buffer for the file, and store it @@ -298,7 +292,7 @@ static int sysfs_open_file(struct inode *inode, struct file *file) error = -ENOMEM; buffer = kzalloc(sizeof(struct sysfs_buff
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Thu, Dec 13, 2007 at 01:55:09PM +0100, Ingo Molnar wrote: > > * Dhaval Giani <[EMAIL PROTECTED]> wrote: > > > Could you please see if the following patch removes the oops due to > > CFS sysfs files? (There might still be the other oops due to the > > floppy sysfs files) > > > > Ingo, could you please add this patch in your CFS backport to 2.6.22 > > and older kernels? > > sure - i've updated the backport patches with this fix. > Thanks! > > static void user_attr_init(struct subsys_attribute *sa, char *name, int > > mode) > > { > > + sa->attr.owner = NULL; > > sa->attr.name = name; > > i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs > initialize the owner field to NULL automatically? > Going through git log, it seems that commit 7b595756ec1f49e0049a9e01a1298d53a7faaa15 deemed attribute->owner as unnecessary. I guess that answers the question. -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
* Dhaval Giani <[EMAIL PROTECTED]> wrote: > Could you please see if the following patch removes the oops due to > CFS sysfs files? (There might still be the other oops due to the > floppy sysfs files) > > Ingo, could you please add this patch in your CFS backport to 2.6.22 > and older kernels? sure - i've updated the backport patches with this fix. > static void user_attr_init(struct subsys_attribute *sa, char *name, int mode) > { > + sa->attr.owner = NULL; > sa->attr.name = name; i'm wondering why doesnt this affect 2.6.23 and later? Does sysfs initialize the owner field to NULL automatically? Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
Hi Vincent, Could you please see if the following patch removes the oops due to CFS sysfs files? (There might still be the other oops due to the floppy sysfs files) Ingo, could you please add this patch in your CFS backport to 2.6.22 and older kernels? Thanks, -- kdump showed that the owner field had some junk value which caused the oops reported at http://lkml.org/lkml/2007/12/10/113 . This patch sets the value of that field to NULL. Signed-off-by: Dhaval Giani <[EMAIL PROTECTED]> Signed-off-by: Maneesh Soni <[EMAIL PROTECTED]> --- kernel/user.c |1 + 1 files changed, 1 insertion(+) Index: linux-2.6.22.13/kernel/user.c === --- linux-2.6.22.13.orig/kernel/user.c +++ linux-2.6.22.13/kernel/user.c @@ -145,6 +145,7 @@ ssize_t cpu_shares_store(struct kset *ks static void user_attr_init(struct subsys_attribute *sa, char *name, int mode) { + sa->attr.owner = NULL; sa->attr.name = name; sa->attr.mode = mode; sa->show = cpu_shares_show; -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani > > On Wed, Dec 12, 2007 at 07:57:33AM -0500, Fortier,Vincent > [Montreal] wrote: > > > -Message d'origine- > > > De : Dhaval Giani [mailto:[EMAIL PROTECTED] > > > > > > On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote: > > > > > > > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote: > > > > > > > > > > That has changed from /sys/kernel/uids//cpu_share > > > > > > > > > > Here is my config. > > > > > > > > > > Maybie I should give it a shot without CFS at all and see what > > > > > happends ? > > > > It got triggerred also using a 2.6.22.14: Here are my preliminary test results: 2.6.21.7: OK 2.6.22.13/14: Failure 2.6.23.9: OK 2.6.24-rc5-git2: OK It seems to only hang using a 2.6.22 kernel. > > No, not any more. Would it be possible for you to do a > git-bisect? I am not too well versed with sysfs, so it is not > apparent to me what is causing this oops. It seems to be > easily reproducible. I don't still have a reliable method to > reproduce it without the CFS patch. Could sysfs experts > please help debugging? > I seriously doubt I have the time to do a git-bisect at the moment - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Wed, Dec 12, 2007 at 07:57:33AM -0500, Fortier,Vincent [Montreal] wrote: > > -Message d'origine- > > De : Dhaval Giani [mailto:[EMAIL PROTECTED] > > > > On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote: > > > > > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote: > > > > > > > > That has changed from /sys/kernel/uids//cpu_share > > > > > > > > Here is my config. > > > > > > > > Maybie I should give it a shot without CFS at all and see what > > > > happends ? > > It got triggerred also using a 2.6.22.14: > [57560.396000] BUG: unable to handle kernel paging request at virtual > address 8000 > [57560.396000] printing eip: > [57560.396000] c01d6c56 > [57560.396000] *pdpt = 08d02001 > [57560.396000] *pde = > [57560.396000] Oops: [#34] > [57560.396000] SMP > [57560.396000] last sysfs file: /devices/platform/floppy.0/uevent > [57560.396000] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd > nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse > ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt iTCO_vendor_support > psmouse e752x_edac edac_mc serio_raw evdev pcspkr sg floppy shpchp > pci_hotplug sr_mod cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod > generic piix ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal > processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm > cciss aacraid > [57560.396000] CPU:2 > [57560.396000] EIP:0060:[]Not tainted VLI > [57560.396000] EFLAGS: 00010297 (2.6.22.14-etch-686-envcan #1) > [57560.396000] EIP is at vsnprintf+0x2af/0x48c > [57560.396000] eax: 8000 ebx: ecx: 8000 edx: > fffe > [57560.396000] esi: edf37017 edi: edf09eac ebp: esp: > edf09e4c > [57560.396000] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 > [57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530 > task.ti=edf08000) > [57560.396000] Stack: c852b000 1000 c0338c78 f895b56c c0233bf5 > c852b000 120c8fe8 edf37017 > [57560.396000]00c3bd08 > c03354eb 0003 0017 > [57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac > c0233170 edf37017 c03354ea > [57560.396000] Call Trace: > [57560.396000] [] dev_uevent+0x189/0x1e0 > [57560.396000] [] sprintf+0x20/0x23 > [57560.396000] [] show_uevent+0xad/0xd5 > [57560.396000] [] get_page_from_freelist+0x296/0x32d > [57560.396000] [] group_send_sig_info+0x12/0x56 > [57560.396000] [] __alloc_pages+0x52/0x294 > [57560.396000] [] show_uevent+0x0/0xd5 > [57560.396000] [] dev_attr_show+0x15/0x18 > [57560.396000] [] sysfs_read_file+0x87/0xd8 > [57560.396000] [] sys_getxattr+0x46/0x4e > [57560.396000] [] sysfs_read_file+0x0/0xd8 > [57560.396000] [] vfs_read+0xa6/0x128 > [57560.396000] [] sys_read+0x41/0x67 > [57560.396000] [] syscall_call+0x7/0xb > [57560.396000] === > [57560.396000] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 > 00 00 8b 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 c8 > eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 30 10 89 c3 > [57560.396000] EIP: [] vsnprintf+0x2af/0x48c SS:ESP > 0068:edf09e4c > > > > > > > and also with CFS but without CONFIG_FAIR_GROUP_SCHED. > > > > > Is it still required since it now does not seems to be CFS related? > No, not any more. Would it be possible for you to do a git-bisect? I am not too well versed with sysfs, so it is not apparent to me what is causing this oops. It seems to be easily reproducible. I don't still have a reliable method to reproduce it without the CFS patch. Could sysfs experts please help debugging? Thanks, -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] De la part de > Fortier,Vincent [Montreal] > > > -Message d'origine- > > De : Dhaval Giani [mailto:[EMAIL PROTECTED] > > > > On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote: > > > > > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote: > > > > > > > > That has changed from /sys/kernel/uids//cpu_share > > > > > > > > Here is my config. > > > > > > > > Maybie I should give it a shot without CFS at all and see what > > > > happends ? > > It got triggerred also using a 2.6.22.14: Just to clarify... this is a non CFS kernel oops... > [57560.396000] BUG: unable to handle kernel paging request at > virtual address 8000 [57560.396000] printing eip: > [57560.396000] c01d6c56 > [57560.396000] *pdpt = 08d02001 > [57560.396000] *pde = > [57560.396000] Oops: [#34] > [57560.396000] SMP > [57560.396000] last sysfs file: > /devices/platform/floppy.0/uevent [57560.396000] Modules > linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc > ppdev parport_pc lp parport button ac battery ipv6 fuse > ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt > iTCO_vendor_support psmouse e752x_edac edac_mc serio_raw > evdev pcspkr sg floppy shpchp pci_hotplug sr_mod cdrom ext3 > jbd mbcache dm_mirror dm_snapshot dm_mod generic piix > ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal > processor fan mptscsih mptbase megaraid_sas megaraid_mbox > megaraid_mm cciss aacraid > [57560.396000] CPU:2 > [57560.396000] EIP:0060:[]Not tainted VLI > [57560.396000] EFLAGS: 00010297 (2.6.22.14-etch-686-envcan #1) > [57560.396000] EIP is at vsnprintf+0x2af/0x48c > [57560.396000] eax: 8000 ebx: ecx: 8000 edx: > fffe > [57560.396000] esi: edf37017 edi: edf09eac ebp: esp: > edf09e4c > [57560.396000] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 > [57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530 > task.ti=edf08000) > [57560.396000] Stack: c852b000 1000 c0338c78 f895b56c > c0233bf5 c852b000 120c8fe8 edf37017 > [57560.396000]00c3bd08 > c03354eb 0003 0017 > [57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac > c0233170 edf37017 c03354ea > [57560.396000] Call Trace: > [57560.396000] [] dev_uevent+0x189/0x1e0 > [57560.396000] [] sprintf+0x20/0x23 [57560.396000] > [] show_uevent+0xad/0xd5 [57560.396000] > [] get_page_from_freelist+0x296/0x32d > [57560.396000] [] group_send_sig_info+0x12/0x56 > [57560.396000] [] __alloc_pages+0x52/0x294 > [57560.396000] [] show_uevent+0x0/0xd5 > [57560.396000] [] dev_attr_show+0x15/0x18 > [57560.396000] [] sysfs_read_file+0x87/0xd8 > [57560.396000] [] sys_getxattr+0x46/0x4e > [57560.396000] [] sysfs_read_file+0x0/0xd8 > [57560.396000] [] vfs_read+0xa6/0x128 > [57560.396000] [] sys_read+0x41/0x67 > [57560.396000] [] syscall_call+0x7/0xb > [57560.396000] === [57560.396000] Code: > 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 00 00 8b > 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 > c8 eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 > 30 10 89 c3 [57560.396000] EIP: [] > vsnprintf+0x2af/0x48c SS:ESP 0068:edf09e4c > > > > > > > and also with CFS but without CONFIG_FAIR_GROUP_SCHED. > > > > > Is it still required since it now does not seems to be CFS related? > > > > > Hi Ingo, > > > > I am able to reproduce the oops here on my system with > > 2.6.22.14 + CFS backport. I am not able to reproduce it with > > 2.6.22.13 + CFS backport. I believe the CFS backport is > just exposing > > the bug. Can't find an obvious culprit and am looking into > this issue. > > > > Vincent, could you please confirm if you are able to reproduce this > > with > > 2.6.22.13 + CFS? > > Using 2.6.13 + CFS v24 I was also able to reproduce the bug > (I already had one built in my depot without the > display_most-recently-opened_sysfs_file_name_when_oopsing.patc > h). So it looks like it is at least related to >= 2.6.22.13 > and probably not directly CFS related. Note that to get a > oops on a 2.6.13 it seems to need a full backup since it > usually works with incremental. The backup does start > properly then, in this case, at around 70% it oopsed. Using > 2.6.22.14 it seems to oops right at startup. Here is the > 2.6.22.13 CFS v24 oops: Again, just to clarify, I'm not even sure the backup worked at all using a 2.6.22.13 CFS v24 since I already had a previous pending full backup at 70% ... so it may simply had tried to finalize that one and crash right at startup? > [ 170.152908] SGI XFS Quota Management subsystem [ > 170.168443] Filesystem "drbd0": Disabling barriers, not > supported by the underlying device [ 170.174964] XFS > mounting filesystem drbd0 [ 170.232455] Ending clean XFS > mount for filesystem: drbd0 [
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : Dhaval Giani [mailto:[EMAIL PROTECTED] > > On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote: > > > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote: > > > > > > That has changed from /sys/kernel/uids//cpu_share > > > > > > Here is my config. > > > > > > Maybie I should give it a shot without CFS at all and see what > > > happends ? It got triggerred also using a 2.6.22.14: [57560.396000] BUG: unable to handle kernel paging request at virtual address 8000 [57560.396000] printing eip: [57560.396000] c01d6c56 [57560.396000] *pdpt = 08d02001 [57560.396000] *pde = [57560.396000] Oops: [#34] [57560.396000] SMP [57560.396000] last sysfs file: /devices/platform/floppy.0/uevent [57560.396000] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt iTCO_vendor_support psmouse e752x_edac edac_mc serio_raw evdev pcspkr sg floppy shpchp pci_hotplug sr_mod cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm cciss aacraid [57560.396000] CPU:2 [57560.396000] EIP:0060:[]Not tainted VLI [57560.396000] EFLAGS: 00010297 (2.6.22.14-etch-686-envcan #1) [57560.396000] EIP is at vsnprintf+0x2af/0x48c [57560.396000] eax: 8000 ebx: ecx: 8000 edx: fffe [57560.396000] esi: edf37017 edi: edf09eac ebp: esp: edf09e4c [57560.396000] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 [57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530 task.ti=edf08000) [57560.396000] Stack: c852b000 1000 c0338c78 f895b56c c0233bf5 c852b000 120c8fe8 edf37017 [57560.396000]00c3bd08 c03354eb 0003 0017 [57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac c0233170 edf37017 c03354ea [57560.396000] Call Trace: [57560.396000] [] dev_uevent+0x189/0x1e0 [57560.396000] [] sprintf+0x20/0x23 [57560.396000] [] show_uevent+0xad/0xd5 [57560.396000] [] get_page_from_freelist+0x296/0x32d [57560.396000] [] group_send_sig_info+0x12/0x56 [57560.396000] [] __alloc_pages+0x52/0x294 [57560.396000] [] show_uevent+0x0/0xd5 [57560.396000] [] dev_attr_show+0x15/0x18 [57560.396000] [] sysfs_read_file+0x87/0xd8 [57560.396000] [] sys_getxattr+0x46/0x4e [57560.396000] [] sysfs_read_file+0x0/0xd8 [57560.396000] [] vfs_read+0xa6/0x128 [57560.396000] [] sys_read+0x41/0x67 [57560.396000] [] syscall_call+0x7/0xb [57560.396000] === [57560.396000] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 00 00 8b 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 c8 eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 30 10 89 c3 [57560.396000] EIP: [] vsnprintf+0x2af/0x48c SS:ESP 0068:edf09e4c > > > > and also with CFS but without CONFIG_FAIR_GROUP_SCHED. > > Is it still required since it now does not seems to be CFS related? > > Hi Ingo, > > I am able to reproduce the oops here on my system with > 2.6.22.14 + CFS backport. I am not able to reproduce it with > 2.6.22.13 + CFS backport. I believe the CFS backport is just > exposing the bug. Can't find an obvious culprit and am > looking into this issue. > > Vincent, could you please confirm if you are able to > reproduce this with > 2.6.22.13 + CFS? Using 2.6.13 + CFS v24 I was also able to reproduce the bug (I already had one built in my depot without the display_most-recently-opened_sysfs_file_name_when_oopsing.patch). So it looks like it is at least related to >= 2.6.22.13 and probably not directly CFS related. Note that to get a oops on a 2.6.13 it seems to need a full backup since it usually works with incremental. The backup does start properly then, in this case, at around 70% it oopsed. Using 2.6.22.14 it seems to oops right at startup. Here is the 2.6.22.13 CFS v24 oops: [ 170.152908] SGI XFS Quota Management subsystem [ 170.168443] Filesystem "drbd0": Disabling barriers, not supported by the underlying device [ 170.174964] XFS mounting filesystem drbd0 [ 170.232455] Ending clean XFS mount for filesystem: drbd0 [ 170.318614] Filesystem "drbd1": Disabling barriers, not supported by the underlying device [ 170.327708] XFS mounting filesystem drbd1 [ 170.380481] Ending clean XFS mount for filesystem: drbd1 [ 947.493764] BUG: unable to handle kernel NULL pointer dereference at virtual address 00c8 [ 947.493797] printing eip: [ 947.493810] c01a922c [ 947.493823] *pdpt = 2a97a001 [ 947.493837] *pde = [ 947.493852] Oops: [#1] [ 947.493865] SMP [ 947.493881] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt i
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote: > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote: > > > > That has changed from /sys/kernel/uids//cpu_share > > > > > > > Here is my config. > > > > Maybie I should give it a shot without CFS at all and see what > > happends ? > > and also with CFS but without CONFIG_FAIR_GROUP_SCHED. > Hi Ingo, I am able to reproduce the oops here on my system with 2.6.22.14 + CFS backport. I am not able to reproduce it with 2.6.22.13 + CFS backport. I believe the CFS backport is just exposing the bug. Can't find an obvious culprit and am looking into this issue. Vincent, could you please confirm if you are able to reproduce this with 2.6.22.13 + CFS? Thanks, -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
* Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote: > > That has changed from /sys/kernel/uids//cpu_share > > > > Here is my config. > > Maybie I should give it a shot without CFS at all and see what > happends ? and also with CFS but without CONFIG_FAIR_GROUP_SCHED. Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, Dec 11, 2007 at 10:53:38PM +0530, Dhaval Giani wrote: > On Tue, Dec 11, 2007 at 09:04:00AM -0800, Greg KH wrote: > > On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote: > > > On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote: > > > > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > > > > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: > > > > > > > > > > Ingo, can you look at this, please? > > > > > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > > > > > > > > > > > > Hi, > > > > > > > > We are looking into this bug now. I believe that the patch at > > > > http://marc.info/?l=linux-kernel&m=119404922603293 should help. > > > > > > > > I am working with Kay to get this ported. > > > > > > > > > > Hi Vincent, > > > > > > Does the following patch help? > > > > > > Kay/Greg, could you please review and add your Signed-off-by(s) as > > > required? > > > > Um, why? What is this patch for? Where is it to be sent, to Linus for > > 2.6.24-final? Or to the -stable tree? > > > > Hi Greg, > > This is for 2.26.24-final, since Fair User scheduling is not yet there > in stable. Again, I think this patch is too big for that release, unless it really is determined that this fix is needed. As this thread shows, I do not think it is true... thanks, greg k-h -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, 11 Dec 2007 14:08:15 -0500 Fortier,Vincent [Montreal] wrote: > > -Message d'origine- > > De : [EMAIL PROTECTED] > > [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani > > > > > > > > > > My tests with Galaxy 5.9 shows that it still does not work. > > > > Although, the error seems to have changed a bit (see > > > > attached dmesg) > > > > > > > > > > Hmmm, makes me suspect the bug is somewhere else. What I am > > > not able > > > to figure out is that I was able to recreate the trace you > > > had on my > > > systems. So there is a bug somewhere there. > > > > > > > To make it clearer why I think so, > > > > > [ 638.466838] last sysfs file: /devices/platform/floppy.0/uevent > > ^ > > That has changed from /sys/kernel/uids//cpu_share > > > > Here is my config. > > Maybie I should give it a shot without CFS at all and see what happends I agree. --- ~Randy Features and documentation: http://lwn.net/Articles/260136/ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
> > > > My tests with Galaxy 5.9 shows that it still does not work. Although, > > the error seems to have changed a bit (see attached dmesg) > > > > Hmmm, makes me suspect the bug is somewhere else. What I am not able to > figure out is that I was able to recreate the trace you had on my > systems. So there is a bug somewhere there. > To make it clearer why I think so, > [ 638.466375] BUG: unable to handle kernel paging request at virtual address > 8000 > [ 638.466479] printing eip: > [ 638.466527] c01d9182 > [ 638.466574] *pdpt = 2d022001 > [ 638.466622] *pde = > [ 638.466672] Oops: [#1] > [ 638.466719] SMP > [ 638.466838] last sysfs file: /devices/platform/floppy.0/uevent ^ That has changed from /sys/kernel/uids//cpu_share > [ 638.466890] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl > sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd > ide_generic usbkbd usbmouse tsdev iTCO_wdt iTCO_vendor_support sg psmouse > e752x_edac shpchp sr_mod pci_hotplug serio_raw edac_mc evdev pcspkr cdrom > floppy ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core > ata_piix ehci_hcd uhci_hcd tg3 usbcore thermal processor fan mptscsih mptbase > megaraid_sas megaraid_mbox megaraid_mm cciss aacraid > [ 638.469273] CPU:3 > [ 638.469274] EIP:0060:[]Not tainted VLI > [ 638.469275] EFLAGS: 00010297 (2.6.22.14-cfs-etch-686-envcan #1) > [ 638.469444] EIP is at vsnprintf+0x2af/0x48c > [ 638.469504] eax: 8000 ebx: ecx: 8000 edx: fffe > [ 638.469567] esi: ebc29017 edi: ed019eac ebp: esp: ed019e4c > [ 638.469631] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 > [ 638.469693] Process clBackup (pid: 4849, ti=ed018000 task=f6674c60 > task.ti=ed018000) > [ 638.469757] Stack: ec2a9000 1000 c033b658 f899b56c c0236131 ec2a9000 > 143d6fe8 ebc29017 > [ 638.470212]00f9b608 c0337ecb > 0003 0017 > [ 638.470665]c037a3a0 ec2a9000 c01d93e0 ed019eac ed019eac c02356ac > ebc29017 c0337eca > [ 638.471129] Call Trace: > [ 638.471258] [] dev_uevent+0x189/0x1e0 > [ 638.471377] [] sprintf+0x20/0x23 > [ 638.471486] [] show_uevent+0xad/0xd5 > [ 638.471594] [] get_page_from_freelist+0x273/0x30a > [ 638.471713] [] group_send_sig_info+0x12/0x56 > [ 638.471822] [] __alloc_pages+0x52/0x286 > [ 638.471930] [] show_uevent+0x0/0xd5 > [ 638.472034] [] dev_attr_show+0x15/0x18 > [ 638.472138] [] sysfs_read_file+0x87/0xd8 > [ 638.472240] [] sys_getxattr+0x46/0x4e > [ 638.472341] [] sysfs_read_file+0x0/0xd8 > [ 638.472445] [] vfs_read+0xa6/0x128 > [ 638.472551] [] sys_read+0x41/0x67 > [ 638.472656] [] syscall_call+0x7/0xb > [ 638.472765] === > [ 638.472822] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 00 00 > 8b 0f b8 59 0a 33 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 c8 eb 06 <80> > 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 30 10 89 c3 > [ 638.475660] EIP: [] vsnprintf+0x2af/0x48c SS:ESP 0068:ed019e4c > > -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, Dec 11, 2007 at 12:47:29PM -0500, Fortier,Vincent [Montreal] wrote: > > -Message d'origine- > > De : [EMAIL PROTECTED] > > [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani > > > > On Tue, Dec 11, 2007 at 09:04:00AM -0800, Greg KH wrote: > > > On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote: > > > > On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote: > > > > > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > > > > > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent wrote: > > > > > > > > > > > > Ingo, can you look at this, please? > > > > > > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > > > > > > > > > > > > > > > Hi, > > > > > > > > > > We are looking into this bug now. I believe that the patch at > > > > > http://marc.info/?l=linux-kernel&m=119404922603293 should help. > > > > > > > > > > I am working with Kay to get this ported. > > > > > > > > > > > > > Hi Vincent, > > > > > > > > Does the following patch help? > > > > > > > > Kay/Greg, could you please review and add your Signed-off-by(s) as > > > > > required? > > > > > > Um, why? What is this patch for? Where is it to be sent, to Linus > > > for 2.6.24-final? Or to the -stable tree? > > > > > > > Hi Greg, > > > > This is for 2.26.24-final, since Fair User scheduling is not > > yet there in stable. > > > > > > This is basically a port of the patch at > > > > http://marc.info/?l=linux-kernel&m=119404922603293 > > > > > > Yeah, but that patch needs some other core kobject changes, right? > > > > > > > Yep, there are some other changes that patch needed. We have > > worked around them by using the existing functions in the > > current Linus tree. > > > > > What exactly are you trying to fix here, the fact that this code > never > > > even worked? > > > > > > > The code was not using the kobject API. Its been cleaned up now (I > > hope!) > > It refused to apply cleanly on a 2.6.22.14 + CFS v24, only one failure > occured. So I resolved it manually and attached the resulting diff. > > My tests with Galaxy 5.9 shows that it still does not work. Although, > the error seems to have changed a bit (see attached dmesg) > Hmmm, makes me suspect the bug is somewhere else. What I am not able to figure out is that I was able to recreate the trace you had on my systems. So there is a bug somewhere there. Could you send your config please? > > > And, please, we need some documentation for Documenatation/ABI/ on > > > exactly what these sysfs files and tree is for. Please add that now > > > > for Linus's tree. > > > > On to it, will send the patch asap. > > > > > confused, > > > > hope i helped (in clearing it :) ) > > Should this patch eventually be included in? > 2.6.25 ? > 2.6.24 ? > (-stable 2.6.23 & 2.6.22) || backport CFS v24 -> v25? > 2.6.24 is what I believe, unless of course the bug lies elsewhere. -- regards, Dhaval [ 638.466375] BUG: unable to handle kernel paging request at virtual address 8000 [ 638.466479] printing eip: [ 638.466527] c01d9182 [ 638.466574] *pdpt = 2d022001 [ 638.466622] *pde = [ 638.466672] Oops: [#1] [ 638.466719] SMP [ 638.466838] last sysfs file: /devices/platform/floppy.0/uevent [ 638.466890] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt iTCO_vendor_support sg psmouse e752x_edac shpchp sr_mod pci_hotplug serio_raw edac_mc evdev pcspkr cdrom floppy ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ata_piix ehci_hcd uhci_hcd tg3 usbcore thermal processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm cciss aacraid [ 638.469273] CPU:3 [ 638.469274] EIP:0060:[]Not tainted VLI [ 638.469275] EFLAGS: 00010297 (2.6.22.14-cfs-etch-686-envcan #1) [ 638.469444] EIP is at vsnprintf+0x2af/0x48c [ 638.469504] eax: 8000 ebx: ecx: 8000 edx: fffe [ 638.469567] esi: ebc29017 edi: ed019eac ebp: esp: ed019e4c [ 638.469631] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 [ 638.469693] Process clBackup (pid: 4849, ti=ed018000 task=f6674c60 task.ti=ed018000) [ 638.469757] Stack: ec2a9000 1000 c033b658 f899b56c c0236131 ec2a9000 143d6fe8 ebc29017 [ 638.470212]00f9b608 c0337ecb 0003 0017 [ 638.470665]c037a3a0 ec2a9000 c01d93e0 ed019eac ed019eac c02356ac ebc29017 c0337eca [ 638.471129] Call Trace: [ 638.471258] [] dev_uevent+0x189/0x1e0 [ 638.471377] [] sprintf+0x20/0x23 [ 638.471486] [] show_uevent+0xad/0xd5 [ 638.471594] [] get_page_from_freelist+0x273/0x30a [ 638.471713] [] group_send_sig_info+0x12/0x56 [ 638.471822] [] __alloc_pages+0x52/0x286 [ 638.471930] [] show
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani > > On Tue, Dec 11, 2007 at 09:04:00AM -0800, Greg KH wrote: > > On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote: > > > On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote: > > > > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > > > > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent wrote: > > > > > > > > > > Ingo, can you look at this, please? > > > > > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > > > > > > > > > > > > Hi, > > > > > > > > We are looking into this bug now. I believe that the patch at > > > > http://marc.info/?l=linux-kernel&m=119404922603293 should help. > > > > > > > > I am working with Kay to get this ported. > > > > > > > > > > Hi Vincent, > > > > > > Does the following patch help? > > > > > > Kay/Greg, could you please review and add your Signed-off-by(s) as > > > required? > > > > Um, why? What is this patch for? Where is it to be sent, to Linus > > for 2.6.24-final? Or to the -stable tree? > > > > Hi Greg, > > This is for 2.26.24-final, since Fair User scheduling is not > yet there in stable. > > > > This is basically a port of the patch at > > > http://marc.info/?l=linux-kernel&m=119404922603293 > > > > Yeah, but that patch needs some other core kobject changes, right? > > > > Yep, there are some other changes that patch needed. We have > worked around them by using the existing functions in the > current Linus tree. > > > What exactly are you trying to fix here, the fact that this code never > > even worked? > > > > The code was not using the kobject API. Its been cleaned up now (I > hope!) It refused to apply cleanly on a 2.6.22.14 + CFS v24, only one failure occured. So I resolved it manually and attached the resulting diff. My tests with Galaxy 5.9 shows that it still does not work. Although, the error seems to have changed a bit (see attached dmesg) > > And, please, we need some documentation for Documenatation/ABI/ on > > exactly what these sysfs files and tree is for. Please add that now > > for Linus's tree. > > On to it, will send the patch asap. > > > confused, > > hope i helped (in clearing it :) ) Should this patch eventually be included in? 2.6.25 ? 2.6.24 ? (-stable 2.6.23 & 2.6.22) || backport CFS v24 -> v25? Thnx, - vin dmesg.2.6.22.14-CFSv24-FairUserInterfaceBUGfix Description: dmesg.2.6.22.14-CFSv24-FairUserInterfaceBUGfix FairUserInterface-BugFix.patch Description: FairUserInterface-BugFix.patch
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, Dec 11, 2007 at 09:04:00AM -0800, Greg KH wrote: > On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote: > > On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote: > > > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > > > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: > > > > > > > > Ingo, can you look at this, please? > > > > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > > > > > > > > > Hi, > > > > > > We are looking into this bug now. I believe that the patch at > > > http://marc.info/?l=linux-kernel&m=119404922603293 should help. > > > > > > I am working with Kay to get this ported. > > > > > > > Hi Vincent, > > > > Does the following patch help? > > > > Kay/Greg, could you please review and add your Signed-off-by(s) as > > required? > > Um, why? What is this patch for? Where is it to be sent, to Linus for > 2.6.24-final? Or to the -stable tree? > Hi Greg, This is for 2.26.24-final, since Fair User scheduling is not yet there in stable. > > This is basically a port of the patch at > > http://marc.info/?l=linux-kernel&m=119404922603293 > > Yeah, but that patch needs some other core kobject changes, right? > Yep, there are some other changes that patch needed. We have worked around them by using the existing functions in the current Linus tree. > What exactly are you trying to fix here, the fact that this code never > even worked? > The code was not using the kobject API. Its been cleaned up now (I hope!) > And, please, we need some documentation for Documenatation/ABI/ on > exactly what these sysfs files and tree is for. Please add that now for > Linus's tree. > On to it, will send the patch asap. > confused, > hope i helped (in clearing it :) ) Thanks, -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote: > On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote: > > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: > > > > > > Ingo, can you look at this, please? > > > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > > > > > > Hi, > > > > We are looking into this bug now. I believe that the patch at > > http://marc.info/?l=linux-kernel&m=119404922603293 should help. > > > > I am working with Kay to get this ported. > > > > Hi Vincent, > > Does the following patch help? > > Kay/Greg, could you please review and add your Signed-off-by(s) as > required? Um, why? What is this patch for? Where is it to be sent, to Linus for 2.6.24-final? Or to the -stable tree? > This is basically a port of the patch at > http://marc.info/?l=linux-kernel&m=119404922603293 Yeah, but that patch needs some other core kobject changes, right? What exactly are you trying to fix here, the fact that this code never even worked? And, please, we need some documentation for Documenatation/ABI/ on exactly what these sysfs files and tree is for. Please add that now for Linus's tree. confused, greg k-h -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote: > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: > > > > Ingo, can you look at this, please? > > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > > > Hi, > > We are looking into this bug now. I believe that the patch at > http://marc.info/?l=linux-kernel&m=119404922603293 should help. > > I am working with Kay to get this ported. > Hi Vincent, Does the following patch help? Kay/Greg, could you please review and add your Signed-off-by(s) as required? This is basically a port of the patch at http://marc.info/?l=linux-kernel&m=119404922603293 Thanks, -- The sysfs interface for the Fair User Interface hits upon the bug reported at http://lkml.org/lkml/2007/12/10/113. Kay Sievers and Greg K H had posted some sysfs cleanup patches sometime back at http://marc.info/?l=linux-kernel&m=119404922603293 . This patch has been ported to 2.6.22.14 + CFS v24 backport. Cc: Ingo Molnar <[EMAIL PROTECTED]> Not-yet-Signed-off-by: Srivatsa Vaddagiri <[EMAIL PROTECTED]> Not-yet-Signed-off-by: Dhaval Giani <[EMAIL PROTECTED]> --- include/linux/sched.h |9 --- kernel/ksysfs.c |7 -- kernel/user.c | 129 +- 3 files changed, 80 insertions(+), 65 deletions(-) Index: current/include/linux/sched.h === --- current.orig/include/linux/sched.h +++ current/include/linux/sched.h @@ -586,18 +586,13 @@ struct user_struct { #ifdef CONFIG_FAIR_USER_SCHED struct task_group *tg; #ifdef CONFIG_SYSFS - struct kset kset; - struct subsys_attribute user_attr; + struct kobject kobj; struct work_struct work; #endif #endif }; -#ifdef CONFIG_FAIR_USER_SCHED -extern int uids_kobject_init(void); -#else -static inline int uids_kobject_init(void) { return 0; } -#endif +extern int uids_sysfs_init(void); extern struct user_struct *find_user(uid_t); Index: current/kernel/ksysfs.c === --- current.orig/kernel/ksysfs.c +++ current/kernel/ksysfs.c @@ -89,12 +89,9 @@ static int __init ksysfs_init(void) error = sysfs_create_group(&kernel_subsys.kobj, &kernel_attr_group); - /* -* Create "/sys/kernel/uids" directory and corresponding root user's -* directory under it. -*/ + /* create the /sys/kernel/uids/ directory */ if (!error) - error = uids_kobject_init(); + error = uids_sysfs_init(); return error; } Index: current/kernel/user.c === --- current.orig/kernel/user.c +++ current/kernel/user.c @@ -118,7 +118,6 @@ static void sched_switch_user(struct tas #if defined(CONFIG_FAIR_USER_SCHED) && defined(CONFIG_SYSFS) -static struct kobject uids_kobject; /* represents /sys/kernel/uids directory */ static DEFINE_MUTEX(uids_mutex); static inline void uids_mutex_lock(void) @@ -131,83 +130,104 @@ static inline void uids_mutex_unlock(voi mutex_unlock(&uids_mutex); } -/* return cpu shares held by the user */ -ssize_t cpu_shares_show(struct kset *kset, char *buffer) +/* uid directory attributes */ +static ssize_t cpu_shares_show(struct kobject *kobj, + struct attribute *attr, + char *buf) { - struct user_struct *up = container_of(kset, struct user_struct, kset); + struct user_struct *up = container_of(kobj, struct user_struct, kobj); - return sprintf(buffer, "%lu\n", sched_group_shares(up->tg)); + return sprintf(buf, "%lu\n", sched_group_shares(up->tg)); } -/* modify cpu shares held by the user */ -ssize_t cpu_shares_store(struct kset *kset, const char *buffer, size_t size) +static ssize_t cpu_shares_store(struct kobject *kobj, + struct attribute *attr, + const char *buf, size_t size) { - struct user_struct *up = container_of(kset, struct user_struct, kset); + struct user_struct *up = container_of(kobj, struct user_struct, kobj); unsigned long shares; int rc; - sscanf(buffer, "%lu", &shares); + sscanf(buf, "%lu", &shares); rc = sched_group_set_shares(up->tg, shares); return (rc ? rc : size); } -static void user_attr_init(struct subsys_attribute *sa, char *name, int mode) -{ - sa->attr.name = name; - sa->attr.mode = mode; - sa->show = cpu_shares_show; - sa->store = cpu_shares_store; -} - -/* Create "/sys/kernel/uids/" directory and - * "/sys/kernel/uids//cpu_share" file for this user. - */ -static int user_kobject_create(struct user_struct *up) -{ - struct kset *kset = &up->kset; - struct kobject *kobj =
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote: > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: > > Ingo, can you look at this, please? > Vincent is getting oopses on 2.6.22.14-cfs-etch. > Hi, We are looking into this bug now. I believe that the patch at http://marc.info/?l=linux-kernel&m=119404922603293 should help. I am working with Kay to get this ported. Thanks, -- regards, Dhaval -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : Randy Dunlap [mailto:[EMAIL PROTECTED] > Envoyé : 10 décembre 2007 12:15 > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: > > Ingo, can you look at this, please? > Vincent is getting oopses on 2.6.22.14-cfs-etch. > > Vincent, did you apply the cfs patch or did Debian etch provide that? I did. http://linux-dev.qc.ec.gc.ca/ > If you applied it, did you use > http://people.redhat.com/mingo/cfs-scheduler/sched-cfs-v2.6.22.13-v24.patch > or a different patch? > I applied exactly that one.. and had already sent that info to ingo this morning since I presumed the CFS patchset could be involved in this by reagarding the more detailed output. Also note that CFS v24 on 2.6.21 does not produce the oops and I can run galaxy backups on the system without any problems. - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote: Ingo, can you look at this, please? Vincent is getting oopses on 2.6.22.14-cfs-etch. Vincent, did you apply the cfs patch or did Debian etch provide that? If you applied it, did you use http://people.redhat.com/mingo/cfs-scheduler/sched-cfs-v2.6.22.13-v24.patch or a different patch? > > -Message d'origine- > > De : [EMAIL PROTECTED] > > [mailto:[EMAIL PROTECTED] De la part de > > Fortier,Vincent [Montreal] > > Envoyé : 10 décembre 2007 08:21 > > À : Randy Dunlap; Andrew Morton > > Cc : linux-kernel@vger.kernel.org > > Objet : RE: 2.6.22.14 oops msg with commvault galaxy ? > > > > > -Message d'origine- > > > De : Randy Dunlap [mailto:[EMAIL PROTECTED] Envoyé : > > 7 décembre > > > 2007 20:15 > > > > > > On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote: > > > > > > > On Fri, 7 Dec 2007 14:15:36 -0800 > > > > Randy Dunlap <[EMAIL PROTECTED]> wrote: > > > > > > > > > > Help would really be appreciated. > > > > > > > > > > Let's try the last_sysfs_file (name) patch. > > > > > I've attempted to update it for 2.6.22.14. > > > > > Andrew, does this change in fs/sysfs/file.c look OK? > > > > > > > > umm, yup. > > > > > > > > > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r > > > > > > > > > c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch > > > > > > > > should work. > > > > > > Thanks. > > > I produced a cleanly applying version of it for 2.6.22.14. > > > > > > Vincent, please apply this patch so we can know which file in sysfs > > > these oopses are happening with. > > > > > > > It did not applied cleanly on a 2.6.22.14... copy/paste might > > be the issue here... Anyhow, I corrected the patch failure to > > apply and here is my version of it... Hoping I got this > > (attached patch). > > > > Compiling at the moment... will try this out with commvault > > 5.9 probably in the morning and get back with the results. > > > > Let me know I got the patch wrong. > > Here is the resulting trace... hoping this helps...: > > [ 942.107304] BUG: unable to handle kernel NULL pointer dereference at > virtual address 00c8 > [ 942.107339] printing eip: > [ 942.107354] c01a924c > [ 942.107368] *pdpt = 2d6b4001 > [ 942.107383] *pde = > [ 942.107401] Oops: [#1] > [ 942.107414] SMP > [ 942.107431] last sysfs file: /kernel/uids/104/cpu_share > [ 942.107449] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl > sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd > ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support e752x_edac > edac_mc psmouse floppy shpchp pci_hotplug serio_raw sr_mod pcspkr evdev cdrom > ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ehci_hcd > uhci_hcd usbcore ata_piix tg3 thermal processor fan mptscsih mptbase > megaraid_sas megaraid_mbox megaraid_mm cciss aacraid > [ 942.107675] CPU:0 > [ 942.107676] EIP:0060:[]Not tainted VLI > [ 942.107678] EFLAGS: 00010202 (2.6.22.14-cfs-etch-686-envcan #1) > [ 942.107730] EIP is at sysfs_open_file+0xae/0x21e > [ 942.107749] eax: ebx: f77783b8 ecx: dfb0b280 edx: 00c8 > [ 942.107769] esi: f7e0ce8c edi: c03fd5c0 ebp: c01a919e esp: f1257ed8 > [ 942.107789] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 > [ 942.107810] Process clBackup (pid: 5191, ti=f1256000 task=f71b3290 > task.ti=f1256000) > [ 942.107831] Stack: 1000 f295d240 ed4f4ac0 f7e0ce48 f295d240 ed4f4ac0 > f1257f30 c01a919e > [ 942.107878]c0170464 dfe76100 ed4f3880 f295d240 8000 f1257f30 > 0010 c0170595 > [ 942.107921]f295d240 c01705d6 f1257f30 > ed4f3880 dfe76100 > [ 942.107968] Call Trace: > [ 942.107998] [] sysfs_open_file+0x0/0x21e > [ 942.108017] [] __dentry_open+0xc1/0x178 > [ 942.108039] [] nameidata_to_filp+0x24/0x33 > [ 942.108063] [] do_filp_open+0x32/0x39 > [ 942.108088] [] get_unused_fd+0x4a/0xaa > [ 942.108112] [] do_sys_open+0x42/0xc3 > [ 942.108134] [] sys_open+0x1c/0x1e > [ 942.108155] [] syscall_call+0x7/0xb > [ 942.108179] === > [ 942.108194] Code: b8 c0 c5 3f c0 41 e8 e8 06 03 00 83 7c 24 0c 00 0f 84 72 > 01 00 00 85 f6 0f 84 6a 01 00 00 8b 56
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] De la part de > Fortier,Vincent [Montreal] > Envoyé : 10 décembre 2007 08:21 > À : Randy Dunlap; Andrew Morton > Cc : linux-kernel@vger.kernel.org > Objet : RE: 2.6.22.14 oops msg with commvault galaxy ? > > > -Message d'origine- > > De : Randy Dunlap [mailto:[EMAIL PROTECTED] Envoyé : > 7 décembre > > 2007 20:15 > > > > On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote: > > > > > On Fri, 7 Dec 2007 14:15:36 -0800 > > > Randy Dunlap <[EMAIL PROTECTED]> wrote: > > > > > > > > Help would really be appreciated. > > > > > > > > Let's try the last_sysfs_file (name) patch. > > > > I've attempted to update it for 2.6.22.14. > > > > Andrew, does this change in fs/sysfs/file.c look OK? > > > > > > umm, yup. > > > > > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r > > > > > > c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch > > > > > > should work. > > > > Thanks. > > I produced a cleanly applying version of it for 2.6.22.14. > > > > Vincent, please apply this patch so we can know which file in sysfs > > these oopses are happening with. > > > > It did not applied cleanly on a 2.6.22.14... copy/paste might > be the issue here... Anyhow, I corrected the patch failure to > apply and here is my version of it... Hoping I got this > (attached patch). > > Compiling at the moment... will try this out with commvault > 5.9 probably in the morning and get back with the results. > > Let me know I got the patch wrong. Here is the resulting trace... hoping this helps...: [ 942.107304] BUG: unable to handle kernel NULL pointer dereference at virtual address 00c8 [ 942.107339] printing eip: [ 942.107354] c01a924c [ 942.107368] *pdpt = 2d6b4001 [ 942.107383] *pde = [ 942.107401] Oops: [#1] [ 942.107414] SMP [ 942.107431] last sysfs file: /kernel/uids/104/cpu_share [ 942.107449] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support e752x_edac edac_mc psmouse floppy shpchp pci_hotplug serio_raw sr_mod pcspkr evdev cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ehci_hcd uhci_hcd usbcore ata_piix tg3 thermal processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm cciss aacraid [ 942.107675] CPU:0 [ 942.107676] EIP:0060:[]Not tainted VLI [ 942.107678] EFLAGS: 00010202 (2.6.22.14-cfs-etch-686-envcan #1) [ 942.107730] EIP is at sysfs_open_file+0xae/0x21e [ 942.107749] eax: ebx: f77783b8 ecx: dfb0b280 edx: 00c8 [ 942.107769] esi: f7e0ce8c edi: c03fd5c0 ebp: c01a919e esp: f1257ed8 [ 942.107789] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 [ 942.107810] Process clBackup (pid: 5191, ti=f1256000 task=f71b3290 task.ti=f1256000) [ 942.107831] Stack: 1000 f295d240 ed4f4ac0 f7e0ce48 f295d240 ed4f4ac0 f1257f30 c01a919e [ 942.107878]c0170464 dfe76100 ed4f3880 f295d240 8000 f1257f30 0010 c0170595 [ 942.107921]f295d240 c01705d6 f1257f30 ed4f3880 dfe76100 [ 942.107968] Call Trace: [ 942.107998] [] sysfs_open_file+0x0/0x21e [ 942.108017] [] __dentry_open+0xc1/0x178 [ 942.108039] [] nameidata_to_filp+0x24/0x33 [ 942.108063] [] do_filp_open+0x32/0x39 [ 942.108088] [] get_unused_fd+0x4a/0xaa [ 942.108112] [] do_sys_open+0x42/0xc3 [ 942.108134] [] sys_open+0x1c/0x1e [ 942.108155] [] syscall_call+0x7/0xb [ 942.108179] === [ 942.108194] Code: b8 c0 c5 3f c0 41 e8 e8 06 03 00 83 7c 24 0c 00 0f 84 72 01 00 00 85 f6 0f 84 6a 01 00 00 8b 56 04 85 d2 74 19 64 a1 08 50 3d c0 <83> 3a 02 0f 84 44 01 00 00 c1 e0 05 ff 84 10 20 01 00 00 8b 54 [ 942.108364] EIP: [] sysfs_open_file+0xae/0x21e SS:ESP 0068:f1257ed8 - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : Randy Dunlap [mailto:[EMAIL PROTECTED] > Envoyé : 7 décembre 2007 20:15 > > On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote: > > > On Fri, 7 Dec 2007 14:15:36 -0800 > > Randy Dunlap <[EMAIL PROTECTED]> wrote: > > > > > > Help would really be appreciated. > > > > > > Let's try the last_sysfs_file (name) patch. > > > I've attempted to update it for 2.6.22.14. > > > Andrew, does this change in fs/sysfs/file.c look OK? > > > > umm, yup. > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r > > > c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch > > > > should work. > > Thanks. > I produced a cleanly applying version of it for 2.6.22.14. > > Vincent, please apply this patch so we can know which file in > sysfs these oopses are happening with. > It did not applied cleanly on a 2.6.22.14... copy/paste might be the issue here... Anyhow, I corrected the patch failure to apply and here is my version of it... Hoping I got this (attached patch). Compiling at the moment... will try this out with commvault 5.9 probably in the morning and get back with the results. Let me know I got the patch wrong. - vin display_most-recently-opened_sysfs_file_name_when_oopsing.patch Description: display_most-recently-opened_sysfs_file_name_when_oopsing.patch
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote: > On Fri, 7 Dec 2007 14:15:36 -0800 > Randy Dunlap <[EMAIL PROTECTED]> wrote: > > > > Help would really be appreciated. > > > > Let's try the last_sysfs_file (name) patch. > > I've attempted to update it for 2.6.22.14. > > Andrew, does this change in fs/sysfs/file.c look OK? > > umm, yup. > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-rc6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch > > should work. Thanks. I produced a cleanly applying version of it for 2.6.22.14. Vincent, please apply this patch so we can know which file in sysfs these oopses are happening with. --- From: Andrew Morton <[EMAIL PROTECTED]> Display the most-recently-opened sysfs file's name when oopsing. From: Adrian Bunk <[EMAIL PROTECTED]> Build fix From: Greg Kroah-Hartman <[EMAIL PROTECTED]> Modified to make the api call cleaner, and available to all arches if need be. Also added it to x86-64's crash dump message. Signed-off-by: Adrian Bunk <[EMAIL PROTECTED]> Signed-off-by: Andrew Morton <[EMAIL PROTECTED]> Signed-off-by: Greg Kroah-Hartman <[EMAIL PROTECTED]> --- arch/i386/kernel/traps.c |1 + arch/x86_64/kernel/traps.c |1 + fs/sysfs/file.c| 14 ++ include/linux/sysfs.h |6 ++ 4 files changed, 22 insertions(+) --- linux-2.6.22.14.orig/arch/i386/kernel/traps.c +++ linux-2.6.22.14/arch/i386/kernel/traps.c @@ -411,6 +411,7 @@ void die(const char * str, struct pt_reg #endif if (nl) printk("\n"); + sysfs_printk_last_file(); if (notify_die(DIE_OOPS, str, regs, err, current->thread.trap_no, SIGSEGV) != NOTIFY_STOP) { --- linux-2.6.22.14.orig/arch/x86_64/kernel/traps.c +++ linux-2.6.22.14/arch/x86_64/kernel/traps.c @@ -516,6 +516,7 @@ void __kprobes __die(const char * str, s printk("DEBUG_PAGEALLOC"); #endif printk("\n"); + sysfs_printk_last_file(); notify_die(DIE_OOPS, str, regs, err, current->thread.trap_no, SIGSEGV); show_registers(regs); /* Executive summary in case the oops scrolled away */ --- linux-2.6.22.14.orig/fs/sysfs/file.c +++ linux-2.6.22.14/fs/sysfs/file.c @@ -8,6 +8,7 @@ #include #include #include +#include #include #include @@ -15,6 +16,13 @@ #define to_sattr(a) container_of(a,struct subsys_attribute, attr) +/* used in crash dumps to help with debugging */ +static char last_sysfs_file[PATH_MAX]; +void sysfs_printk_last_file(void) +{ + printk(KERN_EMERG "last sysfs file: %s\n", last_sysfs_file); +} + /* * Subsystem file operations. * These operations allow subsystems to have files that can be @@ -253,6 +261,12 @@ static int sysfs_open_file(struct inode struct sysfs_buffer * buffer; struct sysfs_ops * ops = NULL; int error = 0; + char *p; + + p = d_path(file->f_dentry, sysfs_mount, last_sysfs_file, + sizeof(last_sysfs_file)); + if (p) + memmove(last_sysfs_file, p, strlen(p) + 1); if (!kobj || !attr) goto Einval; --- linux-2.6.22.14.orig/include/linux/sysfs.h +++ linux-2.6.22.14/include/linux/sysfs.h @@ -125,6 +125,7 @@ void sysfs_remove_file_from_group(struct const struct attribute *attr, const char *group); void sysfs_notify(struct kobject * k, char *dir, char *attr); +void sysfs_printk_last_file(void); extern int sysfs_make_shadowed_dir(struct kobject *kobj, @@ -240,6 +241,11 @@ static inline int __must_check sysfs_ini return 0; } +static inline void sysfs_printk_last_file(void) +{ + ; +} + #endif /* CONFIG_SYSFS */ #endif /* _SYSFS_H_ */ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Fri, 7 Dec 2007 14:15:36 -0800 Randy Dunlap <[EMAIL PROTECTED]> wrote: > > Help would really be appreciated. > > Let's try the last_sysfs_file (name) patch. > I've attempted to update it for 2.6.22.14. > Andrew, does this change in fs/sysfs/file.c look OK? umm, yup. ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-rc6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch should work. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Tue, 04 Dec 2007 13:47:01 + Vincent Fortier wrote: > Le vendredi 30 novembre 2007 à 12:35 -0500, Fortier,Vincent [Montreal] a > écrit : > > > -Message d'origine- > > > De : Randy Dunlap [mailto:[EMAIL PROTECTED] > > > Envoyé : 30 novembre 2007 12:13 > > > > > > On Fri, 30 Nov 2007 13:02:54 + Vincent Fortier wrote: > > > > > > > Hi all, > > > > > > > > I'm using a 2.6.22.14 + CFS v24 and I got theses errors > > > when starting > > > > up my commvault galaxy client... Do anybody know what this could mean? > > > > > > Can you provide a few lines of syslog before the Oops: line, > > > which should contain some info about what happened, e.g.: > > > > > > Unable to handle kernel paging request at virtual address > > > e4a85017 printing eip: > > > c01d915a > > > *pde = 37d0d067 > > > *pte = > > > > I've umounted the XFS/DRBD filesystem/container (tought it might have > been related?) but it did not helped... still getting the same kernel > oops. > > [1097523.808915] BUG: unable to handle kernel paging request at virtual > address 8000 > [1097523.808950] printing eip: > [1097523.808963] c01d915a > [1097523.808977] *pdpt = 220ea001 > [1097523.808992] *pde = > [1097523.809009] Oops: [#27] > [1097523.809023] SMP > [1097523.809040] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd > nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse > ide_cd ide_generic usbkbd usbmouse tsdev serio_raw sg psmouse iTCO_wdt > iTCO_vendor_support floppy e752x_edac sr_mod pcspkr evdev edac_mc shpchp > pci_hotplug cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic > piix ide_core ehci_hcd uhci_hcd tg3 ata_piix usbcore thermal processor > fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm cciss > aacraid > [1097523.809266] CPU:0 > [1097523.809268] EIP:0060:[]Not tainted VLI > [1097523.809269] EFLAGS: 00010297 (2.6.22.14-cfs-etch-686-envcan #1) > [1097523.809323] EIP is at vsnprintf+0x2af/0x48c > [1097523.809341] eax: 8000 ebx: ecx: 8000 edx: > fffe > [1097523.809361] esi: d89c6017 edi: dd1ffeac ebp: esp: > dd1ffe4c > [1097523.809382] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 > [1097523.809403] Process clBackup (pid: 30311, ti=dd1fe000 task=f7043290 > task.ti=dd1fe000) > [1097523.809423] Stack: dbc2a000 1000 c033b638 f89e056c c02360f1 > dbc2a000 27639fe8 d89c6017 > [1097523.809469]008e2408 > c0337eab 0003 0017 > [1097523.809512]c037a340 dbc2a000 c01d93b8 dd1ffeac dd1ffeac > c023566c d89c6017 c0337eaa > [1097523.809559] Call Trace: > [1097523.809588] [] dev_uevent+0x189/0x1e0 > [1097523.809614] [] sprintf+0x20/0x23 > [1097523.809635] [] show_uevent+0xad/0xd5 > [1097523.809658] [] get_page_from_freelist+0x273/0x30a > [1097523.809686] [] group_send_sig_info+0x12/0x56 > [1097523.809711] [] __alloc_pages+0x52/0x286 > [1097523.809734] [] show_uevent+0x0/0xd5 > [1097523.809754] [] dev_attr_show+0x15/0x18 > [1097523.809775] [] sysfs_read_file+0x87/0xd8 > [1097523.809796] [] sys_getxattr+0x46/0x4e > [1097523.809818] [] sysfs_read_file+0x0/0xd8 > [1097523.809839] [] vfs_read+0xa6/0x128 > [1097523.809861] [] sys_read+0x41/0x67 > [1097523.809881] [] syscall_call+0x7/0xb > [1097523.809906] === > [1097523.809921] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 > 00 00 00 8b 0f b8 39 0a 33 c0 8b 54 24 30 81 f9 ff 0f 00 00 0f 46 c8 89 > c8 eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 2c 10 89 > c3 > [1097523.810088] EIP: [] vsnprintf+0x2af/0x48c SS:ESP > 0068:dd1ffe4c > > Help would really be appreciated. Let's try the last_sysfs_file (name) patch. I've attempted to update it for 2.6.22.14. Andrew, does this change in fs/sysfs/file.c look OK? --- ~Randy From: Randy Dunlap <[EMAIL PROTECTED]> Record last_sysfs_file name to print during oopsen so that we can have a clue. Signed-off-by: Randy Dunlap <[EMAIL PROTECTED]> --- arch/i386/kernel/traps.c |6 ++ fs/sysfs/file.c |6 ++ 2 files changed, 12 insertions(+) --- linux-2.6.22.14.orig/arch/i386/kernel/traps.c +++ linux-2.6.22.14/arch/i386/kernel/traps.c @@ -411,6 +411,12 @@ void die(const char * str, struct pt_reg #endif if (nl) printk("\n"); + { + extern char last_sysfs_file[]; + + printk(KERN_ALERT "last sysfs file: %s\n", + last_sysfs_file); + } if (notify_die(DIE_OOPS, str, regs, err, current->thread.trap_no, SIGSEGV) != NOTIFY_STOP) { --- linux-2.6.22.14.orig/fs/sysfs/file.c +++ linux-2.6.22.14/fs/sysfs/file.c @@ -8,6 +8,7 @@ #include #include #include +#include #include #include @@ -245,6 +246,8 @@ out: return len;
RE: 2.6.22.14 oops msg with commvault galaxy ?
Le vendredi 30 novembre 2007 à 12:35 -0500, Fortier,Vincent [Montreal] a écrit : > > -Message d'origine- > > De : Randy Dunlap [mailto:[EMAIL PROTECTED] > > Envoyé : 30 novembre 2007 12:13 > > > > On Fri, 30 Nov 2007 13:02:54 + Vincent Fortier wrote: > > > > > Hi all, > > > > > > I'm using a 2.6.22.14 + CFS v24 and I got theses errors > > when starting > > > up my commvault galaxy client... Do anybody know what this could mean? > > > > Can you provide a few lines of syslog before the Oops: line, > > which should contain some info about what happened, e.g.: > > > > Unable to handle kernel paging request at virtual address > > e4a85017 printing eip: > > c01d915a > > *pde = 37d0d067 > > *pte = > I've umounted the XFS/DRBD filesystem/container (tought it might have been related?) but it did not helped... still getting the same kernel oops. [1097523.808915] BUG: unable to handle kernel paging request at virtual address 8000 [1097523.808950] printing eip: [1097523.808963] c01d915a [1097523.808977] *pdpt = 220ea001 [1097523.808992] *pde = [1097523.809009] Oops: [#27] [1097523.809023] SMP [1097523.809040] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd ide_generic usbkbd usbmouse tsdev serio_raw sg psmouse iTCO_wdt iTCO_vendor_support floppy e752x_edac sr_mod pcspkr evdev edac_mc shpchp pci_hotplug cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ehci_hcd uhci_hcd tg3 ata_piix usbcore thermal processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm cciss aacraid [1097523.809266] CPU:0 [1097523.809268] EIP:0060:[]Not tainted VLI [1097523.809269] EFLAGS: 00010297 (2.6.22.14-cfs-etch-686-envcan #1) [1097523.809323] EIP is at vsnprintf+0x2af/0x48c [1097523.809341] eax: 8000 ebx: ecx: 8000 edx: fffe [1097523.809361] esi: d89c6017 edi: dd1ffeac ebp: esp: dd1ffe4c [1097523.809382] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 [1097523.809403] Process clBackup (pid: 30311, ti=dd1fe000 task=f7043290 task.ti=dd1fe000) [1097523.809423] Stack: dbc2a000 1000 c033b638 f89e056c c02360f1 dbc2a000 27639fe8 d89c6017 [1097523.809469]008e2408 c0337eab 0003 0017 [1097523.809512]c037a340 dbc2a000 c01d93b8 dd1ffeac dd1ffeac c023566c d89c6017 c0337eaa [1097523.809559] Call Trace: [1097523.809588] [] dev_uevent+0x189/0x1e0 [1097523.809614] [] sprintf+0x20/0x23 [1097523.809635] [] show_uevent+0xad/0xd5 [1097523.809658] [] get_page_from_freelist+0x273/0x30a [1097523.809686] [] group_send_sig_info+0x12/0x56 [1097523.809711] [] __alloc_pages+0x52/0x286 [1097523.809734] [] show_uevent+0x0/0xd5 [1097523.809754] [] dev_attr_show+0x15/0x18 [1097523.809775] [] sysfs_read_file+0x87/0xd8 [1097523.809796] [] sys_getxattr+0x46/0x4e [1097523.809818] [] sysfs_read_file+0x0/0xd8 [1097523.809839] [] vfs_read+0xa6/0x128 [1097523.809861] [] sys_read+0x41/0x67 [1097523.809881] [] syscall_call+0x7/0xb [1097523.809906] === [1097523.809921] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 00 00 8b 0f b8 39 0a 33 c0 8b 54 24 30 81 f9 ff 0f 00 00 0f 46 c8 89 c8 eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 2c 10 89 c3 [1097523.810088] EIP: [] vsnprintf+0x2af/0x48c SS:ESP 0068:dd1ffe4c Help would really be appreciated. - vin -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
RE: 2.6.22.14 oops msg with commvault galaxy ?
> -Message d'origine- > De : Randy Dunlap [mailto:[EMAIL PROTECTED] > Envoyé : 30 novembre 2007 12:13 > > On Fri, 30 Nov 2007 13:02:54 + Vincent Fortier wrote: > > > Hi all, > > > > I'm using a 2.6.22.14 + CFS v24 and I got theses errors > when starting > > up my commvault galaxy client... Do anybody know what this could mean? > > Can you provide a few lines of syslog before the Oops: line, > which should contain some info about what happened, e.g.: > > Unable to handle kernel paging request at virtual address > e4a85017 printing eip: > c01d915a > *pde = 37d0d067 > *pte = Would this be better? [766535.379600] BUG: unable to handle kernel NULL pointer dereference at virtual address 00c8 [766535.379636] printing eip: [766535.379652] c01a920c [766535.379665] *pdpt = 1cc2c001 [766535.379681] *pde = [766535.379698] Oops: [#1] [766535.379713] SMP [766535.379729] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support psmouse e752x_edac shpchp serio_raw edac_mc pcspkr evdev sr_mod pci_hotplug floppy cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ehci_hcd uhci_hcd ata_piix tg3 usbcore thermal processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm cciss aacraid [766535.379956] CPU:0 [766535.379957] EIP:0060:[]Not tainted VLI [766535.379959] EFLAGS: 00010202 (2.6.22.14-cfs-etch-686-envcan #1) [766535.380011] EIP is at sysfs_open_file+0x78/0x1e4 [766535.380028] eax: ebx: f7f02e58 ecx: 000d edx: 00c8 [766535.380049] esi: f7e7ec8c edi: defadf30 ebp: c01a9194 esp: defadedc [766535.380070] ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 [766535.380091] Process clBackup (pid: 22085, ti=defac000 task=de01ac60 task.ti=defac000) [766535.380110] Stack: de093300 dd2ce408 f7e7ec48 de093300 dd2ce408 defadf30 c01a9194 c017048c [766535.380155]dfe8a180 dd2cbd48 de093300 8000 defadf30 000d c01705bd de093300 [766535.380202] c01705fe defadf30 dd2cbd48 dfe8a180 e0bd6f00 [766535.380246] Call Trace: [766535.380276] [] sysfs_open_file+0x0/0x1e4 [766535.380296] [] __dentry_open+0xc1/0x178 [766535.380321] [] nameidata_to_filp+0x24/0x33 [766535.380343] [] do_filp_open+0x32/0x39 [766535.380367] [] get_unused_fd+0x4a/0xaa [766535.380390] [] do_sys_open+0x42/0xc3 [766535.380413] [] sys_open+0x1c/0x1e [766535.380434] [] syscall_call+0x7/0xb [766535.380460] === [766535.380476] Code: 14 24 83 7c 24 08 00 8b 42 0c 8b 40 54 8b 70 14 0f 84 70 01 00 00 85 f6 0f 84 68 01 00 00 8b 56 04 85 d2 74 19 64 a1 08 50 3d c0 <83> 3a 02 0f 84 42 01 00 00 c1 e0 05 ff 84 10 20 01 00 00 8b 54 [766535.380644] EIP: [] sysfs_open_file+0x78/0x1e4 SS:ESP 0068:defadedc Again, > > > > thnx very much! > - vin - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: 2.6.22.14 oops msg with commvault galaxy ?
On Fri, 30 Nov 2007 13:02:54 + Vincent Fortier wrote: > Hi all, > > I'm using a 2.6.22.14 + CFS v24 and I got theses errors when starting up > my commvault galaxy client... Do anybody know what this could mean? Can you provide a few lines of syslog before the Oops: line, which should contain some info about what happened, e.g.: Unable to handle kernel paging request at virtual address e4a85017 printing eip: c01d915a *pde = 37d0d067 *pte = > Message from [EMAIL PROTECTED] at Fri Nov 30 12:54:57 2007 ... > printemps kernel: [750078.538268] Oops: [#1] > printemps kernel: [750078.538284] SMP > printemps kernel: [750078.538528] CPU:2 > printemps kernel: [750078.538529] EIP:0060:[]Not > tainted VLI > printemps kernel: [750078.538530] EFLAGS: 00010297 > (2.6.22.14-cfs-etch-686-envcan #1) > printemps kernel: [750078.538580] EIP is at vsnprintf+0x2af/0x48c > printemps kernel: [750078.538597] eax: 8000 ebx: ecx: > 8000 edx: fffe > printemps kernel: [750078.538618] esi: e4a85017 edi: cf07feac ebp: > esp: cf07fe4c > printemps kernel: [750078.538637] ds: 007b es: 007b fs: 00d8 gs: > 0033 ss: 0068 > printemps kernel: [750078.538656] Process clBackup (pid: 29277, > ti=cf07e000 task=f6d9f8c0 task.ti=cf07e000) > printemps kernel: [750078.538676] Stack: e4834000 1000 c033b638 > f89e056c c02360f1 e4834000 1b57afe8 e4a85017 > printemps kernel: [750078.538721]00ef2608 > c0337eab 0003 0017 > printemps kernel: [750078.538767]c037a340 e4834000 c01d93b8 > cf07feac cf07feac c023566c e4a85017 c0337eaa > printemps kernel: [750078.538810] Call Trace: > printemps kernel: [750078.538839] [] dev_uevent+0x189/0x1e0 > printemps kernel: [750078.538864] [] sprintf+0x20/0x23 > printemps kernel: [750078.538885] [] show_uevent+0xad/0xd5 > printemps kernel: [750078.538907] [] get_page_from_freelist > +0x273/0x30a > printemps kernel: [750078.538933] [] group_send_sig_info > +0x12/0x56 > printemps kernel: [750078.538956] [] __alloc_pages+0x52/0x286 > printemps kernel: [750078.538984] [] show_uevent+0x0/0xd5 > printemps kernel: [750078.539006] [] dev_attr_show+0x15/0x18 > printemps kernel: [750078.539027] [] sysfs_read_file > +0x87/0xd8 > printemps kernel: [750078.539048] [] sys_getxattr+0x46/0x4e > printemps kernel: [750078.539071] [] sysfs_read_file+0x0/0xd8 > printemps kernel: [750078.539092] [] vfs_read+0xa6/0x128 > printemps kernel: [750078.539115] [] sys_read+0x41/0x67 > printemps kernel: [750078.539137] [] syscall_call+0x7/0xb > printemps kernel: [750078.539162] === > printemps kernel: [750078.539177] Code: 74 24 28 73 03 c6 06 20 4d 46 85 > ed 7f f1 e9 b9 00 00 00 8b 0f b8 39 0a 33 c0 8b 54 24 30 81 f9 ff 0f 00 > 00 0f 46 c8 89 c8 eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 > 44 24 2c 10 89 c3 > printemps kernel: [750078.539346] EIP: [] vsnprintf > +0x2af/0x48c SS:ESP 0068:cf07fe4c > > > thnx very much! --- ~Randy - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/