On 11/03/15 16:02, Greg Kroah-Hartman wrote:
> On Wed, Mar 11, 2015 at 03:59:36PM +0000, Brian Russell wrote:
>>
>>
>> On 11/03/15 15:43, Greg Kroah-Hartman wrote:
>>> On Wed, Mar 11, 2015 at 03:31:42PM +0000, Brian Russell wrote:
>>>> Protect uio driver from crashing if its owner is hot unplugged while there
>>>> are open fds.
>>>> Signed-off-by: Brian Russell <bruss...@brocade.com>
>>>
>>> Minor nit, you need a blank line before your s-o-b: line.
>>>
>>
>> Ack.
>>
>>>
>>>
>>>> ---
>>>>  drivers/uio/uio.c | 8 +++++++-
>>>>  1 file changed, 7 insertions(+), 1 deletion(-)
>>>>
>>>> diff --git a/drivers/uio/uio.c b/drivers/uio/uio.c
>>>> index 6276f13..70ce015 100644
>>>> --- a/drivers/uio/uio.c
>>>> +++ b/drivers/uio/uio.c
>>>> @@ -434,9 +434,11 @@ static int uio_open(struct inode *inode, struct file 
>>>> *filep)
>>>>            goto out;
>>>>    }
>>>>  
>>>> +  get_device(idev);
>>>
>>> What is the real oops caused when a device is removed?  Protecting this
>>> with a reference count seems ok, but it seems "heavy".
>>>
>>
>> I'm seeing it with PCI hotplug. The PCI subsystem calls remove and the
>> owner module in turn calls uio_unregister_device while app stil has
>> open fds.
> 
> Sorry, I meant, what exactly is the oops message, with the callback?
> What portion of code is crashing because we have an open fd?  The pci
> remove path of the UIO core should be fixed to handle this properly.
> Not to say that your patch isn't correct, just want to see the crash to
> know for sure.
> 

Ah, I see, sorry:

 [  168.890968] BUG: unable to handle kernel paging request at ffff8800b2fb7e70
[  168.893141] IP: [<ffffffff810b5acc>] module_put+0xc/0x20
[  168.894076] PGD 1bc8067 PUD 0 
[  168.894679] Oops: 0002 [#1] SMP 
[  168.895322] Modules linked in: igb_uio(O) xfrm_user xfrm_algo l2tp_ip6 
l2tp_ip l2tp_eth l2tp_netlink l2tp_core tun uio cpufreq_userspace 
cpufreq_powersave cpufreq_ondemand cpufreq_conservative ipv6 crc32_pclmul 
microcode aesni_intel aes_x86_64 lrw gf128mul glue_helper serio_raw ablk_helper 
ghash_clmulni_intel intel_agp intel_gtt psmouse virtio_console processor 
agpgart cryptd evdev button i2c_piix4 i2c_core pcspkr thermal_sys 
virtio_balloon usb_storage ohci_hcd squashfs loop hid_generic usbhid hid 
pata_acpi ata_generic virtio_blk virtio_net floppy ata_piix virtio_pci 
virtio_ring virtio crc32c_intel [last unloaded: igb_uio]
[  168.900849] CPU: 0 PID: 4494 Comm: dataplane Tainted: G        W  O 
3.14.33-1-amd64-vyatta #1
[  168.900849] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 
1.7.5-20140531_083030-gandalf 04/01/2014
[  168.900849] task: ffff880036bb60b0 ti: ffff880036956000 task.ti: 
ffff880036956000
[  168.900849] RIP: 0010:[<ffffffff810b5acc>]  [<ffffffff810b5acc>] 
module_put+0xc/0x20
[  168.900849] RSP: 0018:ffff880036957ea0  EFLAGS: 00010282
[  168.900849] RAX: 00000000333b7e68 RBX: ffff880036d61ce0 RCX: 0000000000000001
[  168.900849] RDX: 0000000000000000 RSI: ffff880079c92800 RDI: ffff880078dfbb98
[  168.900849] RBP: ffff880078dfbb98 R08: 0000000000000000 R09: 0000000000000000
[  168.900849] R10: ffffffff8110da65 R11: 0000000000000001 R12: 0000000000000000
[  168.900849] R13: ffff88004be4c540 R14: ffff88007c8957a0 R15: ffff880079c92810
[  168.900849] FS:  00007fc28b9f7700(0000) GS:ffff88007fc00000(0000) 
knlGS:0000000000000000
[  168.900849] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  168.900849] CR2: ffff8800b2fb7e70 CR3: 00000000368f9000 CR4: 00000000000406f0
[  168.900849] Stack:
[  168.900849]  ffffffffa0191383 ffff880079c92800 0000000000000008 
ffff88004fec9c30
[  168.900849]  ffffffff811431e8 ffffffff81082810 0000000000000000 
ffffffff81ac0a50
[  168.900849]  ffff880036bb6ba0 ffff880036bb60b0 0000000001a8dc80 
0000000000000003
[  168.900849] Call Trace:
[  168.900849]  [<ffffffffa0191383>] ? uio_release+0x43/0x70 [uio]
[  168.900849]  [<ffffffff811431e8>] ? __fput+0xc8/0x230
[  168.900849]  [<ffffffff81082810>] ? sched_clock_cpu+0x90/0xc0
[  168.900849]  [<ffffffff810707c7>] ? task_work_run+0x97/0xd0
[  168.900849]  [<ffffffff8100ceaa>] ? do_notify_resume+0x8a/0xb0
[  168.900849]  [<ffffffff814d5272>] ? int_signal+0x12/0x17
[  168.900849] Code: 48 89 de 48 c7 c7 c0 5b 70 81 31 c0 e8 8e 61 41 00 eb d9 
66 66 66 2e 0f 1f 84 00 00 00 00 00 48 85 ff 74 0c 48 8b 87 28 02 00 00 <65> 48 
ff 40 08 f3 c3 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 41 
[  168.900849] RIP  [<ffffffff810b5acc>] module_put+0xc/0x20
[  168.900849]  RSP <ffff880036957ea0>
[  168.900849] CR2: ffff8800b2fb7e70
[  168.900849] ---[ end trace 20f273e64b20b382 ]---
[  168.900849] Kernel panic - not syncing: Fatal exception
[  168.900849] Kernel Offset: 0x0 from 0xffffffff81000000 (relocation range: 
0xffffffff80000000-0xffffffff9fffffff)
[  168.900849] Rebooting in 60 seconds..



> thanks,
> 
> greg k-h
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to