Re: kernel 3.14.2 oops: seems related to EFI
On 05/20/2014 01:54 PM, Matt Fleming wrote: > On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote: >> >> I don't know, I can't really afford to configure/compile/test this new >> kernel, sorry. > > It would be useful to know whether this issue still occurs when booting > with the efi=old_map kernel parameter. > the bug triggered: [ +0.002872] BUG: unable to handle kernel paging request at fffefd4a1e60 [ +0.66] IP: [] virt_efi_get_variable+0x48/0x80 [ +0.54] PGD 280f067 PUD 0 [ +0.31] Oops: [#1] PREEMPT SMP [ +0.39] Modules linked in: tun ses enclosure usb_storage loop fuse joydev coretemp hwmon arc4 nls_iso8859_1 nls_c [ +0.000691] ac ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache sd_mod sr_mod crc_t10dif cdrom crct10dif_common [ +0.000289] CPU: 7 PID: 23293 Comm: systemd-udevd Tainted: GW 3.14.4-1-ARCH #1 [ +0.57] Hardware name: CLEVO CO.W55xEU /W55xEU [ +0.87] task: 88039557bae0 ti: 8802de764000 task.ti: 8802de764000 [ +0.50] RIP: 0010:[] [] virt_efi_get_variable+0x48/0x80 [ +0.64] RSP: 0018:8802de765e58 EFLAGS: 00010082 [ +0.37] RAX: fffefd4a1e18 RBX: 8800da88f000 RCX: [ +0.48] RDX: 8800da88f400 RSI: 8800da88f000 RDI: [ +0.48] RBP: 8802de765e80 R08: 8802de765ec0 R09: [ +0.47] R10: R11: 0246 R12: 8800da88f400 [ +0.48] R13: R14: 8802de765ec0 R15: [ +0.48] FS: 7f10751057c0() GS:88041e3c() knlGS: [ +0.54] CS: 0010 DS: ES: CR0: 80050033 [ +0.40] CR2: fffefd4a1e60 CR3: 0003c4afa000 CR4: 001407e0 [ +0.48] Stack: [ +0.16] 8800da88f000 8802de765ec0 81b27c20 8802de765f48 [ +0.60] 3bc93ec9a0004bba 8802de765ea8 813dbc91 8800da88f000 [ +0.60] 7fffdc30c104 0004 8802de765ef8 81245779 [ +0.60] Call Trace: [ +0.25] [] efivar_entry_size+0x41/0x80 [ +0.44] [] efivarfs_file_read+0x49/0x100 [ +0.44] [] vfs_read+0x97/0x160 [ +0.37] [] SyS_read+0x59/0xd0 [ +0.39] [] system_call_fastpath+0x16/0x1b [ +0.41] Code: ce 4d 89 c7 e8 9a 06 00 00 65 ff 04 25 a0 c7 00 00 48 8b 05 1b d4 86 00 4d 89 f9 4d 89 f0 4c 89 e9 [ +0.000335] RIP [] virt_efi_get_variable+0x48/0x80 [ +0.49] RSP [ +0.26] CR2: fffefd4a1e60 [ +0.016781] ---[ end trace 5a7017feeac75345 ]--- the sad thing is tht my system can't shutdown properly when it happens. -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: kernel 3.14.2 oops: seems related to EFI
On 05/20/2014 01:54 PM, Matt Fleming wrote: > On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote: >> >> I don't know, I can't really afford to configure/compile/test this new >> kernel, sorry. > > It would be useful to know whether this issue still occurs when booting > with the efi=old_map kernel parameter. > ok I can try to boot with that parameter and see if the issue happens again. Unfortunately if it doesn't, we couldn't tell. Thanks -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: kernel 3.14.2 oops: seems related to EFI
On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote: > > I don't know, I can't really afford to configure/compile/test this new > kernel, sorry. It would be useful to know whether this issue still occurs when booting with the efi=old_map kernel parameter. -- Matt Fleming, Intel Open Source Technology Center -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: kernel 3.14.2 oops: seems related to EFI
On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote: > On 05/18/2014 03:42 PM, Borislav Petkov wrote: > > On Sat, May 17, 2014 at 05:25:47PM +0200, Francis Moreau wrote: > >> [ +0.018677] general protection fault: [#1] PREEMPT SMP > >> [ +0.68] Modules linked in: usb_storage tun raid1 md_mod loop fuse > >> joydev coretemp hwmon arc4 intel_rapl x86_pkg_temp_thermal > >> intel_powerclamp kvm_intel nls_iso8859_1 nls_cp437 iTCO_wdt kvm vfat fat > >> iTCO_vendor_support iwldvm uvcvideo led_class crct10dif_pclmul > >> crc32_pclmul crc32c_intel ghash_clmulni_intel mac80211 videobuf2_vmalloc > >> videobuf2_memops videobuf2_core aesni_intel videodev aes_x86_64 > >> snd_hda_codec_hdmi lrw gf128mul mousedev glue_helper btusb > >> snd_hda_codec_via ablk_helper media cryptd iwlwifi snd_hda_codec_generic > >> bluetooth psmouse microcode i2c_i801 serio_raw cfg80211 6lowpan_iphc > >> rtsx_pci_ms r8169 memstick rfkill lpc_ich mii snd_hda_intel > >> snd_hda_codec thermal snd_hwdep wmi snd_pcm tpm_infineon snd_timer > >> tpm_tis mei_me snd tpm mei shpchp evdev soundcore processor battery > >> mac_hid ac > >> [ +0.000803] ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache > >> sd_mod sr_mod crc_t10dif cdrom crct10dif_common rtsx_pci_sdmmc mmc_core > >> atkbd libps2 ahci libahci ehci_pci libata xhci_hcd ehci_hcd scsi_mod > >> rtsx_pci usbcore usb_common i8042 serio i915 video button intel_gtt > >> i2c_algo_bit drm_kms_helper drm i2c_core > >> [ +0.000328] CPU: 0 PID: 30835 Comm: systemd-udevd Not tainted > >> 3.14.2-1-ARCH #1 > >> [ +0.64] Hardware name: CLEVO CO.W55xEU > >> /W55xEU , BIOS 4.6.5 > >> 03/05/2013 > >> [ +0.000102] task: 880405ee6bf0 ti: 880400f4a000 task.ti: > >> 880400f4a000 > >> [ +0.60] RIP: 0010:[] [] > >> efi_call5+0x6f/0xf0 > >> [ +0.71] RSP: 0018:880400f4bdb0 EFLAGS: 00010002 > >> [ +0.45] RAX: 80050033 RBX: 8804040e3000 RCX: > >> 8804040e3000 > >> [ +0.55] RDX: 8804040e3400 RSI: 8804040e3000 RDI: > >> bff7f7af > > > > So you get a #GP while executing call *rdi and %rdi is supposed to > > contain ->get_variable. But instead it contains some very funky shit: > > > > 0xbff7f7af > > > > Who made it contain that nuisance of a pointer which thinks it is > > ->get_variable, huh? If only I could get my hands on that guy! :-P > > > > Ok, seriously, how reproducible is this? > > I don't really know how to reproduce this, I only can say that it > usually happens while partitioning the loop device or perhaps when the > kernel reads the partition table afterwards. It looks like it's oopsing as a result of systemd-udevd trying to read a variable via the efivarfs mount, Call Trace: [] ? virt_efi_get_variable+0x51/0x80 [] efivar_entry_size+0x41/0x80 [] efivarfs_file_read+0x49/0x100 [] vfs_read+0x97/0x160 [] SyS_read+0x59/0xd0 [] system_call_fastpath+0x16/0x1b -- Matt Fleming, Intel Open Source Technology Center -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: kernel 3.14.2 oops: seems related to EFI
On 05/18/2014 03:42 PM, Borislav Petkov wrote: > On Sat, May 17, 2014 at 05:25:47PM +0200, Francis Moreau wrote: >> [ +0.018677] general protection fault: [#1] PREEMPT SMP >> [ +0.68] Modules linked in: usb_storage tun raid1 md_mod loop fuse >> joydev coretemp hwmon arc4 intel_rapl x86_pkg_temp_thermal >> intel_powerclamp kvm_intel nls_iso8859_1 nls_cp437 iTCO_wdt kvm vfat fat >> iTCO_vendor_support iwldvm uvcvideo led_class crct10dif_pclmul >> crc32_pclmul crc32c_intel ghash_clmulni_intel mac80211 videobuf2_vmalloc >> videobuf2_memops videobuf2_core aesni_intel videodev aes_x86_64 >> snd_hda_codec_hdmi lrw gf128mul mousedev glue_helper btusb >> snd_hda_codec_via ablk_helper media cryptd iwlwifi snd_hda_codec_generic >> bluetooth psmouse microcode i2c_i801 serio_raw cfg80211 6lowpan_iphc >> rtsx_pci_ms r8169 memstick rfkill lpc_ich mii snd_hda_intel >> snd_hda_codec thermal snd_hwdep wmi snd_pcm tpm_infineon snd_timer >> tpm_tis mei_me snd tpm mei shpchp evdev soundcore processor battery >> mac_hid ac >> [ +0.000803] ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache >> sd_mod sr_mod crc_t10dif cdrom crct10dif_common rtsx_pci_sdmmc mmc_core >> atkbd libps2 ahci libahci ehci_pci libata xhci_hcd ehci_hcd scsi_mod >> rtsx_pci usbcore usb_common i8042 serio i915 video button intel_gtt >> i2c_algo_bit drm_kms_helper drm i2c_core >> [ +0.000328] CPU: 0 PID: 30835 Comm: systemd-udevd Not tainted >> 3.14.2-1-ARCH #1 >> [ +0.64] Hardware name: CLEVO CO.W55xEU >> /W55xEU , BIOS 4.6.5 >> 03/05/2013 >> [ +0.000102] task: 880405ee6bf0 ti: 880400f4a000 task.ti: >> 880400f4a000 >> [ +0.60] RIP: 0010:[] [] >> efi_call5+0x6f/0xf0 >> [ +0.71] RSP: 0018:880400f4bdb0 EFLAGS: 00010002 >> [ +0.45] RAX: 80050033 RBX: 8804040e3000 RCX: >> 8804040e3000 >> [ +0.55] RDX: 8804040e3400 RSI: 8804040e3000 RDI: >> bff7f7af > > So you get a #GP while executing call *rdi and %rdi is supposed to > contain ->get_variable. But instead it contains some very funky shit: > > 0xbff7f7af > > Who made it contain that nuisance of a pointer which thinks it is > ->get_variable, huh? If only I could get my hands on that guy! :-P > > Ok, seriously, how reproducible is this? I don't really know how to reproduce this, I only can say that it usually happens while partitioning the loop device or perhaps when the kernel reads the partition table afterwards. > Can you reproduce with the > latest upstream kernel too, i.e. 3.15-rc5+? I don't know, I can't really afford to configure/compile/test this new kernel, sorry. Thanks -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: kernel 3.14.2 oops: seems related to EFI
On Sat, May 17, 2014 at 05:25:47PM +0200, Francis Moreau wrote: > [ +0.018677] general protection fault: [#1] PREEMPT SMP > [ +0.68] Modules linked in: usb_storage tun raid1 md_mod loop fuse > joydev coretemp hwmon arc4 intel_rapl x86_pkg_temp_thermal > intel_powerclamp kvm_intel nls_iso8859_1 nls_cp437 iTCO_wdt kvm vfat fat > iTCO_vendor_support iwldvm uvcvideo led_class crct10dif_pclmul > crc32_pclmul crc32c_intel ghash_clmulni_intel mac80211 videobuf2_vmalloc > videobuf2_memops videobuf2_core aesni_intel videodev aes_x86_64 > snd_hda_codec_hdmi lrw gf128mul mousedev glue_helper btusb > snd_hda_codec_via ablk_helper media cryptd iwlwifi snd_hda_codec_generic > bluetooth psmouse microcode i2c_i801 serio_raw cfg80211 6lowpan_iphc > rtsx_pci_ms r8169 memstick rfkill lpc_ich mii snd_hda_intel > snd_hda_codec thermal snd_hwdep wmi snd_pcm tpm_infineon snd_timer > tpm_tis mei_me snd tpm mei shpchp evdev soundcore processor battery > mac_hid ac > [ +0.000803] ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache > sd_mod sr_mod crc_t10dif cdrom crct10dif_common rtsx_pci_sdmmc mmc_core > atkbd libps2 ahci libahci ehci_pci libata xhci_hcd ehci_hcd scsi_mod > rtsx_pci usbcore usb_common i8042 serio i915 video button intel_gtt > i2c_algo_bit drm_kms_helper drm i2c_core > [ +0.000328] CPU: 0 PID: 30835 Comm: systemd-udevd Not tainted > 3.14.2-1-ARCH #1 > [ +0.64] Hardware name: CLEVO CO.W55xEU > /W55xEU , BIOS 4.6.5 > 03/05/2013 > [ +0.000102] task: 880405ee6bf0 ti: 880400f4a000 task.ti: > 880400f4a000 > [ +0.60] RIP: 0010:[] [] > efi_call5+0x6f/0xf0 > [ +0.71] RSP: 0018:880400f4bdb0 EFLAGS: 00010002 > [ +0.45] RAX: 80050033 RBX: 8804040e3000 RCX: > 8804040e3000 > [ +0.55] RDX: 8804040e3400 RSI: 8804040e3000 RDI: > bff7f7af So you get a #GP while executing call *rdi and %rdi is supposed to contain ->get_variable. But instead it contains some very funky shit: 0xbff7f7af Who made it contain that nuisance of a pointer which thinks it is ->get_variable, huh? If only I could get my hands on that guy! :-P Ok, seriously, how reproducible is this? Can you reproduce with the latest upstream kernel too, i.e. 3.15-rc5+? Thanks. (leaving in the rest for reference). > [ +0.56] RBP: 880400f4be80 R08: R09: > 880400f4bec0 > [ +0.55] R10: R11: 0246 R12: > 8804040e3400 > [ +0.56] R13: R14: 880400f4bec0 R15: > 0009b000 > [ +0.002960] FS: 7fb6167c97c0() GS:88041e20() > knlGS: > [ +0.002958] CS: 0010 DS: ES: CR0: 80050033 > [ +0.003177] CR2: 7fb61581f4c0 CR3: 0009b000 CR4: > 001427e0 > [ +0.003258] Stack: > [ +0.003257] 0201 8065 8804 > 8801 > [ +0.003328] 880400f4be50 > 80050033 > [ +0.003354] 00ff 00ff > > [ +0.003368] Call Trace: > [ +0.003389] [] ? virt_efi_get_variable+0x51/0x80 > [ +0.003353] [] efivar_entry_size+0x41/0x80 > [ +0.003315] [] efivarfs_file_read+0x49/0x100 > [ +0.003326] [] vfs_read+0x97/0x160 > [ +0.003305] [] SyS_read+0x59/0xd0 > [ +0.003263] [] system_call_fastpath+0x16/0x1b > [ +0.003239] Code: 89 c8 48 89 f1 80 3d e8 16 7d 00 00 74 1d 4c 89 3d > c7 16 7d 00 41 0f 20 df 4c 89 3d c4 16 7d 00 4c 8b 3d c5 16 7d 00 41 0f > 22 df d7 80 3d c0 16 7d 00 00 74 41 4c 8b 3d a7 16 7d 00 41 0f 22 > [ +0.003648] RIP [] efi_call5+0x6f/0xf0 > [ +0.003511] RSP > [ +0.024630] ---[ end trace 3670998c9a49abb7 ]--- > [ +0.05] note: systemd-udevd[30835] exited with preempt_count 2 > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majord...@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > -- Regards/Gruss, Boris. Sent from a fat crate under my desk. Formatting is fine. -- -- To unsubscribe from this list: send the line "unsubscribe linux-efi" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html