Re: kernel 3.14.2 oops: seems related to EFI

2014-05-27 Thread Francis Moreau
On 05/20/2014 01:54 PM, Matt Fleming wrote:
> On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote:
>>
>> I don't know, I can't really afford to configure/compile/test this new
>> kernel, sorry.
> 
> It would be useful to know whether this issue still occurs when booting
> with the efi=old_map kernel parameter.
> 

the bug triggered:

[  +0.002872] BUG: unable to handle kernel paging request at
fffefd4a1e60
[  +0.66] IP: [] virt_efi_get_variable+0x48/0x80
[  +0.54] PGD 280f067 PUD 0
[  +0.31] Oops:  [#1] PREEMPT SMP
[  +0.39] Modules linked in: tun ses enclosure usb_storage loop fuse
joydev coretemp hwmon arc4 nls_iso8859_1 nls_c
[  +0.000691]  ac ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache
sd_mod sr_mod crc_t10dif cdrom crct10dif_common
[  +0.000289] CPU: 7 PID: 23293 Comm: systemd-udevd Tainted: GW
   3.14.4-1-ARCH #1
[  +0.57] Hardware name: CLEVO CO.W55xEU
  /W55xEU
[  +0.87] task: 88039557bae0 ti: 8802de764000 task.ti:
8802de764000
[  +0.50] RIP: 0010:[]  []
virt_efi_get_variable+0x48/0x80
[  +0.64] RSP: 0018:8802de765e58  EFLAGS: 00010082
[  +0.37] RAX: fffefd4a1e18 RBX: 8800da88f000 RCX:

[  +0.48] RDX: 8800da88f400 RSI: 8800da88f000 RDI:

[  +0.48] RBP: 8802de765e80 R08: 8802de765ec0 R09:

[  +0.47] R10:  R11: 0246 R12:
8800da88f400
[  +0.48] R13:  R14: 8802de765ec0 R15:

[  +0.48] FS:  7f10751057c0() GS:88041e3c()
knlGS:
[  +0.54] CS:  0010 DS:  ES:  CR0: 80050033
[  +0.40] CR2: fffefd4a1e60 CR3: 0003c4afa000 CR4:
001407e0
[  +0.48] Stack:
[  +0.16]  8800da88f000 8802de765ec0 81b27c20
8802de765f48
[  +0.60]  3bc93ec9a0004bba 8802de765ea8 813dbc91
8800da88f000
[  +0.60]  7fffdc30c104 0004 8802de765ef8
81245779
[  +0.60] Call Trace:
[  +0.25]  [] efivar_entry_size+0x41/0x80
[  +0.44]  [] efivarfs_file_read+0x49/0x100
[  +0.44]  [] vfs_read+0x97/0x160
[  +0.37]  [] SyS_read+0x59/0xd0
[  +0.39]  [] system_call_fastpath+0x16/0x1b
[  +0.41] Code: ce 4d 89 c7 e8 9a 06 00 00 65 ff 04 25 a0 c7 00 00
48 8b 05 1b d4 86 00 4d 89 f9 4d 89 f0 4c 89 e9
[  +0.000335] RIP  [] virt_efi_get_variable+0x48/0x80
[  +0.49]  RSP 
[  +0.26] CR2: fffefd4a1e60
[  +0.016781] ---[ end trace 5a7017feeac75345 ]---

the sad thing is tht my system can't shutdown properly when it happens.

--
To unsubscribe from this list: send the line "unsubscribe linux-efi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: kernel 3.14.2 oops: seems related to EFI

2014-05-20 Thread Francis Moreau
On 05/20/2014 01:54 PM, Matt Fleming wrote:
> On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote:
>>
>> I don't know, I can't really afford to configure/compile/test this new
>> kernel, sorry.
> 
> It would be useful to know whether this issue still occurs when booting
> with the efi=old_map kernel parameter.
> 

ok I can try to boot with that parameter and see if the issue happens
again. Unfortunately if it doesn't, we couldn't tell.

Thanks
--
To unsubscribe from this list: send the line "unsubscribe linux-efi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: kernel 3.14.2 oops: seems related to EFI

2014-05-20 Thread Matt Fleming
On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote:
> 
> I don't know, I can't really afford to configure/compile/test this new
> kernel, sorry.

It would be useful to know whether this issue still occurs when booting
with the efi=old_map kernel parameter.

-- 
Matt Fleming, Intel Open Source Technology Center
--
To unsubscribe from this list: send the line "unsubscribe linux-efi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: kernel 3.14.2 oops: seems related to EFI

2014-05-19 Thread Matt Fleming
On Mon, 19 May, at 09:09:58AM, Francis Moreau wrote:
> On 05/18/2014 03:42 PM, Borislav Petkov wrote:
> > On Sat, May 17, 2014 at 05:25:47PM +0200, Francis Moreau wrote:
> >> [  +0.018677] general protection fault:  [#1] PREEMPT SMP
> >> [  +0.68] Modules linked in: usb_storage tun raid1 md_mod loop fuse
> >> joydev coretemp hwmon arc4 intel_rapl x86_pkg_temp_thermal
> >> intel_powerclamp kvm_intel nls_iso8859_1 nls_cp437 iTCO_wdt kvm vfat fat
> >> iTCO_vendor_support iwldvm uvcvideo led_class crct10dif_pclmul
> >> crc32_pclmul crc32c_intel ghash_clmulni_intel mac80211 videobuf2_vmalloc
> >> videobuf2_memops videobuf2_core aesni_intel videodev aes_x86_64
> >> snd_hda_codec_hdmi lrw gf128mul mousedev glue_helper btusb
> >> snd_hda_codec_via ablk_helper media cryptd iwlwifi snd_hda_codec_generic
> >> bluetooth psmouse microcode i2c_i801 serio_raw cfg80211 6lowpan_iphc
> >> rtsx_pci_ms r8169 memstick rfkill lpc_ich mii snd_hda_intel
> >> snd_hda_codec thermal snd_hwdep wmi snd_pcm tpm_infineon snd_timer
> >> tpm_tis mei_me snd tpm mei shpchp evdev soundcore processor battery
> >> mac_hid ac
> >> [  +0.000803]  ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache
> >> sd_mod sr_mod crc_t10dif cdrom crct10dif_common rtsx_pci_sdmmc mmc_core
> >> atkbd libps2 ahci libahci ehci_pci libata xhci_hcd ehci_hcd scsi_mod
> >> rtsx_pci usbcore usb_common i8042 serio i915 video button intel_gtt
> >> i2c_algo_bit drm_kms_helper drm i2c_core
> >> [  +0.000328] CPU: 0 PID: 30835 Comm: systemd-udevd Not tainted
> >> 3.14.2-1-ARCH #1
> >> [  +0.64] Hardware name: CLEVO CO.W55xEU
> >>   /W55xEU  , BIOS 4.6.5
> >> 03/05/2013
> >> [  +0.000102] task: 880405ee6bf0 ti: 880400f4a000 task.ti:
> >> 880400f4a000
> >> [  +0.60] RIP: 0010:[]  []
> >> efi_call5+0x6f/0xf0
> >> [  +0.71] RSP: 0018:880400f4bdb0  EFLAGS: 00010002
> >> [  +0.45] RAX: 80050033 RBX: 8804040e3000 RCX:
> >> 8804040e3000
> >> [  +0.55] RDX: 8804040e3400 RSI: 8804040e3000 RDI:
> >> bff7f7af
> > 
> > So you get a #GP while executing call *rdi and %rdi is supposed to
> > contain ->get_variable. But instead it contains some very funky shit:
> > 
> > 0xbff7f7af
> > 
> > Who made it contain that nuisance of a pointer which thinks it is
> > ->get_variable, huh? If only I could get my hands on that guy! :-P
> > 
> > Ok, seriously, how reproducible is this?
> 
> I don't really know how to reproduce this, I only can say that it
> usually happens while partitioning the loop device or perhaps when the
> kernel reads the partition table afterwards.
 
It looks like it's oopsing as a result of systemd-udevd trying to
read a variable via the efivarfs mount,

 Call Trace:
  [] ? virt_efi_get_variable+0x51/0x80
  [] efivar_entry_size+0x41/0x80
  [] efivarfs_file_read+0x49/0x100
  [] vfs_read+0x97/0x160
  [] SyS_read+0x59/0xd0
  [] system_call_fastpath+0x16/0x1b

-- 
Matt Fleming, Intel Open Source Technology Center
--
To unsubscribe from this list: send the line "unsubscribe linux-efi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: kernel 3.14.2 oops: seems related to EFI

2014-05-19 Thread Francis Moreau
On 05/18/2014 03:42 PM, Borislav Petkov wrote:
> On Sat, May 17, 2014 at 05:25:47PM +0200, Francis Moreau wrote:
>> [  +0.018677] general protection fault:  [#1] PREEMPT SMP
>> [  +0.68] Modules linked in: usb_storage tun raid1 md_mod loop fuse
>> joydev coretemp hwmon arc4 intel_rapl x86_pkg_temp_thermal
>> intel_powerclamp kvm_intel nls_iso8859_1 nls_cp437 iTCO_wdt kvm vfat fat
>> iTCO_vendor_support iwldvm uvcvideo led_class crct10dif_pclmul
>> crc32_pclmul crc32c_intel ghash_clmulni_intel mac80211 videobuf2_vmalloc
>> videobuf2_memops videobuf2_core aesni_intel videodev aes_x86_64
>> snd_hda_codec_hdmi lrw gf128mul mousedev glue_helper btusb
>> snd_hda_codec_via ablk_helper media cryptd iwlwifi snd_hda_codec_generic
>> bluetooth psmouse microcode i2c_i801 serio_raw cfg80211 6lowpan_iphc
>> rtsx_pci_ms r8169 memstick rfkill lpc_ich mii snd_hda_intel
>> snd_hda_codec thermal snd_hwdep wmi snd_pcm tpm_infineon snd_timer
>> tpm_tis mei_me snd tpm mei shpchp evdev soundcore processor battery
>> mac_hid ac
>> [  +0.000803]  ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache
>> sd_mod sr_mod crc_t10dif cdrom crct10dif_common rtsx_pci_sdmmc mmc_core
>> atkbd libps2 ahci libahci ehci_pci libata xhci_hcd ehci_hcd scsi_mod
>> rtsx_pci usbcore usb_common i8042 serio i915 video button intel_gtt
>> i2c_algo_bit drm_kms_helper drm i2c_core
>> [  +0.000328] CPU: 0 PID: 30835 Comm: systemd-udevd Not tainted
>> 3.14.2-1-ARCH #1
>> [  +0.64] Hardware name: CLEVO CO.W55xEU
>>   /W55xEU  , BIOS 4.6.5
>> 03/05/2013
>> [  +0.000102] task: 880405ee6bf0 ti: 880400f4a000 task.ti:
>> 880400f4a000
>> [  +0.60] RIP: 0010:[]  []
>> efi_call5+0x6f/0xf0
>> [  +0.71] RSP: 0018:880400f4bdb0  EFLAGS: 00010002
>> [  +0.45] RAX: 80050033 RBX: 8804040e3000 RCX:
>> 8804040e3000
>> [  +0.55] RDX: 8804040e3400 RSI: 8804040e3000 RDI:
>> bff7f7af
> 
> So you get a #GP while executing call *rdi and %rdi is supposed to
> contain ->get_variable. But instead it contains some very funky shit:
> 
> 0xbff7f7af
> 
> Who made it contain that nuisance of a pointer which thinks it is
> ->get_variable, huh? If only I could get my hands on that guy! :-P
> 
> Ok, seriously, how reproducible is this?

I don't really know how to reproduce this, I only can say that it
usually happens while partitioning the loop device or perhaps when the
kernel reads the partition table afterwards.

> Can you reproduce with the
> latest upstream kernel too, i.e. 3.15-rc5+?

I don't know, I can't really afford to configure/compile/test this new
kernel, sorry.

Thanks
--
To unsubscribe from this list: send the line "unsubscribe linux-efi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: kernel 3.14.2 oops: seems related to EFI

2014-05-18 Thread Borislav Petkov
On Sat, May 17, 2014 at 05:25:47PM +0200, Francis Moreau wrote:
> [  +0.018677] general protection fault:  [#1] PREEMPT SMP
> [  +0.68] Modules linked in: usb_storage tun raid1 md_mod loop fuse
> joydev coretemp hwmon arc4 intel_rapl x86_pkg_temp_thermal
> intel_powerclamp kvm_intel nls_iso8859_1 nls_cp437 iTCO_wdt kvm vfat fat
> iTCO_vendor_support iwldvm uvcvideo led_class crct10dif_pclmul
> crc32_pclmul crc32c_intel ghash_clmulni_intel mac80211 videobuf2_vmalloc
> videobuf2_memops videobuf2_core aesni_intel videodev aes_x86_64
> snd_hda_codec_hdmi lrw gf128mul mousedev glue_helper btusb
> snd_hda_codec_via ablk_helper media cryptd iwlwifi snd_hda_codec_generic
> bluetooth psmouse microcode i2c_i801 serio_raw cfg80211 6lowpan_iphc
> rtsx_pci_ms r8169 memstick rfkill lpc_ich mii snd_hda_intel
> snd_hda_codec thermal snd_hwdep wmi snd_pcm tpm_infineon snd_timer
> tpm_tis mei_me snd tpm mei shpchp evdev soundcore processor battery
> mac_hid ac
> [  +0.000803]  ext4 crc16 mbcache jbd2 hid_generic usbhid hid bcache
> sd_mod sr_mod crc_t10dif cdrom crct10dif_common rtsx_pci_sdmmc mmc_core
> atkbd libps2 ahci libahci ehci_pci libata xhci_hcd ehci_hcd scsi_mod
> rtsx_pci usbcore usb_common i8042 serio i915 video button intel_gtt
> i2c_algo_bit drm_kms_helper drm i2c_core
> [  +0.000328] CPU: 0 PID: 30835 Comm: systemd-udevd Not tainted
> 3.14.2-1-ARCH #1
> [  +0.64] Hardware name: CLEVO CO.W55xEU
>   /W55xEU  , BIOS 4.6.5
> 03/05/2013
> [  +0.000102] task: 880405ee6bf0 ti: 880400f4a000 task.ti:
> 880400f4a000
> [  +0.60] RIP: 0010:[]  []
> efi_call5+0x6f/0xf0
> [  +0.71] RSP: 0018:880400f4bdb0  EFLAGS: 00010002
> [  +0.45] RAX: 80050033 RBX: 8804040e3000 RCX:
> 8804040e3000
> [  +0.55] RDX: 8804040e3400 RSI: 8804040e3000 RDI:
> bff7f7af

So you get a #GP while executing call *rdi and %rdi is supposed to
contain ->get_variable. But instead it contains some very funky shit:

0xbff7f7af

Who made it contain that nuisance of a pointer which thinks it is
->get_variable, huh? If only I could get my hands on that guy! :-P

Ok, seriously, how reproducible is this? Can you reproduce with the
latest upstream kernel too, i.e. 3.15-rc5+?

Thanks.

(leaving in the rest for reference).

> [  +0.56] RBP: 880400f4be80 R08:  R09:
> 880400f4bec0
> [  +0.55] R10:  R11: 0246 R12:
> 8804040e3400
> [  +0.56] R13:  R14: 880400f4bec0 R15:
> 0009b000
> [  +0.002960] FS:  7fb6167c97c0() GS:88041e20()
> knlGS:
> [  +0.002958] CS:  0010 DS:  ES:  CR0: 80050033
> [  +0.003177] CR2: 7fb61581f4c0 CR3: 0009b000 CR4:
> 001427e0
> [  +0.003258] Stack:
> [  +0.003257]  0201 8065 8804
> 8801
> [  +0.003328]    880400f4be50
> 80050033
> [  +0.003354]  00ff  00ff
> 
> [  +0.003368] Call Trace:
> [  +0.003389]  [] ? virt_efi_get_variable+0x51/0x80
> [  +0.003353]  [] efivar_entry_size+0x41/0x80
> [  +0.003315]  [] efivarfs_file_read+0x49/0x100
> [  +0.003326]  [] vfs_read+0x97/0x160
> [  +0.003305]  [] SyS_read+0x59/0xd0
> [  +0.003263]  [] system_call_fastpath+0x16/0x1b
> [  +0.003239] Code: 89 c8 48 89 f1 80 3d e8 16 7d 00 00 74 1d 4c 89 3d
> c7 16 7d 00 41 0f 20 df 4c 89 3d c4 16 7d 00 4c 8b 3d c5 16 7d 00 41 0f
> 22 df  d7 80 3d c0 16 7d 00 00 74 41 4c 8b 3d a7 16 7d 00 41 0f 22
> [  +0.003648] RIP  [] efi_call5+0x6f/0xf0
> [  +0.003511]  RSP 
> [  +0.024630] ---[ end trace 3670998c9a49abb7 ]---
> [  +0.05] note: systemd-udevd[30835] exited with preempt_count 2
> --
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
> 

-- 
Regards/Gruss,
Boris.

Sent from a fat crate under my desk. Formatting is fine.
--
--
To unsubscribe from this list: send the line "unsubscribe linux-efi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html