Re: gcc crashes with general protection faults in 5.9.0-rc3-00091-ge28f0104343d

2020-09-11 Thread Meelis Roos

Replying to myself:

This is 5.9.0-rc3-00091-ge28f0104343d on a Lenovo T460s that has run fine up to 
5.8.0.


Now I reproduced the same problem with 5.9.0-rc3 on an HP desktop with a Core2Quad 
CPU. The call trace is very similar and it's crashing gcc again while compiling 
5.9-rc4.

But it seems 5.9-rc4 cures it here as well - whatever the reason might have 
been.

Nope, the reason was nondeterminism - it happened on the Core2Quad running 
5.9-rc4 while trying to compile today's Linux from git.

--
Meelis Roos 


Re: gcc crashes with general protection faults in 5.9.0-rc3-00091-ge28f0104343d

2020-09-08 Thread Meelis Roos

Replying to myself:

This is 5.9.0-rc3-00091-ge28f0104343d on a Lenovo T460s that has run fine up to 
5.8.0.


Now I reproduced the same problem with 5.9.0-rc3 on an HP desktop with a Core2Quad 
CPU. The call trace is very similar and it's crashing gcc again while compiling 
5.9-rc4.

But it seems 5.9-rc4 cures it here as well - whatever the reason might have 
been.

[  814.959665] general protection fault, probably for non-canonical address 
0xdead0400:  [#1] SMP PTI
[  814.959671] CPU: 2 PID: 14496 Comm: cc1 Not tainted 5.9.0-rc3 #51
[  814.959672] Hardware name: Hewlett-Packard HP Compaq 6000 Pro MT PC/3048h, 
BIOS 786G2 v02.03 10/19/2015
[  814.959678] RIP: 0010:ext4_readpage+0xa/0x50
[  814.959680] Code: 60 01 fb 44 89 ca 4c 89 c6 e8 32 50 03 00 85 c0 79 ea 0f 0b c3 
66 66 2e 0f 1f 84 00 00 00 00 00 41 54 49 89 f4 55 48 8b 46 18 <48> 8b 28 48 8b 
85 68 ff ff ff a9 00 00 00 10 74 1b 66 83 bd d2 02
[  814.959681] RSP: :bcd4cb73fd68 EFLAGS: 00010286
[  814.959683] RAX: dead0400 RBX: bcd4cb73fe40 RCX: 
[  814.959685] RDX: 0001 RSI: e51507178c40 RDI: 9ff63f9e8e00
[  814.959686] RBP: e51507178c40 R08: 0002 R09: 9ff653d21d90
[  814.959687] R10: 9ff64fee2c00 R11: 9ff63f9e8e00 R12: e51507178c40
[  814.959688] R13: 0a31 R14: 9ff640b8ee98 R15: 9ff640b8f008
[  814.959690] FS:  7f9ac1c64f00() GS:9ff653d0() 
knlGS:
[  814.959692] CS:  0010 DS:  ES:  CR0: 80050033
[  814.959693] CR2: 00e31ff0 CR3: 0001d99de000 CR4: 000406e0
[  814.959694] Call Trace:
[  814.959700]  filemap_fault+0x193/0x7c0
[  814.959703]  ext4_filemap_fault+0x28/0x3a
[  814.959706]  __do_fault+0x31/0xf0
[  814.959708]  handle_mm_fault+0xf0e/0x14c0
[  814.959712]  do_user_addr_fault+0x18c/0x380
[  814.959715]  exc_page_fault+0x5b/0x80
[  814.959718]  ? asm_exc_page_fault+0x8/0x30
[  814.959720]  asm_exc_page_fault+0x1e/0x30
[  814.959722] RIP: 0033:0xe31ff0
[  814.959725] Code: Bad RIP value.
[  814.959727] RSP: 002b:7ffc2515f458 EFLAGS: 00010246
[  814.959728] RAX:  RBX: 7f9ac0118798 RCX: 0070
[  814.959730] RDX: 7ffc2515f4a0 RSI: 7f9ac1500b28 RDI: 7ffc2515f510
[  814.959731] RBP: 7f9ac00d4ea0 R08:  R09: 0008
[  814.959732] R10:  R11: 02807fe0 R12: 00861300
[  814.959733] R13: 7f9ac1500b28 R14: 7f9ac0118798 R15: 7ffc2515f4a0
[  814.959735] Modules linked in: 8021q garp stp mrp llc iptable_nat nf_nat 
nf_conntrack bluetooth libaes nf_defrag_ipv6 jitterentropy_rng nf_defrag_ipv4 
libcrc32c iptable_mangle bpfilter drbg iptable_filter ecdh_generic ecc 
snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio iTCO_wdt mei_wdt 
iTCO_vendor_support wmi_bmof kvm_intel snd_hda_intel snd_intel_dspcfg 
snd_hda_codec pcspkr kvm snd_hwdep snd_hda_core irqbypass snd_pcm mei_me 
snd_timer snd lpc_ich mfd_core mei soundcore 8250_pci tpm_infineon tpm_tis 
tpm_tis_core tpm rng_core wmi coretemp hwmon ip_tables x_tables autofs4
[  814.959762] ---[ end trace 9b16ba743f2d455c ]---
[  814.959764] RIP: 0010:ext4_readpage+0xa/0x50
[  814.959766] Code: 60 01 fb 44 89 ca 4c 89 c6 e8 32 50 03 00 85 c0 79 ea 0f 0b c3 
66 66 2e 0f 1f 84 00 00 00 00 00 41 54 49 89 f4 55 48 8b 46 18 <48> 8b 28 48 8b 
85 68 ff ff ff a9 00 00 00 10 74 1b 66 83 bd d2 02
[  814.959769] RSP: :bcd4cb73fd68 EFLAGS: 00010286
[  814.959771] RAX: dead0400 RBX: bcd4cb73fe40 RCX: 
[  814.959772] RDX: 0001 RSI: e51507178c40 RDI: 9ff63f9e8e00
[  814.959774] RBP: e51507178c40 R08: 0002 R09: 9ff653d21d90
[  814.959776] R10: 9ff64fee2c00 R11: 9ff63f9e8e00 R12: e51507178c40
[  814.959778] R13: 0a31 R14: 9ff640b8ee98 R15: 9ff640b8f008
[  814.959780] FS:  7f9ac1c64f00() GS:9ff653d0() 
knlGS:
[  814.959782] CS:  0010 DS:  ES:  CR0: 80050033
[  814.959784] CR2: 00e31fc6 CR3: 0001d99de000 CR4: 000406e0
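
To check the "very similar call trace" claim across machines, the faulting symbol can be pulled out of each report mechanically. This is only a rough sketch: the regular expression and the function name faulting_symbols are my own, and it assumes the kernel-mode RIP line has the "0010:symbol+0xoff/0xsize" shape seen in the two reports in this thread.

```python
import re

# Kernel-mode RIP line of an oops: "RIP: 0010:symbol+0xoffset/0xsize".
# The pattern is an assumption based on the two reports in this thread.
RIP_RE = re.compile(
    r"RIP: 0010:(?P<symbol>[\w.]+)\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
)


def faulting_symbols(dmesg_text):
    """Return (symbol, offset, size) for every kernel-mode RIP line found."""
    return [
        (m.group("symbol"), int(m.group("offset"), 16), int(m.group("size"), 16))
        for m in RIP_RE.finditer(dmesg_text)
    ]


# The two RIP lines quoted in this thread.
desktop = "[  814.959678] RIP: 0010:ext4_readpage+0xa/0x50"
laptop = "[307299.392055] RIP: 0010:ext4_readpage+0xb/0x90"

for name, line in (("desktop", desktop), ("laptop", laptop)):
    for symbol, offset, size in faulting_symbols(line):
        print(f"{name}: {symbol} at offset {offset:#x} of {size:#x} bytes")
```

Both reports fault in ext4_readpage within the first few bytes of the function; the differing offsets and sizes are what one would expect from two differently configured kernel builds.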




Today I tried reproducing my linking problem with git kernel on my laptop and 
got segmentation faults in gcc. This is probably the corresponding dmesg part:

0xdead0400 looks like some kind of poisoning.

[307299.392045] general protection fault, probably for non-canonical address 
0xdead0400:  [#1] SMP PTI
[307299.392050] CPU: 1 PID: 165374 Comm: cc1 Not tainted 
5.9.0-rc3-00091-ge28f0104343d #207
[307299.392051] Hardware name: LENOVO 20F9003SMS/20F9003SMS, BIOS N1CET71W 
(1.39 ) 09/04/2018
[307299.392055] RIP: 0010:ext4_readpage+0xb/0x90
[307299.392057] Code: 9b a3 a6 e8 57 82 fe ff f0 80 63 01 df f0 80 63 01 bf 5b c3 c3 
f0 80 0e 01 eb d7 0f 1f 40 00 41 54 49 89 f4 55 53 48 8b 46 18 <48> 8b 28 0f 1f 
44 00 00 48 8b 85 68 ff ff ff a9 00 00 00 10 74 1f
[307299.392058] RSP: 0018:a08308d03d58 EFLAGS: 00010286
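
The poisoning guess above can be tested with a few lines of arithmetic. The sketch below is illustrative only: it assumes x86-64 with 48-bit virtual addresses and the default CONFIG_ILLEGAL_POINTER_VALUE of 0xdead000000000000, and the example address is a made-up full-width value, not one copied from the dmesg output (the addresses as archived here look shortened). The helper names is_canonical and looks_poisoned are hypothetical.

```python
# Assumption: x86-64 with 4-level paging, i.e. 48-bit virtual addresses.
VA_BITS = 48

# Default CONFIG_ILLEGAL_POINTER_VALUE on x86-64; the kernel's poison
# constants are small offsets added to this base.
POISON_POINTER_DELTA = 0xDEAD000000000000


def is_canonical(addr, va_bits=VA_BITS):
    """True if bits 63..(va_bits - 1) are all zeros or all ones."""
    top = addr >> (va_bits - 1)
    all_ones = (1 << (64 - va_bits + 1)) - 1
    return top == 0 or top == all_ones


def looks_poisoned(addr):
    """True if the upper 16 bits match the poison base."""
    return (addr & 0xFFFF000000000000) == POISON_POINTER_DELTA


# Hypothetical full-width value, for illustration only.
example = POISON_POINTER_DELTA + 0x400

print(f"{example:#x}: canonical={is_canonical(example)}, "
      f"poison-like={looks_poisoned(example)}")
```

An address built on that base is non-canonical by construction, so dereferencing it raises a general protection fault rather than an ordinary page fault, which matches the "probably for non-canonical address" wording in the report.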

Re: gcc crashes with general protection faults in 5.9.0-rc3-00091-ge28f0104343d

2020-09-07 Thread Meelis Roos

Following up on yesterday's mail:


This is 5.9.0-rc3-00091-ge28f0104343d on a Lenovo T460s that has run fine up to 
5.8.0.

Today I tried reproducing my linking problem with git kernel on my laptop and 
got segmentation faults in gcc. This is probably the corresponding dmesg part:

0xdead0400 looks like some kind of poisoning.

[307299.392045] general protection fault, probably for non-canonical address 
0xdead0400:  [#1] SMP PTI


Was not reproducible in 5.9-rc4 while recompiling the kernel in a loop for 8 
hours.
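
For anyone who wants to repeat the rebuild loop, here is a minimal sketch of such a driver. It is not the script used for the 8-hour run; the function name rebuild_until_failure and its parameters are hypothetical, and the build command is left to the caller. The run and clock parameters exist only so the loop can be exercised without starting a real build.

```python
import subprocess
import time


def rebuild_until_failure(cmd, hours, run=subprocess.run, clock=time.monotonic):
    """Run cmd repeatedly until it fails or the time budget is used up.

    Returns (successful_iterations, failed).
    """
    deadline = clock() + hours * 3600
    iterations = 0
    while clock() < deadline:
        result = run(cmd)
        if result.returncode != 0:
            # Stop at the first failing build so dmesg can be inspected
            # while the evidence is still fresh.
            return iterations, True
        iterations += 1
    return iterations, False
```

Calling it with a command list such as ["sh", "-c", "make clean && make"] and hours=8 would approximate the test described above. Because the crash is nondeterministic, a run with no failure only lowers the odds that the bug is still present; it does not prove a fix.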

--
Meelis Roos