On 2016-11-13 08:15 AM, Gerald Brandt wrote:
Hi,

I'm getting a lot of crashes on my Proxmox box. I am runing Proxmox on a Debian base install, but I have anther boxes that does the same, and it is fine.


Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442402] ------------[ cut here ]------------ Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442408] WARNING: CPU: 2 PID: 0 at kernel/rcu/tree.c:2733 rcu_process_callbacks+0x5bb/0x5e0() Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442409] Modules linked in: nfsv3 rpcsec_gss_krb5 nfsv4 ip_set ip6table_filter ip6_tables iptable_filter ip_tables softdog x_tables nfsd auth_rpcgss nfs_acl nfs lockd grace fscache sunrpc ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nfnetlink_log nfnetlink xfs snd_hda_codec_hdmi nouveau eeepc_wmi asus_wmi kvm_amd kvm sparse_keymap irqbypass mxm_wmi crct10dif_pclmul snd_hda_codec_realtek crc32_pclmul video snd_hda_codec_generic ttm snd_hda_intel drm_kms_helper drm snd_hda_codec aesni_intel aes_x86_64 lrw gf128mul glue_helper snd_hda_core ablk_helper cryptd snd_hwdep i2c_algo_bit snd_pcm fb_sys_fops syscopyarea snd_timer sysfillrect snd sysimgblt input_leds pcspkr serio_raw soundcore edac_mce_amd k10temp fam15h_power edac_core shpchp i2c_piix4 8250_fintek mac_hid wmi vhost_net vhost macvtap macvlan it87 hwmon_vid autofs4 btrfs raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 ses enclosure uas usb_storage firewire_ohci r8169 mii firewire_core crc_itu_t sata_sil24 ahci libahci fjes Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442454] CPU: 2 PID: 0 Comm: swapper/2 Not tainted 4.4.21-1-pve #1 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442455] Hardware name: To be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX, BIOS 0901 11/24/2011 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442457] 0000000000000086 63ad933f85fa0f2b ffff88083fc83e70 ffffffff813f3f83 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442459] 0000000000000000 ffffffff81ccfadb ffff88083fc83ea8 ffffffff81081806 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442460] ffffffff81e576c0 ffff88083fc97f38 0000000000000246 0000000000000000
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442462] Call Trace:
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442463] <IRQ> [<ffffffff813f3f83>] dump_stack+0x63/0x90 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442469] [<ffffffff81081806>] warn_slowpath_common+0x86/0xc0 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442471] [<ffffffff8108194a>] warn_slowpath_null+0x1a/0x20 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442473] [<ffffffff810e792b>] rcu_process_callbacks+0x5bb/0x5e0 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442475] [<ffffffff8108630e>] __do_softirq+0x10e/0x2a0 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442476] [<ffffffff810865fe>] irq_exit+0x8e/0x90 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442480] [<ffffffff81857122>] smp_apic_timer_interrupt+0x42/0x50 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442481] [<ffffffff818553e2>] apic_timer_interrupt+0x82/0x90 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442482] <EOI> [<ffffffff816d23ea>] ? cpuidle_enter_state+0x10a/0x260 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442487] [<ffffffff816d23c6>] ? cpuidle_enter_state+0xe6/0x260 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442488] [<ffffffff816d2577>] cpuidle_enter+0x17/0x20 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442491] [<ffffffff810c453b>] call_cpuidle+0x3b/0x70 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442492] [<ffffffff816d2553>] ? cpuidle_select+0x13/0x20 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442494] [<ffffffff810c482f>] cpu_startup_entry+0x2bf/0x380 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442496] [<ffffffff81051a34>] start_secondary+0x154/0x190 Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442497] ---[ end trace 8a742910926b0ed4 ]--- Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.617812] BUG: unable to handle kernel paging request at 000000000000bb00 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.618057] IP: [<ffffffff811ebe57>] kmem_cache_alloc+0x77/0x200 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.618662] PGD 5cb1c5067 PUD 5cb0f2067 PMD 0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.619431] Oops: 0000 [#1] SMP
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.620253] Modules linked in: nfsv3 rpcsec_gss_krb5 nfsv4 ip_set ip6table_filter ip6_tables iptable_filter ip_tables softdog x_tables nfsd auth_rpcgss nfs_acl nfs lockd grace fscache sunrpc ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nfnetlink_log nfnetlink xfs snd_hda_codec_hdmi nouveau eeepc_wmi asus_wmi kvm_amd kvm sparse_keymap irqbypass mxm_wmi crct10dif_pclmul snd_hda_codec_realtek crc32_pclmul video snd_hda_codec_generic ttm snd_hda_intel drm_kms_helper drm snd_hda_codec aesni_intel aes_x86_64 lrw gf128mul glue_helper snd_hda_core ablk_helper cryptd snd_hwdep i2c_algo_bit snd_pcm fb_sys_fops syscopyarea snd_timer sysfillrect snd sysimgblt input_leds pcspkr serio_raw soundcore edac_mce_amd k10temp fam15h_power edac_core shpchp i2c_piix4 8250_fintek mac_hid wmi vhost_net vhost macvtap macvlan it87 hwmon_vid autofs4 btrfs raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 ses enclosure uas usb_storage firewire_ohci r8169 mii firewire_core crc_itu_t sata_sil24 ahci libahci fjes Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.624994] CPU: 5 PID: 23044 Comm: ps Tainted: G W 4.4.21-1-pve #1 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.626005] Hardware name: To be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX, BIOS 0901 11/24/2011 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.627039] task: ffff880818ed3700 ti: ffff8805cb27c000 task.ti: ffff8805cb27c000 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.628071] RIP: 0010:[<ffffffff811ebe57>] [<ffffffff811ebe57>] kmem_cache_alloc+0x77/0x200 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.629113] RSP: 0018:ffff8805cb27fc98 EFLAGS: 00010282 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.630145] RAX: 0000000000000000 RBX: 00000000024080c0 RCX: 00000000000c428b Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.631198] RDX: 00000000000c428a RSI: 00000000024080c0 RDI: ffff88081f003700 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.632239] RBP: ffff8805cb27fcc8 R08: 000000000001a480 R09: 000000000000bb00 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.633275] R10: 0000000000000006 R11: 0000000000000000 R12: 00000000024080c0 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.634310] R13: ffffffff8120f26c R14: ffff88081f003700 R15: ffff88081f003700 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.635346] FS: 00007f54269ce700(0000) GS:ffff88083fd40000(0000) knlGS:0000000000000000 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.636350] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.637388] CR2: 000000000000bb00 CR3: 000000052f4f5000 CR4: 00000000000406e0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.638425] Stack:
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.639455] ffff8805cb27fcd0 0000000000000000 ffff880819ad3cc0 ffff8805cb27fef4 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.640500] 0000000000000000 ffff8805cb27fdd0 ffff8805cb27fcf0 ffffffff8120f26c Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.641545] ffffffff81217f1d 0000000000008000 ffff8805cb27fef4 ffff8805cb27fdc0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.642587] Call Trace:
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.643623] [<ffffffff8120f26c>] get_empty_filp+0x5c/0x1c0 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.644660] [<ffffffff81217f1d>] ? terminate_walk+0xbd/0xd0 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.645699] [<ffffffff8121bee3>] path_openat+0x43/0x1530 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.646731] [<ffffffff8121d544>] ? putname+0x54/0x60 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.647758] [<ffffffff8121d9e5>] ? filename_lookup+0xf5/0x180 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.648781] [<ffffffff8121e5d1>] do_filp_open+0x91/0x100 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.649802] [<ffffffff8138eaba>] ? common_perm_cond+0x3a/0x50 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.650814] [<ffffffff8111e472>] ? from_kgid_munged+0x12/0x20 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.651825] [<ffffffff81212b27>] ? cp_new_stat+0x157/0x190 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.652786] [<ffffffff8122bf86>] ? __alloc_fd+0x46/0x180 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.653804] [<ffffffff8120c8a9>] do_sys_open+0x139/0x2a0 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.654795] [<ffffffff8120ca2e>] SyS_open+0x1e/0x20 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.655780] [<ffffffff81854676>] entry_SYSCALL_64_fastpath+0x16/0x75 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.656766] Code: 08 65 4c 03 05 53 e3 e1 7e 4d 8b 08 4d 85 c9 0f 84 42 01 00 00 49 83 78 10 00 0f 84 37 01 00 00 49 63 47 20 48 8d 4a 01 4d 8b 07 <49> 8b 1c 01 4c 89 c8 65 49 0f c7 08 0f 94 c0 84 c0 74 bb 49 63 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.657834] RIP [<ffffffff811ebe57>] kmem_cache_alloc+0x77/0x200 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.658878] RSP <ffff8805cb27fc98> Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.659907] CR2: 000000000000bb00 Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.667666] ---[ end trace 8a742910926b0ed5 ]---

I am non-subscriptions, and I just did an update yesterday to see if it would fix the error. I'll be running a memtest today to see if I can find anything.

I hadn't done an update in awhile before that, so I'm leaning towards a hardware issue. What do you think?

Gerald


root@gbr-proxmox-1:~# pveversion  -verbose
proxmox-ve: 4.3-71 (running kernel: 4.4.21-1-pve)
pve-manager: 4.3-10 (running version: 4.3-10/7230e60f)
pve-kernel-4.4.6-1-pve: 4.4.6-48
pve-kernel-4.4.13-1-pve: 4.4.13-56
pve-kernel-4.2.6-1-pve: 4.2.6-36
pve-kernel-4.4.8-1-pve: 4.4.8-52
pve-kernel-4.4.21-1-pve: 4.4.21-71
pve-kernel-4.4.19-1-pve: 4.4.19-66
pve-kernel-4.4.10-1-pve: 4.4.10-54
lvm2: 2.02.116-pve3
corosync-pve: 2.4.0-1
libqb0: 1.0-1
pve-cluster: 4.0-47
qemu-server: 4.0-94
pve-firmware: 1.1-10
libpve-common-perl: 4.0-80
libpve-access-control: 4.0-19
libpve-storage-perl: 4.0-68
pve-libspice-server1: 0.12.8-1
vncterm: 1.2-1
pve-docs: 4.3-14
pve-qemu-kvm: 2.7.0-6
pve-container: 1.0-81
pve-firewall: 2.0-31
pve-ha-manager: 1.0-35
ksm-control-daemon: 1.2-1
glusterfs-client: 3.5.2-2+deb8u2
lxc-pve: 2.0.5-1
lxcfs: 2.0.4-pve2
criu: 1.6.0-1
novnc-pve: 0.5-8
smartmontools: 6.5+svn4324-1~pve80

_______________________________________________
pve-user mailing list
pve-user@pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user

Reply via email to