On Thu, 2006-02-02 at 22:05 +0000, Mike Crowe wrote: >> Perhaps this new problem is unrelated. I shall try and reproduce >> it. Is there anything I should try when the interface is locked up to >> help? On Thu, Feb 02, 2006 at 12:49:53PM -0800, Michael Chan wrote: > Did this setup use to work before?
It's a new machine. Having brought the interface back up again as mentioned in my last email and left it under network load for a while I returned to the machine to find an MCE. I'm suspicious that there may be a hardware problem or a more deep rooted software problem. :( I'll try and reproduce the problem again tomorrow. I can't reboot the machine remotely to continue today. :( CPU 2: Machine Check Exception: 4 Bank 4: b200000000070f0f TSC 92c1e455f5a CPU 0: Machine Check Exception: 4 Bank 4: b200000000070f0f TSC 92c1e65b4c9 Kernel panic - not syncing: Machine check NMI Watchdog detected LOCKUP on CPU 2 CPU 2 Modules linked in: nfsd nfs lockd nfs_acl sunrpc autofs4 ipv6 megaraid dm_mod ide_disk joydev evdev i2c_amd8111 i2c_amd756 shpchp psmouse i2c_core pcspkr serio_raw pci_hotplug hw_random xfs exportfs ide_cd cdrom ide_generic sd_mod generic megaraid_mbox amd74xx e100 scsi_mod mii ohci_hcd tg3 megaraid_mm ide_core thermal processor fan Pid: 0, comm: swapper Tainted: G M 2.6.15-1-amd64-k8-smp #1 RIP: 0010:[<ffffffff80117863>] <ffffffff80117863>{__smp_call_function+106} RSP: 0018:ffff8100f9fb4cb8 EFLAGS: 00000093 RAX: 0000000000000001 RBX: 0000000000000003 RCX: 0000000000000004 RDX: 0000ffff0000ffff RSI: 0000000000000000 RDI: ffffffff804116a0 RBP: 0000000000000000 R08: 0000000000000020 R09: 0000000000000004 R10: 0000000000000004 R11: 0000000000000000 R12: ffffffff801178f4 R13: 0000000000000000 R14: 0000092c1e455a9b R15: ffffffff802f255b FS: 00002aaaab2c5700(0000) GS:ffffffff8040a900(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 00002aaaaaac0000 CR3: 00000001fbd29000 CR4: 00000000000006e0 Process swapper (pid: 0, threadinfo ffff8100f9fb0000, task ffff8100f9faa0c0) Stack: ffffffff801178f4 0000000000000000 0000000000000001 0000000000000000 0000000000000097 0000000000000000 0000000000000000 00000000ffffffff ffffffff8033d6a0 ffffffff80117933 Call Trace: <#MC> <ffffffff801178f4>{smp_really_stop_cpu+0} <ffffffff80117933>{smp_send_stop+53} <ffffffff80130589>{panic+210} <ffffffff8010f0d1>{oops_begin+92} <ffffffff80114463>{print_mce+136} <ffffffff8011453b>{mce_available+0} <ffffffff80114849>{do_machine_check+749} <ffffffff8010eac7>{machine_check+127} <ffffffff801114ea>{timer_interrupt+99} <EOE> <IRQ> <ffffffff801523cd>{handle_IRQ_event+41} <ffffffff80152491>{__do_IRQ+147} <ffffffff80110365>{do_IRQ+45} <ffffffff8010dd20>{ret_from_intr+0} <EOI> <ffffffff802cd943>{thread_return+0} <ffffffff8010b9ad>{default_idle+57} <ffffffff8010bbbc>{cpu_idle+93} Code: 8b 44 24 10 39 d8 75 f6 85 ed 74 12 8b 44 24 14 39 d8 74 0a console shuts up ... <0> CPU 0: Machine Check Exception: 4 Bank 4: b200000000070f0f Kernel panic - not syncing: Aiee, killing interrupt handler! <0>TSC 92c1e65b4c9 Kernel panic - not syncing: Machine check NMI Watchdog detected LOCKUP on CPU 0 CPU 0 Modules linked in: nfsd nfs lockd nfs_acl sunrpc autofs4 ipv6 megaraid dm_mod ide_disk joydev evdev i2c_amd8111 i2c_amd756 shpchp psmouse i2c_core pcspkr serio_raw pci_hotplug hw_random xfs exportfs ide_cd cdrom ide_generic sd_mod generic megaraid_mbox amd74xx e100 scsi_mod mii ohci_hcd tg3 megaraid_mm ide_core thermal processor fan Pid: 0, comm: swapper Tainted: G M 2.6.15-1-amd64-k8-smp #1 RIP: 0010:[<ffffffff80117863>] <ffffffff80117863>{__smp_call_function+106} RSP: 0018:ffffffff803b3d78 EFLAGS: 00000097 RAX: 0000000000000001 RBX: 0000000000000002 RCX: 0000000000000003 RDX: 0000ffff0000ffff RSI: 0000000000000000 RDI: ffffffff804116a0 RBP: 0000000000000000 R08: 0000000000000020 R09: 0000000000000003 R10: 0000000000000003 R11: 0000000000000000 R12: ffffffff801178f4 R13: 0000000000000000 R14: 0000092c1e65b132 R15: ffffffff802f255b FS: 00002aaaaaf15090(0000) GS:ffffffff8040a800(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 000000000058d824 CR3: 0000000000101000 CR4: 00000000000006e0 Process swapper (pid: 0, threadinfo ffffffff80416000, task ffffffff8033a6a0) Stack: ffffffff801178f4 0000000000000000 0000000000000001 0000000000000000 0000000000000400 0000000000000001 0000000000000000 00000000ffffffff ffffffff8033d6a0 ffffffff80117933 Call Trace: <#MC> <ffffffff801178f4>{smp_really_stop_cpu+0} <ffffffff80117933>{smp_send_stop+53} <ffffffff80130589>{panic+210} <ffffffff8010f0d1>{oops_begin+92} <ffffffff80114463>{print_mce+136} <ffffffff8011453b>{mce_available+0} <ffffffff80114849>{do_machine_check+749} <ffffffff8010eac7>{machine_check+127} <ffffffff8010b9ad>{default_idle+57} <EOE> <ffffffff8010bbbc>{cpu_idle+93} <ffffffff804188a6>{start_kernel+442} <ffffffff80418293>{x86_64_start_kernel+423} Code: 8b 44 24 10 39 d8 75 f6 85 ed 74 12 8b 44 24 14 39 d8 74 0a console shuts up ... NMI Watchdog detected LOCKUP on CPU 3 CPU 3 Modules linked in: nfsd nfs lockd nfs_acl sunrpc autofs4 ipv6 megaraid dm_mod ide_disk joydev evdev i2c_amd8111 i2c_amd756 shpchp psmouse i2c_core pcspkr serio_raw pci_hotplug hw_random xfs exportfs ide_cd cdrom ide_generic sd_mod generic megaraid_mbox amd74xx e100 scsi_mod mii ohci_hcd tg3 megaraid_mm ide_core thermal processor fan Pid: 0, comm: swapper Tainted: G M 2.6.15-1-amd64-k8-smp #1 RIP: 0010:[<ffffffff802ceded>] <ffffffff802ceded>{.text.lock.spinlock+118} RSP: 0018:ffff8101044ffcf0 EFLAGS: 00000086 RAX: 0000000000000000 RBX: 0000000000000003 RCX: 0000000000000003 RDX: 0000000000000003 RSI: ffff8101044ffe08 RDI: ffffffff8033b1d0 RBP: 0000000000000000 R08: ffffffff88044459 R09: ffff8101044ffd68 R10: 0000000600000000 R11: 0000000000000001 R12: 0000000000000000 R13: ffff810004af6440 R14: ffffffff802f1de1 R15: 0000000000000000 FS: 00002aaaaaf15090(0000) GS:ffffffff8040a980(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 00002aaaaaac0000 CR3: 00000001ed6c3000 CR4: 00000000000006e0 Process swapper (pid: 0, threadinfo ffff81000c058000, task ffff81000c052100) Stack: ffffffff8010f0ab -- Mike Crowe - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html