Re: Oops when rebooting 2.6.13-rc4-RT-V0.7.52-*

2005-08-10 Thread Chuck Harding

On Fri, 5 Aug 2005, Ingo Molnar wrote:


it's the first oops that matters. You could try to 'freeze' the system
after printing the first stacktrace, via the patch below - but debugging
boot-time crashes without logging support is a quite tedious process.

(maybe, if the crash happens after the ethernet card is detected, you
could try netconsole logging.)

Ingo

Index: linux/arch/i386/kernel/traps.c
===
--- linux.orig/arch/i386/kernel/traps.c
+++ linux/arch/i386/kernel/traps.c
@@ -170,6 +170,7 @@ void show_trace(struct task_struct *task
printk(" ===\n");
}
print_traces(task);
+   for (;;) raw_local_irq_disable();
show_held_locks(task);
}



I patched 52-16 with this and now I see at least part of the register
dump and the stack and call trace:

Process reboot(pid : 4551, threadinfo=f13f8000 task=f195f230 stack_left=7764 
worst_left=-1)

stack: f13f8000 c0116042 0700  28121969
 :
 :
call trace:
[] disable_IO_APIC+0x5a/0x90 (8)
[] machine_restart+0x5/0xa (28)
[] sys_reboot+0x147/0x15b (4)
[] netdev_run_todo+0xa4/0x209 (4)
[] dev_change_flags+0xca/0xf3  (16)
[] devinet_IOCTL+0x419/0x526 (20)
[] invalidate_inode_buffers+0xa/0x5d (88)
[] kmem_cache_free+0x1/0x90 (8)
[] destroy_inode+0x4a/0x75 (24)
[] generic_forget_inode+0x102/0x136 (8)
[] iput+0x57/0x89 (12)
[] atomic_dec_and_spin_lock+0x30/0x40 (12)
[] dput+0x76/0x1b7 (12)
[] __fput+0x114/0x182 (16)
[] filp_close+0x33/0x60 (28)
[] sysenter_past_esp+0x54/0x75 (20)

my .config is attached.
hope this helps.

--
Charles D. (Chuck) Harding <[EMAIL PROTECTED]>  Voice: 925-423-8879
Senior Computer Associate ICCDFax: 925-423-6961
Lawrence Livermore National Laboratory  Computation Directorate
Livermore, CA USA  http://www.llnl.gov  GPG Public Key ID: B9EB6601
-- http://tinyurl.com/5w5ey ---
-- Heads I win, tails you lose. --
#
# Automatically generated make config: don't edit
# Linux kernel version: 2.6.13-rc4-RT-V0.7.52-16
# Mon Aug  8 10:36:01 2005
#
CONFIG_X86=y
CONFIG_MMU=y
CONFIG_UID16=y
CONFIG_GENERIC_ISA_DMA=y
CONFIG_GENERIC_IOMAP=y

#
# Code maturity level options
#
CONFIG_EXPERIMENTAL=y
CONFIG_CLEAN_COMPILE=y
CONFIG_BROKEN_ON_SMP=y
CONFIG_LOCK_KERNEL=y
CONFIG_INIT_ENV_ARG_LIMIT=32

#
# General setup
#
CONFIG_LOCALVERSION=""
CONFIG_SWAP=y
CONFIG_SYSVIPC=y
CONFIG_POSIX_MQUEUE=y
CONFIG_BSD_PROCESS_ACCT=y
# CONFIG_BSD_PROCESS_ACCT_V3 is not set
CONFIG_SYSCTL=y
CONFIG_AUDIT=y
CONFIG_AUDITSYSCALL=y
CONFIG_HOTPLUG=y
CONFIG_KOBJECT_UEVENT=y
CONFIG_IKCONFIG=y
CONFIG_IKCONFIG_PROC=y
# CONFIG_EMBEDDED is not set
CONFIG_KALLSYMS=y
CONFIG_KALLSYMS_ALL=y
CONFIG_KALLSYMS_EXTRA_PASS=y
CONFIG_PRINTK=y
CONFIG_BUG=y
CONFIG_BASE_FULL=y
CONFIG_FUTEX=y
CONFIG_EPOLL=y
CONFIG_SHMEM=y
CONFIG_CC_ALIGN_FUNCTIONS=0
CONFIG_CC_ALIGN_LABELS=0
CONFIG_CC_ALIGN_LOOPS=0
CONFIG_CC_ALIGN_JUMPS=0
# CONFIG_TINY_SHMEM is not set
CONFIG_BASE_SMALL=0

#
# Loadable module support
#
CONFIG_MODULES=y
CONFIG_MODULE_UNLOAD=y
# CONFIG_MODULE_FORCE_UNLOAD is not set
CONFIG_OBSOLETE_MODPARM=y
CONFIG_MODVERSIONS=y
# CONFIG_MODULE_SRCVERSION_ALL is not set
CONFIG_KMOD=y

#
# Processor type and features
#
CONFIG_X86_PC=y
# CONFIG_X86_ELAN is not set
# CONFIG_X86_VOYAGER is not set
# CONFIG_X86_NUMAQ is not set
# CONFIG_X86_SUMMIT is not set
# CONFIG_X86_BIGSMP is not set
# CONFIG_X86_VISWS is not set
# CONFIG_X86_GENERICARCH is not set
# CONFIG_X86_ES7000 is not set
# CONFIG_M386 is not set
# CONFIG_M486 is not set
# CONFIG_M586 is not set
# CONFIG_M586TSC is not set
# CONFIG_M586MMX is not set
# CONFIG_M686 is not set
# CONFIG_MPENTIUMII is not set
# CONFIG_MPENTIUMIII is not set
# CONFIG_MPENTIUMM is not set
CONFIG_MPENTIUM4=y
# CONFIG_MK6 is not set
# CONFIG_MK7 is not set
# CONFIG_MK8 is not set
# CONFIG_MCRUSOE is not set
# CONFIG_MEFFICEON is not set
# CONFIG_MWINCHIPC6 is not set
# CONFIG_MWINCHIP2 is not set
# CONFIG_MWINCHIP3D is not set
# CONFIG_MGEODEGX1 is not set
# CONFIG_MCYRIXIII is not set
# CONFIG_MVIAC3_2 is not set
CONFIG_X86_GENERIC=y
CONFIG_X86_CMPXCHG=y
CONFIG_X86_XADD=y
CONFIG_X86_L1_CACHE_SHIFT=7
CONFIG_GENERIC_CALIBRATE_DELAY=y
CONFIG_X86_WP_WORKS_OK=y
CONFIG_X86_INVLPG=y
CONFIG_X86_BSWAP=y
CONFIG_X86_POPAD_OK=y
CONFIG_X86_GOOD_APIC=y
CONFIG_X86_INTEL_USERCOPY=y
CONFIG_X86_USE_PPRO_CHECKSUM=y
CONFIG_HPET_TIMER=y
CONFIG_HPET_EMULATE_RTC=y
# CONFIG_SMP is not set
# CONFIG_PREEMPT_NONE is not set
# CONFIG_PREEMPT_VOLUNTARY is not set
# CONFIG_PREEMPT_DESKTOP is not set
CONFIG_PREEMPT_RT=y
CONFIG_PREEMPT=y
CONFIG_PREEMPT_SOFTIRQS=y
CONFIG_PREEMPT_HARDIRQS=y
CONFIG_PREEMPT_RCU=y
CONFIG_PREEMPT_BKL=y
CONFIG_RCU_STATS=y
CONFIG_RCU_TORTURE_TEST=y
CONFIG_RWSEM_GENERIC_SPINLOCK=y
CONFIG_ASM_SEMAPHORES=y
CONFIG_X86_UP_APIC=y
CONFIG_X86_UP_IOAPIC=y
CONFIG_X86_LOCAL_APIC=y
CONFIG_X86_IO_APIC=y
CONFIG_X86_IOAPIC_FAST=y
CONFIG_X86_TSC=y
CONFIG_X86_MCE=y
CONFIG_X86_MCE_NONFATAL=y

Re: Oops when rebooting 2.6.13-rc4-RT-V0.7.52-*

2005-08-10 Thread Chuck Harding

On Fri, 5 Aug 2005, Ingo Molnar wrote:


it's the first oops that matters. You could try to 'freeze' the system
after printing the first stacktrace, via the patch below - but debugging
boot-time crashes without logging support is a quite tedious process.

(maybe, if the crash happens after the ethernet card is detected, you
could try netconsole logging.)

Ingo

Index: linux/arch/i386/kernel/traps.c
===
--- linux.orig/arch/i386/kernel/traps.c
+++ linux/arch/i386/kernel/traps.c
@@ -170,6 +170,7 @@ void show_trace(struct task_struct *task
printk( ===\n);
}
print_traces(task);
+   for (;;) raw_local_irq_disable();
show_held_locks(task);
}



I patched 52-16 with this and now I see at least part of the register
dump and the stack and call trace:

Process reboot(pid : 4551, threadinfo=f13f8000 task=f195f230 stack_left=7764 
worst_left=-1)

stack: f13f8000 c0116042 0700  28121969
 :
 :
call trace:
[] disable_IO_APIC+0x5a/0x90 (8)
[] machine_restart+0x5/0xa (28)
[] sys_reboot+0x147/0x15b (4)
[] netdev_run_todo+0xa4/0x209 (4)
[] dev_change_flags+0xca/0xf3  (16)
[] devinet_IOCTL+0x419/0x526 (20)
[] invalidate_inode_buffers+0xa/0x5d (88)
[] kmem_cache_free+0x1/0x90 (8)
[] destroy_inode+0x4a/0x75 (24)
[] generic_forget_inode+0x102/0x136 (8)
[] iput+0x57/0x89 (12)
[] atomic_dec_and_spin_lock+0x30/0x40 (12)
[] dput+0x76/0x1b7 (12)
[] __fput+0x114/0x182 (16)
[] filp_close+0x33/0x60 (28)
[] sysenter_past_esp+0x54/0x75 (20)

my .config is attached.
hope this helps.

--
Charles D. (Chuck) Harding [EMAIL PROTECTED]  Voice: 925-423-8879
Senior Computer Associate ICCDFax: 925-423-6961
Lawrence Livermore National Laboratory  Computation Directorate
Livermore, CA USA  http://www.llnl.gov  GPG Public Key ID: B9EB6601
-- http://tinyurl.com/5w5ey ---
-- Heads I win, tails you lose. --
#
# Automatically generated make config: don't edit
# Linux kernel version: 2.6.13-rc4-RT-V0.7.52-16
# Mon Aug  8 10:36:01 2005
#
CONFIG_X86=y
CONFIG_MMU=y
CONFIG_UID16=y
CONFIG_GENERIC_ISA_DMA=y
CONFIG_GENERIC_IOMAP=y

#
# Code maturity level options
#
CONFIG_EXPERIMENTAL=y
CONFIG_CLEAN_COMPILE=y
CONFIG_BROKEN_ON_SMP=y
CONFIG_LOCK_KERNEL=y
CONFIG_INIT_ENV_ARG_LIMIT=32

#
# General setup
#
CONFIG_LOCALVERSION=
CONFIG_SWAP=y
CONFIG_SYSVIPC=y
CONFIG_POSIX_MQUEUE=y
CONFIG_BSD_PROCESS_ACCT=y
# CONFIG_BSD_PROCESS_ACCT_V3 is not set
CONFIG_SYSCTL=y
CONFIG_AUDIT=y
CONFIG_AUDITSYSCALL=y
CONFIG_HOTPLUG=y
CONFIG_KOBJECT_UEVENT=y
CONFIG_IKCONFIG=y
CONFIG_IKCONFIG_PROC=y
# CONFIG_EMBEDDED is not set
CONFIG_KALLSYMS=y
CONFIG_KALLSYMS_ALL=y
CONFIG_KALLSYMS_EXTRA_PASS=y
CONFIG_PRINTK=y
CONFIG_BUG=y
CONFIG_BASE_FULL=y
CONFIG_FUTEX=y
CONFIG_EPOLL=y
CONFIG_SHMEM=y
CONFIG_CC_ALIGN_FUNCTIONS=0
CONFIG_CC_ALIGN_LABELS=0
CONFIG_CC_ALIGN_LOOPS=0
CONFIG_CC_ALIGN_JUMPS=0
# CONFIG_TINY_SHMEM is not set
CONFIG_BASE_SMALL=0

#
# Loadable module support
#
CONFIG_MODULES=y
CONFIG_MODULE_UNLOAD=y
# CONFIG_MODULE_FORCE_UNLOAD is not set
CONFIG_OBSOLETE_MODPARM=y
CONFIG_MODVERSIONS=y
# CONFIG_MODULE_SRCVERSION_ALL is not set
CONFIG_KMOD=y

#
# Processor type and features
#
CONFIG_X86_PC=y
# CONFIG_X86_ELAN is not set
# CONFIG_X86_VOYAGER is not set
# CONFIG_X86_NUMAQ is not set
# CONFIG_X86_SUMMIT is not set
# CONFIG_X86_BIGSMP is not set
# CONFIG_X86_VISWS is not set
# CONFIG_X86_GENERICARCH is not set
# CONFIG_X86_ES7000 is not set
# CONFIG_M386 is not set
# CONFIG_M486 is not set
# CONFIG_M586 is not set
# CONFIG_M586TSC is not set
# CONFIG_M586MMX is not set
# CONFIG_M686 is not set
# CONFIG_MPENTIUMII is not set
# CONFIG_MPENTIUMIII is not set
# CONFIG_MPENTIUMM is not set
CONFIG_MPENTIUM4=y
# CONFIG_MK6 is not set
# CONFIG_MK7 is not set
# CONFIG_MK8 is not set
# CONFIG_MCRUSOE is not set
# CONFIG_MEFFICEON is not set
# CONFIG_MWINCHIPC6 is not set
# CONFIG_MWINCHIP2 is not set
# CONFIG_MWINCHIP3D is not set
# CONFIG_MGEODEGX1 is not set
# CONFIG_MCYRIXIII is not set
# CONFIG_MVIAC3_2 is not set
CONFIG_X86_GENERIC=y
CONFIG_X86_CMPXCHG=y
CONFIG_X86_XADD=y
CONFIG_X86_L1_CACHE_SHIFT=7
CONFIG_GENERIC_CALIBRATE_DELAY=y
CONFIG_X86_WP_WORKS_OK=y
CONFIG_X86_INVLPG=y
CONFIG_X86_BSWAP=y
CONFIG_X86_POPAD_OK=y
CONFIG_X86_GOOD_APIC=y
CONFIG_X86_INTEL_USERCOPY=y
CONFIG_X86_USE_PPRO_CHECKSUM=y
CONFIG_HPET_TIMER=y
CONFIG_HPET_EMULATE_RTC=y
# CONFIG_SMP is not set
# CONFIG_PREEMPT_NONE is not set
# CONFIG_PREEMPT_VOLUNTARY is not set
# CONFIG_PREEMPT_DESKTOP is not set
CONFIG_PREEMPT_RT=y
CONFIG_PREEMPT=y
CONFIG_PREEMPT_SOFTIRQS=y
CONFIG_PREEMPT_HARDIRQS=y
CONFIG_PREEMPT_RCU=y
CONFIG_PREEMPT_BKL=y
CONFIG_RCU_STATS=y
CONFIG_RCU_TORTURE_TEST=y
CONFIG_RWSEM_GENERIC_SPINLOCK=y
CONFIG_ASM_SEMAPHORES=y
CONFIG_X86_UP_APIC=y
CONFIG_X86_UP_IOAPIC=y
CONFIG_X86_LOCAL_APIC=y

Re: Oops when rebooting 2.6.13-rc4-RT-V0.7.52-*

2005-08-05 Thread Ingo Molnar

* Chuck Harding <[EMAIL PROTECTED]> wrote:

> couldn't see the beginning of the oops but at the end was
> Init: no more processes left in this run level
> and have to power cycle to be able to boot. I tried vanilla -rc4, -rc5
> and -rc4-mm1 which all worked just fine. But all 3 of the -RT versions
> I have on hand (08,10,13) showed the same symptom.
> 
> This is my desktop system - Dell Optiplex GX-240 2GHz P4 1Gb SDRAM ATI 
> Radeon VE/7000 QY SB Live! Value. Dell FP2000 RHEL 4.0 KDE 3.3
> 
> I don't have serial console debugging capability here at work, so what 
> can I do to debug this? Thanks.

it's the first oops that matters. You could try to 'freeze' the system 
after printing the first stacktrace, via the patch below - but debugging 
boot-time crashes without logging support is a quite tedious process.

(maybe, if the crash happens after the ethernet card is detected, you 
could try netconsole logging.)

Ingo

Index: linux/arch/i386/kernel/traps.c
===
--- linux.orig/arch/i386/kernel/traps.c
+++ linux/arch/i386/kernel/traps.c
@@ -170,6 +170,7 @@ void show_trace(struct task_struct *task
printk(" ===\n");
}
print_traces(task);
+   for (;;) raw_local_irq_disable();
show_held_locks(task);
 }
 
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: Oops when rebooting 2.6.13-rc4-RT-V0.7.52-*

2005-08-05 Thread Ingo Molnar

* Chuck Harding [EMAIL PROTECTED] wrote:

 couldn't see the beginning of the oops but at the end was
 Init: no more processes left in this run level
 and have to power cycle to be able to boot. I tried vanilla -rc4, -rc5
 and -rc4-mm1 which all worked just fine. But all 3 of the -RT versions
 I have on hand (08,10,13) showed the same symptom.
 
 This is my desktop system - Dell Optiplex GX-240 2GHz P4 1Gb SDRAM ATI 
 Radeon VE/7000 QY SB Live! Value. Dell FP2000 RHEL 4.0 KDE 3.3
 
 I don't have serial console debugging capability here at work, so what 
 can I do to debug this? Thanks.

it's the first oops that matters. You could try to 'freeze' the system 
after printing the first stacktrace, via the patch below - but debugging 
boot-time crashes without logging support is a quite tedious process.

(maybe, if the crash happens after the ethernet card is detected, you 
could try netconsole logging.)

Ingo

Index: linux/arch/i386/kernel/traps.c
===
--- linux.orig/arch/i386/kernel/traps.c
+++ linux/arch/i386/kernel/traps.c
@@ -170,6 +170,7 @@ void show_trace(struct task_struct *task
printk( ===\n);
}
print_traces(task);
+   for (;;) raw_local_irq_disable();
show_held_locks(task);
 }
 
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/