Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-13 Thread Rok Markovic
Hi

Good news (I hope). With the kernel option maxcpus=1 I cannot trigger
the lockup, so the system is stable and running. Where could the problem be?

Best regards,
Rok
 

Ingo Molnar wrote:
>> Hi
>>
>> I have recompiled the kernel with OPROFILE disabled. Attached are
>> /proc/interrupts (I ran it 3 times with a 2 sec delay) and the
>> netconsole log. The lockup in the log was triggered in software with
>> ./lockup. The kernel still locks up 4-5 minutes after boot without any
>> further messages (I waited more than 2 minutes). Any ideas?
> 
> no good ideas :-( I suspect this one:
> 
>   [  185.600900] hdc: lost interrupt
> 
> was a side-effect of lockup.c too?
> 
> does the lockup go away if you boot with maxcpus=1?
> 
>   Ingo
> 
> 

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-13 Thread Ingo Molnar

* Rok Markovic <[EMAIL PROTECTED]> wrote:

> Hi
> 
> I have recompiled the kernel with OPROFILE disabled. Attached are
> /proc/interrupts (I ran it 3 times with a 2 sec delay) and the
> netconsole log. The lockup in the log was triggered in software with
> ./lockup. The kernel still locks up 4-5 minutes after boot without any
> further messages (I waited more than 2 minutes). Any ideas?

no good ideas :-( I suspect this one:

  [  185.600900] hdc: lost interrupt

was a side-effect of lockup.c too?

does the lockup go away if you boot with maxcpus=1?

Ingo
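
One way to confirm that a maxcpus=1 boot really took effect is to compare
the boot command line with the number of CPUs the kernel brought up. The
following is only a minimal sketch and was not posted in the thread: the
helper names maxcpus_from_cmdline and online_cpus are made up for
illustration, and it assumes an x86-style /proc/cpuinfo with one
"processor" line per online CPU.

```shell
#!/bin/sh
# Hypothetical helpers (not from the thread) for checking a maxcpus= boot.

# maxcpus_from_cmdline FILE
# Prints the value of the last maxcpus= parameter found in a kernel
# command line file such as /proc/cmdline; prints nothing if it is absent.
maxcpus_from_cmdline() {
    tr ' ' '\n' < "$1" | sed -n 's/^maxcpus=//p' | tail -n 1
}

# online_cpus FILE
# Counts the "processor" entries in a cpuinfo-style file
# (assumption: one such line per online CPU, as on x86).
online_cpus() {
    grep -c '^processor' "$1"
}

# Typical use on the test machine:
#   maxcpus_from_cmdline /proc/cmdline   # expected to print 1
#   online_cpus /proc/cpuinfo            # expected to print 1 as well
```

If the two numbers disagree, the parameter was probably not passed to the
kernel that is actually running.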


Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-13 Thread Rok Markovic
Hi

I have recompiled the kernel with OPROFILE disabled. Attached are
/proc/interrupts (I ran it 3 times with a 2 sec delay) and the
netconsole log. The lockup in the log was triggered in software with
./lockup. The kernel still locks up 4-5 minutes after boot without any
further messages (I waited more than 2 minutes). Any ideas?


Rok


Ingo Molnar wrote:
> could you disable CONFIG_OPROFILE? Maybe it interferes with the NMI 
> watchdog?
> 
> what does /proc/interrupts look like shortly after bootup? Does a simple 
> "intentional lockup" piece of code, which provokes a hard lockup from 
> user-space, get properly zapped by the NMI watchdog (within a minute or 
> so):
> 
> the NMI watchdog should produce a console message similar to:
> 
>  BUG: NMI Watchdog detected LOCKUP on CPU0, eip bff12345, registers:
>  ...
> 
> ( if you test ./lockup and get the message properly then i'd suggest a 
>   new reboot - i think we inhibit further console output after an NMI 
>   printout. )
> 
>   Ingo
> 
> 


           CPU0       CPU1
  0:        151          0   IO-APIC-edge      timer
  1:        206          0   IO-APIC-edge      i8042
  5:          0          0   IO-APIC-edge      parport0
  6:          3          0   IO-APIC-edge      floppy
  8:          1          0   IO-APIC-edge      rtc
  9:          0          0   IO-APIC-fasteoi   acpi
 12:        102          0   IO-APIC-edge      i8042
 15:         24         84   IO-APIC-edge      ide1
 19:          3          0   IO-APIC-fasteoi   bttv0
 20:       8873          0   IO-APIC-fasteoi   sata_via
 21:      11741          0   IO-APIC-fasteoi   uhci_hcd:usb1, uhci_hcd:usb2, uhci_hcd:usb3, uhci_hcd:usb4, ehci_hcd:usb5
 22:          0          0   IO-APIC-fasteoi   VIA8237
 23:        380          0   IO-APIC-fasteoi   eth0
NMI:        253        194
LOC:      61512      64324
ERR:          0

           CPU0       CPU1
  0:        151          0   IO-APIC-edge      timer
  1:        286          0   IO-APIC-edge      i8042
  5:          0          0   IO-APIC-edge      parport0
  6:          3          0   IO-APIC-edge      floppy
  8:          1          0   IO-APIC-edge      rtc
  9:          0          0   IO-APIC-fasteoi   acpi
 12:        102          0   IO-APIC-edge      i8042
 15:         24        120   IO-APIC-edge      ide1
 19:          5          0   IO-APIC-fasteoi   bttv0
 20:       8879          0   IO-APIC-fasteoi   sata_via
 21:      12641          0   IO-APIC-fasteoi   uhci_hcd:usb1, uhci_hcd:usb2, uhci_hcd:usb3, uhci_hcd:usb4, ehci_hcd:usb5
 22:          0          0   IO-APIC-fasteoi   VIA8237
 23:        380          0   IO-APIC-fasteoi   eth0
NMI:        255        196
LOC:      61993      64918
ERR:          0

           CPU0       CPU1
  0:        151          0   IO-APIC-edge      timer
  1:        292          0   IO-APIC-edge      i8042
  5:          0          0   IO-APIC-edge      parport0
  6:          3          0   IO-APIC-edge      floppy
  8:          1          0   IO-APIC-edge      rtc
  9:          0          0   IO-APIC-fasteoi   acpi
 12:        102          0   IO-APIC-edge      i8042
 15:         24        144   IO-APIC-edge      ide1
 19:         15          0   IO-APIC-fasteoi   bttv0
 20:       8879          0   IO-APIC-fasteoi   sata_via
 21:      13169          0   IO-APIC-fasteoi   uhci_hcd:usb1, uhci_hcd:usb2, uhci_hcd:usb3, uhci_hcd:usb4, ehci_hcd:usb5
 22:          0          0   IO-APIC-fasteoi   VIA8237
 23:        380          0   IO-APIC-fasteoi   eth0
NMI:        255        196
LOC:      62237      65219
ERR:          0

[  177.078312] 
[  190.712608] ide-cd: cmd 0x1e timed out
[  190.731121] hdc: lost interrupt
[0.00] Linux version 2.6.23-rt1 ([EMAIL PROTECTED]) (gcc version 4.1.2 20070925 (Red Hat 4.1.2-27)) #5 SMP PREEMPT RT Sat Oct 13 09:36:19 CEST 2007
[0.00] Command line: ro root=/dev/VolGroup00/LogVol01 [EMAIL PROTECTED]/eth0,[EMAIL PROTECTED]/ nmi_watchdog=2
[0.00] BIOS-provided physical RAM map:
[0.00]  BIOS-e820:  - 0009fc00 (usable)
[0.00]  BIOS-e820: 0009fc00 - 000a (reserved)
[0.00]  BIOS-e820: 000e6000 - 0010 (reserved)
[0.00]  BIOS-e820: 0010 - 3ffb (usable)
[0.00]  BIOS-e820: 3ffb - 3ffc (ACPI data)
[0.00]  BIOS-e820: 3ffc - 3fff (ACPI NVS)
[0.00]  BIOS-e820: 3fff - 4000 (reserved)
[0.00]  BIOS-e820: fee0 - fee01000 (reserved)
[0.00]  BIOS-e820: ff78 - 0001 (reserved)
[0.00] end_pfn_map = 1048576
[0.00] DMI 2.3 present.
[0.00] ACPI: RSDP 000F7C80, 0014 (r0 ACPIAM)
[0.00] ACPI: RSDT 3FFB, 0034 (r1 A M I  OEMRSDT  12000521 MSFT   97)
[0.00] ACPI: FACP 3FFB0200, 0084 (r2 A M I  OEMFACP  12000521 MSFT   97)
[0.00] ACPI: DSDT 

Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-13 Thread Ingo Molnar

* Rok Markovic <[EMAIL PROTECTED]> wrote:

> I enabled netconsole but I am afraid that it won't help. Attached is
> the complete log captured over netconsole. While I was trying to set
> everything up, I got a lockup on linux-2.6.23-rc8, but I am not sure
> about this. But all lockups happened in VGA console mode. Any ideas?

hm:

[  104.224904] oprofile: using NMI interrupt.

could you disable CONFIG_OPROFILE? Maybe it interferes with the NMI 
watchdog?

what does /proc/interrupts look like shortly after bootup? Does a simple 
"intentional lockup" piece of code, which provokes a hard lockup from 
user-space, get properly zapped by the NMI watchdog (within a minute or 
so):

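# cat > lockup.c
#include <sys/io.h>	/* iopl() */

int main(void)
{
	/* raise the I/O privilege level so user-space may execute cli */
	if (iopl(3) != 0)
		return 1;
	/* disable interrupts and spin: only an NMI can get through */
	for (;;)
		asm volatile("cli");
}
Ctrl-D
# make lockup
# ./lockup
<hard hang>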

the NMI watchdog should produce a console message similar to:

 BUG: NMI Watchdog detected LOCKUP on CPU0, eip bff12345, registers:
 ...

( if you test ./lockup and get the message properly then i'd suggest a 
  new reboot - i think we inhibit further console output after an NMI 
  printout. )

Ingo
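
Once the console output is captured on another machine, the watchdog
message can be searched for automatically. The following is a minimal
sketch and not part of the original mail: the function name
nmi_lockup_cpus and the log file name are hypothetical, and the pattern
only assumes the message wording quoted above.

```shell
#!/bin/sh
# nmi_lockup_cpus LOGFILE
# Prints the CPU named in every "NMI Watchdog detected LOCKUP" line of a
# captured console log. Returns non-zero if no such line is present.
nmi_lockup_cpus() {
    sed -n 's/.*NMI Watchdog detected LOCKUP on \(CPU[0-9][0-9]*\).*/\1/p' "$1" |
        grep .    # grep fails when sed printed nothing
}

# Typical use (netconsole.log is a placeholder for the captured log):
#   nmi_lockup_cpus netconsole.log || echo "no NMI watchdog report in log"
```

An empty result after running ./lockup would suggest that the watchdog
itself is not firing, which is worth knowing before hunting the real lockup.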


Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-12 Thread Rok Markovic
I forgot the attachment.

Rok

Rok Markovic wrote:
> Hi
> 
> I enabled netconsole but I am afraid that it won't help. Attached is
> the complete log captured over netconsole. While I was trying to set
> everything up, I got a lockup on linux-2.6.23-rc8, but I am not sure
> about this. But all lockups happened in VGA console mode. Any ideas?
> 
> Rok
> 
> 
> Ingo Molnar wrote:
>>> I recompiled with those options enabled; the kernel boots and Linux 
>>> loads fine, but after a few minutes (if I try to recompile the kernel 
>>> in a console) the system locks up completely (hard reset) without any 
>>> messages. Is there anything I can do, maybe a console on the serial 
>>> port? My new config is attached.
>> yes, serial console (or netconsole) output would be useful, if you can 
>> solve that. Another way would be to boot with nmi_watchdog=2, do the 
>> kernel recompile in a VGA text console, and wait for the lockup to 
>> occur. Either you get some crash message to the text console 
>> immediately, or the NMI watchdog should print something within a 
>> minute or so. If neither happens and you just get a blank hard lockup, 
>> then the wedge is very deep ...
>>
>> (also make sure that the NMI counter in /proc/interrupts is increasing 
>> on all CPUs properly, with nmi_watchdog=2. If not all CPUs/cores are 
>> increasing their NMI counters once per second then the NMI watchdog won't 
>> be able to print out a stackdump.)
>>
>>  Ingo

[  859.950668] sd 2:0:0:1: [sdc] 494080 512-byte hardware sectors (253 MB)
[  859.951294] sd 2:0:0:1: [sdc] Write Protect is off
[  859.951313] sd 2:0:0:1: [sdc] Assuming drive cache: write through
[  860.618633] sd 2:0:0:1: [sdc] 494080 512-byte hardware sectors (253 MB)
[  860.619414] sd 2:0:0:1: [sdc] Write Protect is off
[  860.619433] sd 2:0:0:1: [sdc] Assuming drive cache: write through
[ 1007.878314] nfsd: last server has exited
[ 1007.926075] nfsd: unexporting all filesystems
[ 1324.899757] nfsd: last server has exited
[ 1324.952596] nfsd: unexporting all filesystems
[0.00] Linux version 2.6.23-rt1 ([EMAIL PROTECTED]) (gcc version 4.1.2 20070925 (Red Hat 4.1.2-27)) #4 SMP PREEMPT RT Fri Oct 12 22:07:05 CEST 2007
[0.00] Command line: ro root=/dev/VolGroup00/LogVol01 [EMAIL PROTECTED]/eth0,[EMAIL PROTECTED]/ nmi_watchdog=2
[0.00] BIOS-provided physical RAM map:
[0.00]  BIOS-e820:  - 0009fc00 (usable)
[0.00]  BIOS-e820: 0009fc00 - 000a (reserved)
[0.00]  BIOS-e820: 000e6000 - 0010 (reserved)
[0.00]  BIOS-e820: 0010 - 3ffb (usable)
[0.00]  BIOS-e820: 3ffb - 3ffc (ACPI data)
[0.00]  BIOS-e820: 3ffc - 3fff (ACPI NVS)
[0.00]  BIOS-e820: 3fff - 4000 (reserved)
[0.00]  BIOS-e820: fee0 - fee01000 (reserved)
[0.00]  BIOS-e820: ff78 - 0001 (reserved)
[0.00] end_pfn_map = 1048576
[0.00] DMI 2.3 present.
[0.00] ACPI: RSDP 000F7C80, 0014 (r0 ACPIAM)
[0.00] ACPI: RSDT 3FFB, 0034 (r1 A M I  OEMRSDT  12000521 MSFT   97)
[0.00] ACPI: FACP 3FFB0200, 0084 (r2 A M I  OEMFACP  12000521 MSFT   97)
[0.00] ACPI: DSDT 3FFB0450, 3CF4 (r1  75D8P 75D8P0044 INTL  2002026)
[0.00] ACPI: FACS 3FFC, 0040
[0.00] ACPI: APIC 3FFB0390, 0078 (r1 A M I  OEMAPIC  12000521 MSFT   97)
[0.00] ACPI: MCFG 3FFB0410, 003C (r1 A M I  OEMMCFG  12000521 MSFT   97)
[0.00] ACPI: OEMB 3FFC0040, 0046 (r1 A M I  AMI_OEM  12000521 MSFT   97)
[0.00] Zone PFN ranges:
[0.00]   DMA             0 ->     4096
[0.00]   DMA32        4096 ->  1048576
[0.00]   Normal    1048576 ->  1048576
[0.00] Movable zone start PFN for each node
[0.00] early_node_map[2] active PFN ranges
[0.00]     0:        0 ->      159
[0.00]     0:      256 ->   262064
[0.00] ACPI: PM-Timer IO Port: 0x808
[0.00] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[0.00] Processor #0 (Bootup-CPU)
[0.00] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
[0.00] Processor #1
[0.00] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
[0.00] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
[0.00] ACPI: IOAPIC (id[0x02] address[0xfec0] gsi_base[0])
[0.00] IOAPIC[0]: apic_id 2, address 0xfec0, GSI 0-23
[0.00] ACPI: IOAPIC (id[0x03] address[0xfecc] gsi_base[24])
[0.00] IOAPIC[1]: apic_id 3, address 0xfecc, GSI 24-47
[0.00] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[0.00] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level)
[0.00] Setting APIC routing to flat
[0.00] Using ACPI (MADT) for SMP configuration information
[0.00] Allocating PCI 

Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-12 Thread Rok Markovic
Hi

I enabled netconsole but I am afraid that it won't help. Attached is
the complete log captured over netconsole. While I was trying to set
everything up, I got a lockup on linux-2.6.23-rc8, but I am not sure
about this. But all lockups happened in VGA console mode. Any ideas?

Rok


Ingo Molnar wrote:
>> I recompiled with those options enabled; the kernel boots and Linux 
>> loads fine, but after a few minutes (if I try to recompile the kernel 
>> in a console) the system locks up completely (hard reset) without any 
>> messages. Is there anything I can do, maybe a console on the serial 
>> port? My new config is attached.
> 
> yes, serial console (or netconsole) output would be useful, if you can 
> solve that. Another way would be to boot with nmi_watchdog=2, do the 
> kernel recompile in a VGA text console, and wait for the lockup to 
> occur. Either you get some crash message to the text console 
> immediately, or the NMI watchdog should print something within a 
> minute or so. If neither happens and you just get a blank hard lockup, 
> then the wedge is very deep ...
> 
> (also make sure that the NMI counter in /proc/interrupts is increasing 
> on all CPUs properly, with nmi_watchdog=2. If not all CPUs/cores are 
> increasing their NMI counters once per second then the NMI watchdog won't 
> be able to print out a stackdump.)
> 
>   Ingo



Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-12 Thread Rok Markovic
Hi again!

I tried booting into VGA text mode with nmi_watchdog=2. The system loads
OK and the NMI counter is increasing on both CPUs (the processor is HT),
but when I start to load the CPU (compiling a kernel), the system locks up
without any message. Even the blinking cursor disappears. I will
try to set up netconsole logging, but I can't promise any results today.

Rok


Ingo Molnar wrote:
>> I recompiled with those options enabled; the kernel boots and Linux 
>> loads fine, but after a few minutes (if I try to recompile the kernel 
>> in a console) the system locks up completely (hard reset) without any 
>> messages. Is there anything I can do, maybe a console on the serial 
>> port? My new config is attached.
> 
> yes, serial console (or netconsole) output would be useful, if you can 
> solve that. Another way would be to boot with nmi_watchdog=2, do the 
> kernel recompile in a VGA text console, and wait for the lockup to 
> occur. Either you get some crash message to the text console 
> immediately, or the NMI watchdog should print something within a 
> minute or so. If neither happens and you just get a blank hard lockup, 
> then the wedge is very deep ...
> 
> (also make sure that the NMI counter in /proc/interrupts is increasing 
> on all CPUs properly, with nmi_watchdog=2. If not all CPUs/cores are 
> increasing their NMI counters once per second then the NMI watchdog won't 
> be able to print out a stackdump.)
> 
>   Ingo



Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-12 Thread Ingo Molnar

* Rok Markovic <[EMAIL PROTECTED]> wrote:

> I recompiled with those options enabled; the kernel boots and Linux 
> loads fine, but after a few minutes (if I try to recompile the kernel 
> in a console) the system locks up completely (hard reset) without any 
> messages. Is there anything I can do, maybe a console on the serial 
> port? My new config is attached.

yes, serial console (or netconsole) output would be useful, if you can 
solve that. Another way would be to boot with nmi_watchdog=2, do the 
kernel recompile in a VGA text console, and wait for the lockup to 
occur. Either you get some crash message to the text console 
immediately, or the NMI watchdog should print something within a 
minute or so. If neither happens and you just get a blank hard lockup, 
then the wedge is very deep ...

(also make sure that the NMI counter in /proc/interrupts is increasing 
on all CPUs properly, with nmi_watchdog=2. If not all CPUs/cores are 
increasing their NMI counters once per second then the NMI watchdog won't 
be able to print out a stackdump.)

Ingo
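
The NMI counter check can be scripted by comparing two saved copies of
/proc/interrupts. This is only a minimal sketch under stated assumptions,
not something posted in the thread: the function name nmi_advancing and
the snapshot file names are made up, and it assumes the NMI row starts
with "NMI:" followed by one count per CPU.

```shell
#!/bin/sh
# nmi_advancing BEFORE AFTER
# BEFORE and AFTER are files holding two snapshots of /proc/interrupts.
# Prints the NMI count change per CPU and returns non-zero if any CPU's
# count did not increase (or if AFTER has no NMI row at all).
nmi_advancing() {
    awk '
        FNR == 1 { file++ }              # count which input file we are in
        $1 != "NMI:" { next }            # only the NMI row is of interest
        file == 1 {                      # first snapshot: remember counts
            for (i = 2; i <= NF && $i ~ /^[0-9]+$/; i++) before[i] = $i
            next
        }
        {                                # second snapshot: compare
            for (i = 2; i <= NF && $i ~ /^[0-9]+$/; i++) {
                delta = $i - before[i]
                print "CPU" (i - 2) ": " before[i] " -> " $i " (delta " delta ")"
                if (delta <= 0) stalled = 1
            }
            seen = 1
        }
        END { exit ((seen && !stalled) ? 0 : 1) }
    ' "$1" "$2"
}

# Typical use on the test machine:
#   cat /proc/interrupts > irq.1; sleep 2; cat /proc/interrupts > irq.2
#   nmi_advancing irq.1 irq.2 || echo "NMI counter stalled on some CPU"
```

Keeping the snapshots in files rather than reading /proc twice inside the
function makes it possible to run the same check on counters pasted into a
mail.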


Re: 2.6.23-rt1 lockup after loading HAL deamon

2007-10-12 Thread Ingo Molnar

does anything get printed prior to the lockup if you enable all of these:

# CONFIG_DEBUG_SLAB is not set
# CONFIG_DEBUG_RT_MUTEXES is not set
# CONFIG_RT_MUTEX_TESTER is not set
# CONFIG_DEBUG_SPINLOCK is not set
# CONFIG_DEBUG_LOCK_ALLOC is not set
# CONFIG_PROVE_LOCKING is not set
# CONFIG_DEBUG_SPINLOCK_SLEEP is not set
# CONFIG_DEBUG_VM is not set
# CONFIG_DEBUG_LIST is not set
# CONFIG_FRAME_POINTER is not set

?

Ingo
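
Enabling the options means turning each "# CONFIG_... is not set" line
into "CONFIG_...=y" in .config and rebuilding. The following is only a
minimal sketch and not from the original mail: the helper name enable_opts
is hypothetical, and it leaves dependency resolution to a later
"make oldconfig".

```shell
#!/bin/sh
# enable_opts CONFIG_FILE SYMBOL...
# Prints CONFIG_FILE to stdout with every "# CONFIG_<SYMBOL> is not set"
# line for the given symbols rewritten to "CONFIG_<SYMBOL>=y".
# All other lines pass through unchanged.
enable_opts() {
    cfg=$1
    shift
    awk -v want="$*" '
        BEGIN {
            n = split(want, s, " ")
            for (i = 1; i <= n; i++) on["CONFIG_" s[i]] = 1
        }
        $1 == "#" && ($2 in on) && $3 == "is" { print $2 "=y"; next }
        { print }
    ' "$cfg"
}

# Typical use in the kernel source tree:
#   enable_opts .config DEBUG_SLAB DEBUG_RT_MUTEXES RT_MUTEX_TESTER \
#       DEBUG_SPINLOCK DEBUG_LOCK_ALLOC PROVE_LOCKING DEBUG_SPINLOCK_SLEEP \
#       DEBUG_VM DEBUG_LIST FRAME_POINTER > .config.new
#   mv .config.new .config
#   make oldconfig    # let Kconfig settle dependencies of the new options
```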

