Bug#339080: Frequent crash in handle_IRQ_event on alpha with kernel 2.6

2006-01-06 Thread Norbert Tretkowski
* Norbert Tretkowski wrote:
> Alpha was switched to the generic irq framework a few hours ago in
> the git-tree. I'm thinking about adding that patch to 2.6.15-2.

http://www.kernel.org/git/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=eee45269b0f5979c70bc151c6c2f4e5f4f5ababe
http://www.kernel.org/git/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0595bf3bca9d9932a05b06dd438f40f01d27cd33

Norbert


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Bug#339080: Frequent crash in handle_IRQ_event on alpha with kernel 2.6

2006-01-06 Thread Norbert Tretkowski
tags 339080 +pending
thanks

* Uwe Schindler wrote:
> Looking through the kernel sources of the different architectures I
> have seen that almost all architectures use the same irq.c code. In
> newer kernels (> 2.6.8), for example, x86, ia64, amd64, powerpc and
> parisc changed to generic IRQ handler code. Some of the others have
> not been changed yet; others have different IRQ handlers.

Alpha was switched to the generic irq framework a few hours ago in the
git-tree. I'm thinking about adding that patch to 2.6.15-2.

Thanks for your time, Norbert





Bug#339080: Frequent crash in handle_IRQ_event on alpha with kernel 2.6

2005-11-14 Thread Uwe Schindler

Package: linux-2.6
Tags: patch

Since the beginning of 2005 I have tried various 2.6.x kernel images
on my AlphaStation 500/500:

=
cpu : Alpha
cpu model   : EV56
cpu variation   : 7
cpu revision: 0
cpu serial number   :
system type : Alcor
system variation: Alcor
system revision : 0
system serial number:
cycle frequency [Hz]: 5
timer frequency [Hz]: 1024.00
page size [bytes]   : 8192
phys. address bits  : 40
max. addr. space #  : 127
BogoMIPS: 994.44
kernel unaligned acc: 0 (pc=0,va=0)
user unaligned acc  : 0 (pc=0,va=0)
platform string : Digital AlphaStation 500/500
cpus detected   : 1
L1 Icache   : 8K, 1-way, 32b line
L1 Dcache   : 8K, 1-way, 32b line
L2 cache: 96K, 3-way, 64b line
L3 cache: 8192K, 1-way, 64b line

The kernels I tried were 2.6.8-1, 2.6.8-2, 2.6.10, 2.6.12,... All of
them crash on this machine with the following message (ksymoops):

ksymoops 2.4.9 on alpha 2.6.8-2-generic.  Options used
 -V (default)
 -k /proc/ksyms (default)
 -l /proc/modules (default)
 -o /lib/modules/2.6.8-2-generic/ (default)
 -m /boot/System.map-2.6.8-2-generic (default)

Warning: You did not tell me where to find symbol information.  I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc.  ksymoops -h explains the options.

Error (regular_file): read_ksyms stat /proc/ksyms failed
No modules in ksyms, skipping objects
No ksyms, skipping lsmod
Trace:
[] handle_IRQ_event+0x74/0xf0
[] handle_irq+0xe0/0x1c0
[] srm_device_interrupt+0x24/0x40
[] do_entInt+0xf4/0x140
[] ret_from_sys_call+0x0/0x10
[] default_idle+0x0/0x10
[] cpu_idel+0x58/0x80
[] default_idle+0x0/0x10
[] default_idle+0x0/0x10
[] rest_init+0x34/0x50
[] __start+0x1c/0x20
Code: 243f0010  245f0020  21c10100  21a20200  a4490008  a429 
 b422

Using defaults from ksymoops -t elf64-alpha -a alpha


Trace; fc31a164 
Trace; fc31ab50 
Trace; fc329b04 
Trace; fc31b1f4 
Trace; fc315260 
Trace; fc316e30 
Trace; fc316e98 
Trace; fc316e30 
Trace; fc316e30 
Trace; fc310234 
Trace; fc31001c <_stext+1c/20>

Code;  ffe8 
 <_PC>:
Code;  ffe8 
   0:   10 00 3f 24   ldah t0,16
Code;  ffec 
   4:   20 00 5f 24   ldah t1,32
Code;  fff0 
   8:   00 01 c1 21   lda  s5,256(t0)
Code;  fff4 
   c:   00 02 a2 21   lda  s4,512(t1)
Code;  fff8 
  10:   08 00 49 a4   ldq  t1,8(s0)
Code;  fffc 
  14:   00 00 29 a4   ldq  t0,0(s0)
Code;   Before first symbol
  18:   08 00 41 b4   stq  t1,8(t0)
Code;  0004 Before first symbol
  1c:   00 00 22 b4   stq  t0,0(t1)

Kernel panic: Aiee, killing interrupt handler!
=
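
As a reading aid for the disassembly above, here is a minimal sketch in C of
what the eight decoded instructions compute. The two ldah/lda pairs build the
constants 0x100100 and 0x200200, which coincide with the list-poison values in
the 2.6 kernel's list.h, and the two ldq/stq pairs have the shape of unlinking
an entry from a doubly linked list. The names alpha_ldah, alpha_lda, struct
node, unlink_node and decode_check are hypothetical and exist only for this
illustration; the dump alone does not establish which kernel function this
code belongs to or which instruction faulted.

```c
#include <assert.h>
#include <stdint.h>

/* Alpha LDAH: base + (signed 16-bit displacement * 65536). */
int64_t alpha_ldah(int64_t base, int16_t disp)
{
	return base + (int64_t)disp * 65536;
}

/* Alpha LDA: base + signed 16-bit displacement. */
int64_t alpha_lda(int64_t base, int16_t disp)
{
	return base + disp;
}

/* Hypothetical two-link node: offsets 0 and 8 on a 64-bit target. */
struct node {
	struct node *next;
	struct node *prev;
};

/* Same order of loads and stores as offsets 0x10..0x1c in the dump. */
void unlink_node(struct node *entry)
{
	struct node *prev = entry->prev;	/* ldq t1,8(s0) */
	struct node *next = entry->next;	/* ldq t0,0(s0) */

	next->prev = prev;			/* stq t1,8(t0) */
	prev->next = next;			/* stq t0,0(t1) */
}

/* Checks the constant arithmetic and the unlink on a 3-node ring. */
int decode_check(void)
{
	struct node a, b, c;

	/* ldah t0,16 ; lda s5,256(t0) */
	assert(alpha_lda(alpha_ldah(0, 16), 256) == 0x100100);
	/* ldah t1,32 ; lda s4,512(t1) */
	assert(alpha_lda(alpha_ldah(0, 32), 512) == 0x200200);

	a.next = &b; b.next = &c; c.next = &a;
	a.prev = &c; b.prev = &a; c.prev = &b;

	unlink_node(&b);
	assert(a.next == &c && c.prev == &a);
	return 1;
}
```

Calling decode_check() from a small main() exercises both parts. If the
entry's next or prev pointer were stale or corrupted, the two stores are where
such a sequence would fault, but that remains an assumption here.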

The time of the crash varies: sometimes it happens after 3 hours,
sometimes after two days, but mostly while the machine is idle. I do
not think a particular device driver is at fault, because the crash
always occurs inside arch/alpha/kernel/irq.c in the function
handle_IRQ_event. It could be a problem after the call to a driver's
interrupt handler, but the crash also happens with changed
hardware/drivers (2 different drivers for SCSI, 3 different drivers
for Ethernet, with/without SATA, with/without USB). Nevertheless,
here is the hardware configuration:


=
:00:06.0 Ethernet controller: Digital Equipment Corporation 
DECchip 21040 [Tulip] (rev 26)

Flags: bus master, medium devsel, latency 255, IRQ 29
I/O ports at 9400 [size=128]
Memory at 022dd000 (32-bit, non-prefetchable) [size=128]

:00:07.0 RAID bus controller: Silicon Image, Inc. (formerly CMD 
Technology Inc) SiI 3114 [SATALink/SATARaid] Serial ATA Controller (rev 02)
Subsystem: Silicon Image, Inc. (formerly CMD Technology Inc) 
SiI 3114 SATARaid Controller

Flags: bus master, 66MHz, medium devsel, latency 240, IRQ 24
I/O ports at 9810 [size=8]
I/O ports at 9820 [size=4]
I/O ports at 9818 [size=8]
I/O ports at 9824 [size=4]
I/O ports at 9800 [size=16]
Memory at 022db000 (32-bit, non-prefetchable) [size=1K]
Expansion ROM at 0220 [disabled] [size=512K]
Capabilities: [60] Power Management version 2

:00:08.0 VGA