Regarding the "deadlock" report that I occasionally see on my CPU
server console, I won't bore anyone with PC addresses or anything like
that, but I will recommend something I believe to be a possible
trigger: the failure always seems to occur within "exportfs", which in
this case is used exclusively to run stats(1) remotely from my
workstation.  So the recommendation is that somebody like Erik, who is
infinitely more clued up than I am in the kernel arcana should run one
or more stats sessions into a cpu server (I happen to be running
fossil, so maybe Erik won't see this) and see if he can also trigger this 
behaviour.  I'm hoping that it is not platform specific.

Right now, I'm short of skills as well as a serial console :-(

++L

PS: here is a kmesg from the server:

        Plan 9
        E820: 00000000 0009fc00 memory
        E820: 0009fc00 000a0000 reserved
        E820: 000e0000 00100000 reserved
        E820: 00100000 47740000 memory
        E820: 47740000 47750000 acpi reclaim
        E820: 47750000 47800000 acpi nvs
        126 holes free
        00018000 0009f000 552960
        00468000 0642b000 100413440
        100966400 bytes free
        cpu0: 2599MHz GenuineIntel PentiumIV/Xeon (cpuid: AX 0x0F29 DX 
0xBFEBFBFF)
        ELCR: 0E28
        #l0: i82557: 100Mbps port 0xDC00 irq 11: 00111104e0b6
        1143M memory: 100M kernel data, 1043M user, 1668M swap
        root is from (tcp, local)[local!#S/sdC0/fossil]: time...
        venti...2010/1115 17:36:16 venti: conf.../boot/venti: mem 31,972,556 
bcmem 63,945,112 icmem 95,917,670...httpd tcp!127.1!8000...init...icache 
95,917,670 bytes = 1,498,714 entries; 16 scache
        sync...announce tcp!127.1!17034...serving.
        fossil(#S/sdC0/fossil)...fsys: dialing venti at tcp!127.1!17034
        version...time...
        
        init: starting /bin/rc

which also supplies:

lock 0xf09d8980 loop key 0xdeaddead pc 0xf01e736a held by pc 0xf01e736a proc 
2052
 17: #I0tcpack pc f01ff12a dbgpc        0   Running (Running) ut 530 st 0 bss 0 
qpc f014583c nl 0 nd 0 lpc f01e2cc8 pri 13
2052:  exportfs pc f01efc9f dbgpc     94ad    Pwrite (Ready) ut 43 st 209 bss 
40000 qpc f0145b62 nl 1 nd 0 lpc f01e2c60 pri 10

and, a bit later:

lock 0xf0057d74 loop key 0xdeaddead pc 0xf01e736a held by pc 0xf01e736a proc 
2052
 61:etherread4 pc f01ef8a0 dbgpc        0   Running (Running) ut 2923 st 0 bss 
0 qpc f0148c8a nl 0 nd 0 lpc f0100f6e pri 13
2052:  exportfs pc f01e7377 dbgpc     94ad    Pwrite (Ready) ut 55 st 270 bss 
40000 qpc f0145b62 nl 1 nd 0 lpc f01e2c60 pri 10

to my surprise.


Reply via email to