This is the first time I had kdb running.  All other times I lost the console
from the deadlock.  The deadlock is bad enough to prevent any more access to
basically everything.  I'm still running the same kernel and will do so for
several more days.  It usually happens every two to three days.

Chris Mason wrote:

> On Friday, November 10, 2000 06:15:40 -0800 David Ford <[EMAIL PROTECTED]>
> wrote:
>
> > Over the last three weeks my box has been locking up w/ a black screen
> > of death.  This time I had kdb patched in and got the following:
> >
> > Entering kdb (current=0xcf906000, pid 16808) Panic: invalid operand
> > due to panic @ 0xc0163d7a
>
> Odd.  There is nothing in de_still_valid that should panic, unless there is
> some major memory corruption going on.  Do you always get the same trace?
>
> [ ... ]
>
> > I have been writing code on it for the last two days straight.  It was
> > fully functional until I left for 15 minutes for a shower.  I came back
> > and the system is hosed.  Everything is quickly going to D state.  I can
> > move and type until I attempt to execute or reference anything.  It's
> > all downhill from there.
> >
> > It is 2.4.0-test11-pre1 with reiserfs 3.6.18.
> >
> > With kdb, after the panic happens, I can hit 'sr s' then 'g', it will
> > OOPS (process sendmail) then continue.  Without kdb, I am SOL and have
> > to hit the power button.  sysrq won't react.
> >
>
> Once you get inside reiserfs_rename, you've started a transaction.  If you
> oops inside there, the transaction never finishes, and all the other
> transactions end up waiting on it.  So, if you can continue without hanging
> the box, the oops didn't happen in reiserfs_rename ;-)  Could you send a
> decoded version?
>
> -chris

No, I can't continue.  Everything is halted beyond that.  I can type into an
rxvt only until I hit enter, then that rxvt is hosed.  I cannot even jump from
X to console.  Even input via the serial console was dead.  Luckily the
keyboard remained alive, I suspect it was only from the grace of kdb.
Normally it is also dead.

>>EIP; c0163d7a <de_still_valid+3e/dc>   <=====
Trace; c02ba2e5 <devfsd_buf_size+1c45/8e58>
Trace; c02ba37d <devfsd_buf_size+1cdd/8e58>
Trace; c0163e31 <entry_points_to_object+19/80>
Trace; c01642fa <reiserfs_rename+432/748>
Trace; c02bf60f <devfsd_buf_size+6f6f/8e58>
Trace; c0139c94 <vfs_rename_other+26c/2c4>
Trace; c0139d25 <vfs_rename+39/90>
Trace; c0139ef9 <sys_rename+17d/204>
Trace; c010aadb <system_call+33/38>
Code;  c0163d7a <de_still_valid+3e/dc>
00000000 <_EIP>:
Code;  c0163d7a <de_still_valid+3e/dc>   <=====
   0:   0f 0b                     ud2a      <=====
Code;  c0163d7c <de_still_valid+40/dc>
   2:   8b 7d c4                  mov    0xffffffc4(%ebp),%edi
Code;  c0163d7f <de_still_valid+43/dc>
   5:   8b 45 bc                  mov    0xffffffbc(%ebp),%eax
Code;  c0163d82 <de_still_valid+46/dc>
   8:   8b 4d c8                  mov    0xffffffc8(%ebp),%ecx
Code;  c0163d85 <de_still_valid+49/dc>
   b:   0f b7 57 14               movzwl 0x14(%edi),%edx
Code;  c0163d89 <de_still_valid+4d/dc>
   f:   03 50 34                  add    0x34(%eax),%edx
Code;  c0163d8c <de_still_valid+50/dc>
  12:   89 c8                     mov    %ecx,%eax

-d


--
"The difference between 'involvement' and 'commitment' is like an
eggs-and-ham breakfast: the chicken was 'involved' - the pig was
'committed'."


begin:vcard 
n:Ford;David
x-mozilla-html:TRUE
adr:;;;;;;
version:2.1
email;internet:[EMAIL PROTECTED]
title:Blue Labs Developer
x-mozilla-cpt:;14688
fn:David Ford
end:vcard

Reply via email to