On Sun, 22 Mar 2009 16:21:15 -0700 Michael Schuster <Michael.Schuster at Sun.COM> wrote:
> Brett Monroe wrote: > > Hey all, > > > > I am seeing an issue on one of our Solaris 10 servers and I would like > > to get more insight into what is going on. I suspect it is a kernel > > bug and I think mdb is the only way I can look into the kernel to see > > what's going on (with respect to this issue). My mdb skills are close > > to non-existent so please bear with me. :) Anyway, here is what I am > > seeing: > > > > The Server is running Solaris 10 Kernel 138888-02. I have some > > processes that appear in the process table and in /proc but they won't > > die if killed and can't be trussed and p* commands fail with the error > > "no such process." > > do they appear in a 'ps -ef' listing, perhaps as "defunct"? in that case, > you have so-called zombies, which are processed that have exited but whose > exit code still needs to be reaped. Hi Brett, a mate just pointed out that you might have come across the situation where if the proc is large enough and it recently received a signal, it could be in the process of dumping core. iirc you'd want to check the p_siginfo part of the proc structure to make sure of that. cheers, James -- Senior Kernel Software Engineer, Solaris Sun Microsystems http://blogs.sun.com/jmcp http://www.jmcp.homeunix.com/blog