On Sun, 22 Mar 2009 16:21:15 -0700
Michael Schuster <Michael.Schuster at Sun.COM> wrote:

> Brett Monroe wrote:
> > Hey all,
> > 
> > I am seeing an issue on one of our Solaris 10 servers and I would like
> > to get more insight into what is going on.  I suspect it is a kernel
> > bug and I think mdb is the only way I can look into the kernel to see
> > what's going on (with respect to this issue).  My mdb skills are close
> > to non-existent so please bear with me. :)  Anyway, here is what I am
> > seeing:
> > 
> > The Server is running Solaris 10 Kernel 138888-02.  I have some
> > processes that appear in the process table and in /proc but they won't
> > die if killed and can't be trussed and p* commands fail with the error
> > "no such process."  
> 
> do they appear in a 'ps -ef' listing, perhaps as "defunct"? in that case, 
> you have so-called zombies, which are processed that have exited but whose 
> exit code still needs to be reaped.


Hi Brett,
a mate just pointed out that you might have come across the
situation where if the proc is large enough and it recently
received a signal, it could be in the process of dumping core.

iirc you'd want to check the p_siginfo part of the proc structure
to make sure of that.


cheers,
James
--
Senior Kernel Software Engineer, Solaris
Sun Microsystems
http://blogs.sun.com/jmcp       http://www.jmcp.homeunix.com/blog

Reply via email to