Splunk On Jul 19, 2011 11:52 AM, "Steve Dibb" <[email protected]> wrote: > Hey guys, > > So this is off-topic from PHP in general, but I know there's sysadmins > on here too, so maybe you guys can help me out. :) > > I had a server go down this morning :sadface:, and looking at the logs > from monit, something happened to suddenly spike the CPU and suck up all > the RAM. > > These just popped up out of the blue: > > [MDT Jul 19 07:53:39] error : 'localhost' cpu system usage of 54.7% > matches resource limit [cpu system usage>30.0%] > [MDT Jul 19 07:53:45] error : 'localhost' swap usage of 100.0% > matches resource limit [swap usage>25.0%] > [MDT Jul 19 07:53:51] error : 'localhost' mem usage of 98.5% matches > resource limit [mem usage>75.0%] > > I can tell it wasn't gradual buildup of cpu/swap/memory usage, since > this was the first alert, and monit runs every 2 minutes. So, I assume > something happened to spike the server and bring everything down. > > I've got two questions -- how do you guys usually go about monitoring > this stuff? Monit can check the system general usage, but how do I know > which applications are doing that? > > My second question is, where in the world do you start to diagnose > something like this? Looking at the system and apache logs, it looks > like everything just STOPPED. There's no red flags that I can see, so > I'm having a hard time diagnosing it. > > Thanks guys, any help is appreciated. > > Steve > > _______________________________________________ > > UPHPU mailing list > [email protected] > http://uphpu.org/mailman/listinfo/uphpu > IRC: #uphpu on irc.freenode.net
_______________________________________________ UPHPU mailing list [email protected] http://uphpu.org/mailman/listinfo/uphpu IRC: #uphpu on irc.freenode.net
