Hi,
I have a bit of a strange situation occurring with opsview-web (Community
Edition Version: 3.11.1)
This was last upgraded in Feb and has been running well since.
We are running it as a clustered service using heartbeat/pacemaker if that
makes any difference.
Yesterday we needed to restart the machine.
After that, opsview-web was using a lot of resources (100% CPU on an 8 core
box!),
and was (what looked like), continually recycling this script:
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
i.e. starting up multiple copies and then stopping them, but always quite a few
running.
I managed to work out that if this command was run manually from a shell,
it is fine (normal CPU and no multiple copies starting, etc).
However, starting opsview-web using the init script results in the bad
behaviour.
Theopsview-web log with DEBUG enabled doesnt give any useful info.
I found some similar issues on the forums and added this to opsview.conf:
$use_prefork = 1;
That seems to have stabilized the machine, theopsview_web_server.pl processes
(although a lot of them now), seem to be fairly static.
However, I can see the CPU slowly creeping up again, heading towards 100%.
# ps auxf
nagios 28974 0.2 1.4 432764 172424 ? Ss Jun14 1:32 perl
script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 26954 10.4 1.4 440612 181248 ? S 07:36 11:49 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 13495 10.4 1.4 440676 181212 ? S 07:53 9:57 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 26760 10.4 1.4 440552 181004 ? S 08:05 8:40 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 474 10.7 1.4 439900 180540 ? S 09:09 2:04 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 475 11.4 1.4 440476 181008 ? S 09:09 2:11 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 476 11.3 1.4 440456 181088 ? S 09:09 2:10 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 478 11.0 1.4 440860 181436 ? R 09:09 2:07 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 485 11.1 1.4 440004 180536 ? S 09:09 2:08 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1209 10.6 1.4 440016 180636 ? S 09:10 1:59 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1210 10.4 1.4 440296 180920 ? S 09:10 1:58 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1211 10.3 1.4 440168 180684 ? S 09:10 1:56 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1213 10.5 1.4 440808 181448 ? S 09:10 1:58 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1214 11.2 1.4 440456 181100 ? S 09:10 2:06 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1215 10.8 1.4 440504 181140 ? S 09:10 2:02 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1216 11.1 1.4 439116 179576 ? S 09:10 2:05 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1219 10.9 1.4 440092 180664 ? S 09:10 2:03 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1220 10.7 1.4 440424 181060 ? S 09:10 2:00 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 1234 11.5 1.4 440428 181048 ? S 09:10 2:09 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 5231 10.9 1.4 440188 180764 ? S 09:13 1:39 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 6235 10.9 1.4 440280 180860 ? S 09:14 1:33 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 7000 11.3 1.4 443296 181956 ? S 09:15 1:30 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 10570 10.1 1.4 440388 180872 ? S 09:18 1:02 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 21779 11.6 1.4 434980 175380 ? S 09:28 0:01 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
nagios 21887 0.0 1.3 432764 171528 ? S 09:28 0:00 \_
perl script/opsview_web_server.pl -f --background --pidfile
/var/run/opsview/opsview-web.pid --host 0.0.0.0
I just cant work out why this would suddenly start happening now.
It has been running fine for several months.
Any ideas?
Thanks.
Regards,
Andy.
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
_______________________________________________
Opsview-users mailing list
[email protected]
http://lists.opsview.org/lists/listinfo/opsview-users