Hi,

I have a bit of a strange situation occurring with opsview-web  (Community 
Edition Version: 3.11.1)
This was last upgraded in Feb and has been running well since.
We are running it as a clustered service using heartbeat/pacemaker if that 
makes any difference.

Yesterday we needed to restart the machine.
After that, opsview-web was using a lot of resources (100% CPU on an 8 core 
box!),
and was (what looked like), continually recycling this script:

perl script/opsview_web_server.pl -f --background --pidfile 
/var/run/opsview/opsview-web.pid --host 0.0.0.0

i.e. starting up multiple copies and then stopping them, but always quite a few 
running.
I managed to work out that if this command was run manually from a shell,
it is fine (normal CPU and no multiple copies starting, etc).
However, starting opsview-web using the init script results in the bad 
behaviour.
Theopsview-web  log with DEBUG enabled doesnt give any useful info.

I found some similar issues on the forums and added this to opsview.conf:
$use_prefork  = 1;

That seems to have stabilized the machine, theopsview_web_server.pl  processes 
(although a lot of them now), seem to be fairly static.
However, I can see the CPU slowly creeping up again, heading towards 100%.


# ps auxf

nagios 28974 0.2 1.4 432764 172424 ? Ss Jun14 1:32 perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 26954 10.4 1.4 440612 181248 ? S 07:36 11:49 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 13495 10.4 1.4 440676 181212 ? S 07:53 9:57 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 26760 10.4 1.4 440552 181004 ? S 08:05 8:40 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 474 10.7 1.4 439900 180540 ? S 09:09 2:04 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 475 11.4 1.4 440476 181008 ? S 09:09 2:11 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 476 11.3 1.4 440456 181088 ? S 09:09 2:10 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 478 11.0 1.4 440860 181436 ? R 09:09 2:07 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 485 11.1 1.4 440004 180536 ? S 09:09 2:08 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1209 10.6 1.4 440016 180636 ? S 09:10 1:59 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1210 10.4 1.4 440296 180920 ? S 09:10 1:58 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1211 10.3 1.4 440168 180684 ? S 09:10 1:56 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1213 10.5 1.4 440808 181448 ? S 09:10 1:58 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1214 11.2 1.4 440456 181100 ? S 09:10 2:06 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1215 10.8 1.4 440504 181140 ? S 09:10 2:02 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1216 11.1 1.4 439116 179576 ? S 09:10 2:05 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1219 10.9 1.4 440092 180664 ? S 09:10 2:03 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1220 10.7 1.4 440424 181060 ? S 09:10 2:00 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 1234 11.5 1.4 440428 181048 ? S 09:10 2:09 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 5231 10.9 1.4 440188 180764 ? S 09:13 1:39 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 6235 10.9 1.4 440280 180860 ? S 09:14 1:33 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 7000 11.3 1.4 443296 181956 ? S 09:15 1:30 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 10570 10.1 1.4 440388 180872 ? S 09:18 1:02 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 21779 11.6 1.4 434980 175380 ? S 09:28 0:01 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0 nagios 21887 0.0 1.3 432764 171528 ? S 09:28 0:00 \_ perl script/opsview_web_server.pl -f --background --pidfile /var/run/opsview/opsview-web.pid --host 0.0.0.0


I just cant work out why this would suddenly start happening now.
It has been running fine for several months.
Any ideas?

Thanks.

Regards,
Andy.


--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

_______________________________________________
Opsview-users mailing list
[email protected]
http://lists.opsview.org/lists/listinfo/opsview-users

Reply via email to