Hello,
I've just copied my my.cnf and edited the original. Rebooted the server just to
be sure (reboot cmd).
Still no update on my status screens in opsview , still shows last check some
hours ago and clicking on it shows last check some minutes ago (less then set
time of 5 mins).
my opsview/opsview-web.log last entries :
[2009/12/13 11:40:57] [Opsview.Web.Controller.Root] [ERROR] Errors encountered:
DBIx::Class::ResultSet::first(): DBI Connection failed: DBI
connect('database=opsview;host=l$
[2009/12/13 11:40:57] [Catalyst] [ERROR] DBIx::Class::ResultSet::first(): DBI
Connection failed: DBI connect('database=opsview;host=localhost','opsview',...)
failed: Can't $
[2009/12/13 11:40:57] [Catalyst] [ERROR] Caught exception in
Opsview::Web::Controller::Root->end "Can't insert new Opsview::Auditlog: DBI
connect('database=opsview;host=loc$
at /usr/local/opsview-web/script/../lib/Opsview/Web/Controller/Root.pm line
344"
[2009/12/13 11:55:48] [Opsview.Web.Controller.Root] [INFO] Username 'admin'
logged in via auth_tkt
Seems it's unable to connect to the database. Or do I read this error
incorrectly ?
my user.log last entries :
Dec 13 11:46:38 nagios ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_servicechecks SET instance_id='1', service_object_id='6778',
check_type='0', current_chec$
Dec 13 11:46:38 nagios ndo2db: mysql_error: 'Lock wait timeout exceeded; try
restarting transaction'
After the reboot at 11:48 no more messages in te user.log
I also wanted to note the following :
One of my hosts had downtime planned. This downtime is show in opsview status
page. However when I click on it and remove it , it is not removed. I've waited
already 10 minutes to see it go away , but it's still marked as downtime , and
I still can remove it (again and again..)
Would be great to get it fixed. If we cannot fix it quickly , I might have to
rebuild the machine as we depend on the monitoring for now.
________________________________
Van: [email protected] namens James Whittington
Verzonden: zo 13-12-2009 4:46
Aan: Opsview Users
Onderwerp: Re: [opsview-users] Status Page contains old info
Peter,
That is an interesting MySQL error, I haven't run across that before but a
google search brought up some good hits on "total number of locks exceeds the
lock table size"
Here is one such link..
http://mrothouse.wordpress.com/2006/10/20/mysql-error-1206/
<http://mrothouse.wordpress.com/2006/10/20/mysql-error-1206/>
It appears others were able to resolve the error by increasing the
innodb_buffer_pool_size
<http://dev.mysql.com/doc/refman/4.1/en/innodb-parameters.html> variable in
/etc/my.cnf.
I would suggest making a copy of my.cnf and they try to adjust that variable
and restarting mysql to see if that makes a difference.
Good luck..
James Whittington
VC3, Inc.
From: [email protected]
[mailto:[email protected]] On Behalf Of Peter Plate
Sent: Saturday, December 12, 2009 4:49 PM
To: Opsview Users
Subject: Re: [opsview-users] Status Page contains old info
Syslog :
Dec 12 22:35:02 nagios /USR/SBIN/CRON[11973]: (nagios) CMD
(/usr/local/nagios/bin/call_nmis nmis.pl type=collect mthread=true > /dev/null
2>&1)
Dec 12 22:35:02 nagios /USR/SBIN/CRON[11972]: (root) CMD (if [ -x /usr/bin/mrtg
] && [ -r /etc/mrtg.cfg ]; then env LANG=C /usr/bin/mrtg /etc/mrtg.cfg >>
/var/log/mrtg/mrtg$
Dec 12 22:35:03 nagios /USR/SBIN/CRON[11975]: (nagios) CMD
(/usr/local/nagios/bin/mrtg_genstats.sh > /dev/null 2>&1)
Dec 12 22:35:15 nagios ndo2db: Successfully disconnected from MySQL database
Dec 12 22:35:15 nagios ndo2db: Successfully connected to MySQL database
Dec 12 22:36:14 nagios ndo2db: Successfully disconnected from MySQL database
Dec 12 22:36:14 nagios ndo2db: Successfully connected to MySQL database
Dec 12 22:36:34 nagios ndo2db: Successfully disconnected from MySQL database
Dec 12 22:36:34 nagios ndo2db: Successfully connected to MySQL database
There are some older items which contain innodb instructions but seem ok to me.
Opsview is still 3.03.2290
My user.log contains the following errors :
Dec 12 16:48:31 nagios ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_servicechecks SET instance_id='1', service_object_id='7636',
check_type='0', current_chec$
Dec 12 16:48:31 nagios ndo2db: mysql_error: 'The total number of locks exceeds
the lock table size'
Dec 12 16:48:31 nagios ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_servicechecks SET instance_id='1', service_object_id='3663',
check_type='0', current_chec$
Dec 12 16:48:31 nagios ndo2db: mysql_error: 'The total number of locks exceeds
the lock table size'
{Truncated !}
They seem to be related to the instert fails. However Í really don't know how
to fix it.
________________________________
From: [email protected]
[mailto:[email protected]] On Behalf Of Ton Voon
Sent: zaterdag 12 december 2009 22:33
To: Opsview Users
Subject: Re: [opsview-users] Status Page contains old info
On 12 Dec 2009, at 21:13, Peter Plate wrote:
Hello All,
Last night our server with OpsView stopped responding , so we needed to hard
reboot it (power off / on).
Now it seems the checks are ok and still scheduled , however our status page
contains old / outdated info.
For example a service check had a problem today at 1:00 am , it was ok at 1:15
am (restart of service) , even the status of the service check shows ok now
(clicking on it shows status : OK , and last check time was some minutes ago)
But on the status page (Alerts > All Unhandled) it is still in CRITICAL state ,
and last check is about 8 hours ago , while it is checked every 5 minutes.
Can you help us get the Opsview pages in sync again ?
PS : My Unix knowledge is limited , the system was installed by previous system
administrators , however I learn quickly.
You likely have some database problem where the data is not being inserted
correctly. Check syslog messages for mysql problems.
Which version of Opsview are you using?
Ton
Hello,
I've just copied my my.cnf and edited the original. Rebooted the server just to
be sure (reboot cmd).
Still no update on my status screens in opsview , still shows last check some
hours ago and clicking on it shows last check some minutes ago (less then set
time of 5 mins).
my opsview/opsview-web.log last entries :
[2009/12/13 11:40:57] [Opsview.Web.Controller.Root] [ERROR] Errors encountered:
DBIx::Class::ResultSet::first(): DBI Connection failed: DBI
connect('database=opsview;host=l$
[2009/12/13 11:40:57] [Catalyst] [ERROR] DBIx::Class::ResultSet::first(): DBI
Connection failed: DBI connect('database=opsview;host=localhost','opsview',...)
failed: Can't $
[2009/12/13 11:40:57] [Catalyst] [ERROR] Caught exception in
Opsview::Web::Controller::Root->end "Can't insert new Opsview::Auditlog: DBI
connect('database=opsview;host=loc$
at /usr/local/opsview-web/script/../lib/Opsview/Web/Controller/Root.pm line
344"
[2009/12/13 11:55:48] [Opsview.Web.Controller.Root] [INFO] Username 'admin'
logged in via auth_tkt
Seems it's unable to connect to the database. Or do I read this error
incorrectly ?
my user.log last entries :
Dec 13 11:46:38 nagios ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_servicechecks SET instance_id='1', service_object_id='6778',
check_type='0', current_chec$
Dec 13 11:46:38 nagios ndo2db: mysql_error: 'Lock wait timeout exceeded; try
restarting transaction'
After the reboot at 11:48 no more messages in te user.log
I also wanted to note the following :
One of my hosts had downtime planned. This downtime is show in opsview status
page. However when I click on it and remove it , it is not removed. I've waited
already 10 minutes to see it go away , but it's still marked as downtime , and
I still can remove it (again and again..)
Would be great to get it fixed. If we cannot fix it quickly , I might have to
rebuild the machine as we depend on the monitoring for now.
________________________________
Van: [email protected] namens James Whittington
Verzonden: zo 13-12-2009 4:46
Aan: Opsview Users
Onderwerp: Re: [opsview-users] Status Page contains old info
Peter,
That is an interesting MySQL error, I haven't run across that before but a
google search brought up some good hits on "total number of locks exceeds the
lock table size"
Here is one such link..
http://mrothouse.wordpress.com/2006/10/20/mysql-error-1206/
<http://mrothouse.wordpress.com/2006/10/20/mysql-error-1206/>
It appears others were able to resolve the error by increasing the
innodb_buffer_pool_size
<http://dev.mysql.com/doc/refman/4.1/en/innodb-parameters.html> variable in
/etc/my.cnf.
I would suggest making a copy of my.cnf and they try to adjust that variable
and restarting mysql to see if that makes a difference.
Good luck..
James Whittington
VC3, Inc.
From: [email protected]
[mailto:[email protected]] On Behalf Of Peter Plate
Sent: Saturday, December 12, 2009 4:49 PM
To: Opsview Users
Subject: Re: [opsview-users] Status Page contains old info
Syslog :
Dec 12 22:35:02 nagios /USR/SBIN/CRON[11973]: (nagios) CMD
(/usr/local/nagios/bin/call_nmis nmis.pl type=collect mthread=true > /dev/null
2>&1)
Dec 12 22:35:02 nagios /USR/SBIN/CRON[11972]: (root) CMD (if [ -x /usr/bin/mrtg
] && [ -r /etc/mrtg.cfg ]; then env LANG=C /usr/bin/mrtg /etc/mrtg.cfg >>
/var/log/mrtg/mrtg$
Dec 12 22:35:03 nagios /USR/SBIN/CRON[11975]: (nagios) CMD
(/usr/local/nagios/bin/mrtg_genstats.sh > /dev/null 2>&1)
Dec 12 22:35:15 nagios ndo2db: Successfully disconnected from MySQL database
Dec 12 22:35:15 nagios ndo2db: Successfully connected to MySQL database
Dec 12 22:36:14 nagios ndo2db: Successfully disconnected from MySQL database
Dec 12 22:36:14 nagios ndo2db: Successfully connected to MySQL database
Dec 12 22:36:34 nagios ndo2db: Successfully disconnected from MySQL database
Dec 12 22:36:34 nagios ndo2db: Successfully connected to MySQL database
There are some older items which contain innodb instructions but seem ok to me.
Opsview is still 3.03.2290
My user.log contains the following errors :
Dec 12 16:48:31 nagios ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_servicechecks SET instance_id='1', service_object_id='7636',
check_type='0', current_chec$
Dec 12 16:48:31 nagios ndo2db: mysql_error: 'The total number of locks exceeds
the lock table size'
Dec 12 16:48:31 nagios ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_servicechecks SET instance_id='1', service_object_id='3663',
check_type='0', current_chec$
Dec 12 16:48:31 nagios ndo2db: mysql_error: 'The total number of locks exceeds
the lock table size'
{Truncated !}
They seem to be related to the instert fails. However Í really don't know how
to fix it.
________________________________
From: [email protected]
[mailto:[email protected]] On Behalf Of Ton Voon
Sent: zaterdag 12 december 2009 22:33
To: Opsview Users
Subject: Re: [opsview-users] Status Page contains old info
On 12 Dec 2009, at 21:13, Peter Plate wrote:
Hello All,
Last night our server with OpsView stopped responding , so we needed to hard
reboot it (power off / on).
Now it seems the checks are ok and still scheduled , however our status page
contains old / outdated info.
For example a service check had a problem today at 1:00 am , it was ok at 1:15
am (restart of service) , even the status of the service check shows ok now
(clicking on it shows status : OK , and last check time was some minutes ago)
But on the status page (Alerts > All Unhandled) it is still in CRITICAL state ,
and last check is about 8 hours ago , while it is checked every 5 minutes.
Can you help us get the Opsview pages in sync again ?
PS : My Unix knowledge is limited , the system was installed by previous system
administrators , however I learn quickly.
You likely have some database problem where the data is not being inserted
correctly. Check syslog messages for mysql problems.
Which version of Opsview are you using?
Ton
_______________________________________________
Opsview-users mailing list
[email protected]
http://lists.opsview.org/lists/listinfo/opsview-users