Hi Ton,
You where right, the /var partition was full. And the "/usr/local/nagios/var"
path is a symllink to "/var/nagios".
Aparantly 32GB was not enough to hold all the backups in
"/usr/local/nagios/var/backups".
I've removed some of the older backups (September) and a few very large "files"
backups.
Unfortunatly... it still will not start ;-)
"[2009/10/15 06:55:43] [Opsview.Web.Controller.Root] [ERROR] Errors
encountered: Caught exception in
Opsview::Web::Controller::Status::Service->default "DBD::mysql::st execute
failed: Got error 28 from storage engine [for Statement "SELECT
DISTINCT(opsview_host_services.service_object_id) as service_object_id,
nagios_hosts.alias as alias, opsview_host_services.hostname as host,
opsview_host_services.host_object_id as host_object_id,
opsview_host_services.icon_filename as icon, nagios_hoststatus.current_state as
host_state, nagios_hoststatus.is_flapping as host_flapping,
nagios_hoststatus.problem_has_been_acknowledged as host_acknowledged,
(nagios_hoststatus.current_state != 0 and
nagios_hoststatus.problem_has_been_acknowledged != 1 and
nagios_hoststatus.scheduled_downtime_depth = 0) as host_unhandled,
opsview_host_services.servicename as service,
nagios_servicestatus.current_state as service_state,
nagios_servicestatus.is_flapping as service_flapping,
nagios_servicestatus.problem_has_been_acknowledged as service_acknowledged,
nagios_servicestatus.output as service_output,
(nagios_servicestatus.current_state != 0 and nagios_hoststatus.current_state =
0 and nagios_hoststatus.scheduled_downtime_depth = 0 and
nagios_servicestatus.problem_has_been_acknowledged!=1 and
nagios_servicestatus.scheduled_downtime_depth = 0) as service_unhandled,
opsview_host_services.perfdata_available as perfdata_available,
nagios_servicestatus.current_check_attempt as current_check_attempt,
nagios_servicestatus.max_check_attempts as max_check_attempts,
CONVERT_TZ(nagios_servicestatus.last_check, '+00:00', 'SYSTEM') as last_check,
UNIX_TIMESTAMP(nagios_servicestatus.last_state_change) as
last_state_change_timev FROM opsview_host_services, nagios_hosts,
nagios_hoststatus, nagios_servicestatus WHERE ( ( nagios_hosts.host_object_id =
opsview_host_services.host_object_id AND opsview_host_services.host_object_id =
nagios_hoststatus.host_object_id AND opsview_host_services.hostname LIKE ? AND
opsview_host_services.service_object_id =
nagios_servicestatus.service_object_id ) ) ORDER BY host, service" with
ParamValues: 0='paixpearl'] at
/usr/local/nagios/perl/lib/DBIx/ContextualFetch.pm line 52."
"
Thanks,
Mark
----- "Ton Voon" <[email protected]> wrote:
>
On 27 Oct 2009, at 06:16, Mark Maas-Martin wrote:
>
>
> Today Opsview stopped functioning... I rebooted the server, did
> /etc/init.d/opsview restart, /etc/init.d/opsview-web restart, and similar
> actions.
> But to no avail, all I'm seeing is "Opsview is not running" in the
> webbrowser.
>
> in the logs I'm seeing:
> [2009/10/26 15:02:47] [Catalyst] [ERROR] Caught exception in
> Opsview::Web::Controller::Root->end "Can't call method "status_update_time"
> on an undefined value at
> /usr/local/opsview-web/script/../lib/Opsview/Web/Controller/Root.pm line
> 457."
> [2009/10/27 06:49:27] [Opsview.Web.Controller.Admin.Reload] [INFO] Running
> asynchronous reload
>
>
This looks like a bug in Opsview. The database has not got a row in the
nagios_programstatus table. Usually this is always there. Is there a problem
with mysql? Maybe the table is crashed?
>
Try this patch, to ignore that information so you won't get this error:
>
Index: opsview-web/lib/Opsview/Web/Controller/Root.pm
===================================================================
--- opsview-web/lib/Opsview/Web/Controller/Root.pm (revision 3451)
+++ opsview-web/lib/Opsview/Web/Controller/Root.pm (working copy)
@@ -454,8 +454,10 @@
my $output = defined $c->stash->{output} ? $c->stash->{output} : (
$c->req->param("output") ? $c->req->param("output") : "" );
if ( $output eq "ajax" || $output eq "json" ) {
- my $server_status =
$c->model("RuntimeSchema::NagiosProgramstatus")->first->status_update_time;
- $c->res->header( "X-Opsview-Server-Status" =>
$c->stash->{dt_formatter}->($server_status) );
+ my $server_status = $c->model("RuntimeSchema::NagiosProgramstatus")->first;
+ if ($server_status) {
+ $c->res->header( "X-Opsview-Server-Status" =>
$c->stash->{dt_formatter}->($server_status->status_update_time) );
+ }
}
# If json output, stop here
>
>
>
>
>
>
> A lot of these as well:
> [2009/10/27 06:50:41] [import_ndologsd] [INFO] Ignore 1256603567.214967 as
> size 0
> [2009/10/27 06:50:41] [import_ndologsd] [INFO] Ignore 1256603572.258430 as
> size 0
> [snipped]
>
>
>
> [2009/10/27 06:50:41] [import_ndologsd] [INFO] Ignore 1256603652.211023 as
> size 0
>
> This can occur if the filesystem where the ndologs are written to is 100%
> full.
>
So my guess is you had a full filesystem where /usr/local/nagios/var/ndologs
resides.
>
Ton
>
> _______________________________________________
> Opsview-users mailing list
> [email protected]
> http://lists.opsview.org/lists/listinfo/opsview-users
>
>_______________________________________________
Opsview-users mailing list
[email protected]
http://lists.opsview.org/lists/listinfo/opsview-users