Hi,

I have a problem: mon suddenly stops logging to file (monshow.cgi still works
fine).
It runs fine for 2 or 3 days, then it stops logging.
The network is a closed environment (no internet access).
Nothing more is written to "dtlog" or by "alert file.alert", although ps shows
the mon process still running.
Could anyone help me fix this? I've never used mon before.
Are there any additional switches for mon that enable more debugging?

Thanks in advance,

Marcelo


<code>
daemon.log shows -snip-

Mar 23 17:06:12 gesnix01 mon[31278]: failure for Routers ping 1269363972 rt05 rt06
Mar 23 17:07:08 gesnix01 mon[31278]: client connection from 127.0.0.1:58639
Mar 23 17:07:09 gesnix01 mon[31278]: client command "protid 9745"
Mar 23 17:07:09 gesnix01 mon[31278]: client command "list disabled"
Mar 23 17:07:09 gesnix01 mon[31278]: client command "list state"
Mar 23 17:07:09 gesnix01 mon[31278]: client command "list watch"
Mar 23 17:07:09 gesnix01 mon[31278]: client command "list opstatus"
Mar 23 17:07:09 gesnix01 mon[31278]: client command "list descriptions"
Mar 23 17:08:09 gesnix01 mon[31278]: client connection from 127.0.0.1:58640
Mar 23 17:08:10 gesnix01 mon[31278]: client command "protid 9745"
Mar 23 17:08:10 gesnix01 mon[31278]: client command "list disabled"
Mar 23 17:08:10 gesnix01 mon[31278]: client command "list state"
Mar 23 17:08:10 gesnix01 mon[31278]: client command "list watch"
Mar 23 17:08:10 gesnix01 mon[31278]: client command "list opstatus"
Mar 23 17:08:10 gesnix01 mon[31278]: client command "list descriptions"
Mar 23 17:09:10 gesnix01 mon[31278]: client connection from 127.0.0.1:36287
Mar 23 17:09:11 gesnix01 mon[31278]: client command "protid 9745"
Mar 23 17:09:11 gesnix01 mon[31278]: client command "list disabled"
Mar 23 17:09:11 gesnix01 mon[31278]: client command "list state"
Mar 23 17:09:11 gesnix01 mon[31278]: client command "list watch"
Mar 23 17:09:11 gesnix01 mon[31278]: client command "list opstatus"
Mar 23 17:09:11 gesnix01 mon[31278]: client command "list descriptions"
Mar 23 17:10:11 gesnix01 mon[31278]: client connection from 127.0.0.1:36289
Mar 23 17:10:12 gesnix01 mon[31278]: client command "protid 9745"
Mar 23 17:10:12 gesnix01 mon[31278]: client command "list disabled"
Mar 23 17:10:12 gesnix01 mon[31278]: client command "list state"
Mar 23 17:10:12 gesnix01 mon[31278]: client command "list watch"
Mar 23 17:10:12 gesnix01 mon[31278]: client command "list opstatus"
Mar 23 17:10:12 gesnix01 mon[31278]: client command "list descriptions"


my mon.cf

alertdir     = /usr/lib/mon/alert.d
mondir       = /usr/lib/mon/mon.d
logdir       = /var/log/mon
histlength   = 500
historicfile = /var/log/mon/history.log
maxprocs     = 20
randstart    = 30s
# authfile = /etc/mon/auth.cf

dtlogging = yes
dtlogfile = dtlog

hostgroup TimeServers 10.184.0.2 10.184.29.238


hostgroup Gateway xxxgw


hostgroup Localhost xxxnix


hostgroup Routers
rt01
rt02
rt03
...
rtnn


watch Gateway
service ping
  description Check Default Gateway connectivity
  interval 2m
  monitor fping.monitor -r 5 -t 2000
  period wd {Mon-Sun} hr {0am-24pm}
    alertafter 1
    alertevery 2h


watch Localhost
service dns
  description Check DNS server availability
  interval 15m
  monitor dns.monitor -zone ges.nso -master gesnix.ges.nso
  period
    alertafter 1
    numalerts 1
    alert mail.alert r...@localhost
service tacacs
  description Check TACACS server availability
  interval 15m
  monitor tacacsplus.monitor -k cle_tacacs
  period
    alertafter 1
    numalerts 1
    alert mail.alert r...@localhost
service ssh
  description Check SSH server availability
  interval 15m
  monitor telnet.monitor -p 22 -l /SSH-/
  period
    alertafter 1
    numalerts 1
    alert mail.alert r...@localhost


watch Routers
service ping
  description Check Routers connectivity
  interval 5m
  monitor fping.monitor -r 5 -t 2000
  depend Gateway:ping
  period wd {Mon-Sun} hr {0am-24pm}
    alertafter 1
    alertevery 4h
    numalerts 1
    alert file.alert /var/log/HOSTS/outages.log
    alert mail.alert r...@localhost


watch TimeServers
service ntp
  description Check NTP server availability
  interval 12h
  depend Gateway:ping
  monitor ntpdate.monitor -s6 -o30
  period
    alertafter 1
    numalerts 1
    alert mail.alert r...@localhost

</code>
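
To narrow down when the file logging stops, something like the following might help. It is only a rough sketch, not part of mon: it pulls the epoch timestamp out of the "failure for ..." lines that mon writes to syslog (the format shown in the daemon.log excerpt above) and reports which log files have not been modified since the newest failure. The function names are made up for illustration. A stale file is a hint rather than proof, since alertevery and numalerts can legitimately suppress alerts, and as far as I understand dtlog is only written when a service recovers.

```python
import os
import re

# Matches the syslog lines mon writes on a failure, for example:
#   Mar 23 17:06:12 gesnix01 mon[31278]: failure for Routers ping 1269363972 rt05 rt06
# Groups: hostgroup, service, epoch time of the failure.
FAILURE_RE = re.compile(r"mon\[\d+\]: failure for (\S+) (\S+) (\d+)")


def last_failure_epoch(syslog_lines):
    """Return the epoch time of the newest 'failure for' line, or None."""
    newest = None
    for line in syslog_lines:
        match = FAILURE_RE.search(line)
        if match:
            epoch = int(match.group(3))
            if newest is None or epoch > newest:
                newest = epoch
    return newest


def files_older_than(paths, epoch):
    """Return the paths that are missing or were last modified before epoch."""
    stale = []
    for path in paths:
        try:
            mtime = os.path.getmtime(path)
        except OSError:
            # A missing or unreadable file counts as "not written".
            stale.append(path)
            continue
        if mtime < epoch:
            stale.append(path)
    return stale
```

To use it, pass the open daemon.log file (its location depends on the syslog setup) to last_failure_epoch, then hand the result to files_older_than together with /var/log/mon/dtlog and /var/log/HOSTS/outages.log from the mon.cf above.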
_______________________________________________
mon mailing list
mon@linux.kernel.org
http://linux.kernel.org/mailman/listinfo/mon
