Even when tomcat7 is stopped. The pid exists and monit is throwing the error.
On Mon, Feb 24, 2014 at 12:04 PM, Martin Pala <[email protected]>wrote: > I'm not sure if it's copy&paste typo, but the size test is not allowed in > the "check process" context: > > --8<-- > check process tomcat7 with pidfile /var/run/tomcat7.pid > > if size = 0 then exec "/bin/rm /var/run/tomcat7.pid" <------------------------ > error > start program = "/etc/init.d/tomcat7 start" with timeout 60 seconds > ... > --8<-- > > The "check file" test is not necessary ... the pidfile based process check > will automatically trigger restart if the pidfile is missing. > > Regards, > Martin > > > > > On 24 Feb 2014, at 20:55, Sanjay Veeramachaneni <[email protected]> > wrote: > > That worked thank you. Now i am running into another error > > For tomcat, and Mongo db. I try to check the pid file and start if pid > file doesnt exist. The the config is shown below. > > > ## > > # Check Tomcat7 > check process tomcat7 with pidfile /var/run/tomcat7.pid > > if size = 0 then exec "/bin/rm /var/run/tomcat7.pid" > start program = "/etc/init.d/tomcat7 start" with timeout 60 seconds > stop program = "/etc/init.d/tomcat7 stop" > if failed host 127.0.0.1 port 8080 protocol http then restart > if 5 restarts within 5 cycles then timeout > group web > ### > > I get the error > monit: Error reading pid from file '/var/run/tomcat7.pid' ... > > monit: Error reading pid from file '/var/lib/mongodb/mongod.lock' > > How would go about fixing this error . I tried adding the line > > check file tomcat7.pid with path /var/run/tomca7.pid if size = 0 then > exec "/bin/rm /var/run/tomcat7.pid" > > But it gives me syntax error that ###/etc/monit/monitrc:86: Error: syntax > error 'size' > > How can i fix this... > > > > > > > > > On Mon, Feb 24, 2014 at 5:25 AM, Martin Pala <[email protected]>wrote: > >> Hello, >> >> the problem is this line (4x in your configuration file): >> >> --8<-- >> if 5 restarts within 5 cycles then timeout and alert >> --8<-- >> >> Only one action is possible (and alert is implicit part of timeout) => it >> should be changed to this: >> >> --8<-- >> if 5 restarts within 5 cycles then timeout >> --8<-- >> >> Regards, >> Martin >> >> >> >> On 24 Feb 2014, at 04:27, Sanjay Veeramachaneni <[email protected]> >> wrote: >> >> Hello, i am setting up monit for monitoring our servers. Please check my >> configuration file for monitrc. For some reason it starts failing from >> check filesystem rule which i have highlighted below. If i comment that >> whole section of for the check file system.. the error starts moving on to >> the next rules down................. not sure how to trouble shoot this >> error. This has been really frustrating. Any help is appreciated >> here................... >> >> #### >> >> set daemon 120 # check services at 2-minute intervals >> set logfile /var/log/monit.log >> set idfile /var/lib/monit/id >> set statefile /var/lib/monit/state >> set mailserver smtp.gmail.com port 587 >> using tlsv1 with timeout 30 seconds >> username "[email protected]" password "******" >> set mail-format { from: [email protected] } >> set alert [email protected] # Email to send notifications >> set eventqueue >> basedir /var/lib/monit/events >> slots 100 >> >> set mail-format { >> subject: monit alert -- $EVENT $SERVICE >> message: $EVENT Service $SERVICE >> Received: >> Date: $DATE >> Action: $ACTION >> Host: $HOST >> Description: $DESCRIPTION >> } >> >> set httpd port 8181 and # Port used for the WEB access, listen on >> LOCALHOST >> use address localhost >> allow 0.0.0.0/0.0.0.0 >> allow admin:monit # Username/password >> >> >> ############################################################################### >> ## Services >> >> ############################################################################### >> check system localhost >> if loadavg (1min) > 4 then alert >> if loadavg (5min) > 2 then alert >> if memory usage > 75% then alert >> if swap usage > 25% then alert >> if cpu usage (user) > 70% then alert >> if cpu usage (system) > 30% then alert >> if cpu usage (wait) > 20% then alert >> >> # Check the binary Apache file (MD5 checksum + permission + UID + GUID) >> check file apache_bin with path /usr/lib/apache2/mpm-prefork/apache2 >> if failed permission 755 then unmonitor >> if failed uid root then unmonitor >> if failed gid root then unmonitor >> group web >> >> # Check apache service (service alive + resource consumption) >> check process apache with pidfile /var/run/apache2.pid >> start program = "/etc/init.d/apache2 start" with timeout 20 seconds >> stop program = "/etc/init.d/apache2 stop" >> if children > 150 then alert >> if children > 250 then restart >> if loadavg(5min) greater than 15 for 8 cycles then stop >> if totalcpu > 50% for 2 cycles then alert >> if totalcpu > 80% for 5 cycles then restart >> if failed host 127.0.0.1 port 80 protocol http >> then restart >> if 5 restarts within 5 cycles then timeout and alert >> group web >> >> # Check ROOT filesystem space usage--------------------------------> >> Sytax error start from the check filesystem rule below. >> check filesystem rootfs with path / >> if space usage > 80% for 5 times within 15 cycles then alert >> group server >> >> # Check DATA filesystem space usage >> check filesystem datafs with path /dev/xvda1 >> if space usage > 80% for 5 times within 15 cycles then alert >> group server >> >> >> # Check mysqld service (depends mysqld binary + init.d startup script) >> check process mysqld with pidfile /var/run/mysqld/mysqld.pid >> group database >> start program = "/etc/init.d/mysql start" with timeout 20 seconds >> stop program = "/etc/init.d/mysql stop" >> if failed host 127.0.0.1 port 3306 protocol mysql then restart >> if 5 restarts within 5 cycles then timeout and alert >> >> >> # Check Tomcat7 >> check process tomcat7 with pidfile /var/run/tomcat7.pid >> start program = "/etc/init.d/tomcat7 start" with timeout 60 seconds >> stop program = "/etc/init.d/tomcat7 stop" >> if failed host 127.0.0.1 port 8080 protocol http then restart >> if 5 restarts within 5 cycles then timeout and alert >> group web >> >> # Check mongodb process >> check process mongodb with pidfile /var/lib/mongodb/mongod.lock >> group database >> start program = "/etc/init.d/mongodb start" with timeout 20 seconds >> stop program = "/etc/init.d/mongodb stop" >> if failed host 127.0.0.1 port 27017 protocol http for 3 times within 5 >> cycles then restart >> if 5 restarts within 5 cycles then timeout and alert >> >> -- >> To unsubscribe: >> https://lists.nongnu.org/mailman/listinfo/monit-general >> >> >> >> -- >> To unsubscribe: >> https://lists.nongnu.org/mailman/listinfo/monit-general >> > > -- > To unsubscribe: > https://lists.nongnu.org/mailman/listinfo/monit-general > > > > -- > To unsubscribe: > https://lists.nongnu.org/mailman/listinfo/monit-general >
-- To unsubscribe: https://lists.nongnu.org/mailman/listinfo/monit-general
