Please can you check that your monit binary matches the system architecture? 
(i.e. for example 64-bit monit binary on 64-bit system - not 32-bit monit on 
64-bit system) 

To verify provide please the output of following commands:
        1.) uname -m
        2.) file `which monit`

Monit takes the statistics from the /proc/stat kernel interface. You can 
collect the statistics manually like this - for example to fetch the state in 1 
second intervals (30 samples):

        $ for ((i=0; i<30; i++)); do date; grep "cpu " /proc/stat; sleep 1; done

Note: monit takes the first /proc/stat line ("cpu") which contains the overall 
cpu usage in the system (summary of all cpus). The /proc/stat also contains 
per-cpu statistics if you want to collect all the statistics, replace the "grep 
'cpu '" simply with "cat".

Regards,
Martin


On Dec 7, 2011, at 10:04 AM, Lawrence, Wayne wrote:

> Hi Martin,
>  
> I have tried various methods to dientify the cause of this and took your 
> advice and used vmstat. I simply restarted the httpd process from the monit 
> web interface while the comand was running and got the following warning.
>  
>       Description: cpu system usage of 50.0% matches resource limit [cpu 
> system usage>30.0%]
>  
> But vmstat doesnt show that level of usage at the point of alert. As you can 
> see there is some usage in the 3rd line of the output when i restarted the 
> httpd service but it doesnt seem enough to trigger an alert.
>  
> vmstat 1 10
> procs -----------memory---------- ---swap-- -----io---- --system-- 
> -----cpu-----
>  r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa 
> st
>  0  0      0 859596 114684 856908    0    0     4     6   81   77  0  0 99  0 
>  0
>  0  0      0 859448 114684 856916    0    0     0     0  100   94  1  0 99  0 
>  0
>  0  0      0 898352 114692 815600    0    0     0   168  555  605 23 15 61  1 
>  0
>  
> Not sure if there are any other tests i can run to narrow this down a bit 
> further as it still isn't making sense.
>  
> Regards
>  
> Wayne
>  
>  
> 
> 
>  
> On 7 December 2011 08:27, Martin Pala <[email protected]> wrote:
> Hi Lawrence,
> 
> the test which triggers the alert is "system" cpu => it's the time the system 
> spend in kernel mode. The cpu usage could be triggered by some background 
> kernel task, to verify the monit report matches the system cpu usage, you 
> should use either "vmstat" or "top" instead of "ps".
> 
> Best regards,
> Martin 
> 
> 
> On Dec 6, 2011, at 1:19 PM, Lawrence, Wayne wrote:
> 
>> Hi Igor,
>>  
>> the operating system is RHEL6 and monit version is 5.3.1
>>  
>> this is what i have in my config
>>  
>>     if cpu usage (user) > 70% then alert
>>     if cpu usage (system) > 30% then alert
>>     if cpu usage (wait) > 20% then alert
>> 
>> this is one of the errors
>> Description: cpu system usage of 50.0% matches resource limit [cpu system 
>> usage>30.0%]
>>  
>> this is what i get in /var/log/messages
>> Dec  6 12:01:29 <hostname-removed> monit[864]: <hostname-removed> cpu system 
>> usage of 50.0% matches resource limit [cpu system usage>30.0%]
>> Dec  6 12:02:29 <hostname-removed> monit[864]: 
>> <hostname-removed><hostname-removed>' cpu system usage check succeeded 
>> [current cpu system usage=0.9%]
>>  
>> this is the output of ps --no-headers -A -o "%cpu sz ucomm" | sort -k1nr | 
>> head -20
>>  
>>  12:01:29 up 4 days, 20:24,  2 users,  load average: 0.04, 0.01, 0.00
>>              total       used       free     shared    buffers     cached
>> Mem:       2055108    1092176     962932          0      53156     811864
>> -/+ buffers/cache:     227156    1827952
>> Swap:      4128760          0    4128760
>>  1.2 44308 perl
>>  0.0     0 aio/0
>>  0.0     0 async/mgr
>>  0.0     0 ata/0
>>  0.0     0 ata_aux
>>  0.0     0 bdi-default
>>  0.0     0 cpuset
>>  0.0     0 crypto/0
>>  0.0     0 events/0
>>  0.0     0 ext4-dio-unwrit
>>  0.0     0 flush-253:0
>>  0.0     0 jbd2/dm-0-8
>>  0.0     0 kacpi_hotplug
>>  0.0     0 kacpi_notify
>>  0.0     0 kacpid
>>  0.0     0 kauditd
>>  0.0     0 kblockd/0
>>  0.0     0 kdmflush
>>  0.0     0 khelper
>>  0.0     0 khubd
>> 
>> Have to say i am at a total loss as there is no way the usage figures are 
>> accurate.
>> If there is any other info i can supply that will be useful please let me 
>> know.
>>  
>> Regards
>>  
>> Wayne
>> 
>> 
>> On 6 December 2011 12:03, Igor Homyakov <[email protected]> 
>> wrote:
>> Hi Lawrence,
>> 
>> Could you be a little bit more specific ?  Please provide information
>> about you operation system, monit version on which the problem
>> occurred and so on.
>> 
>> Regards
>> Igor Homyakov
>> 
>> On Tue, Dec 6, 2011 at 15:35, Lawrence, Wayne
>> <[email protected]> wrote:
>> > Hi,
>> >
>> > I have a few CPU usage checks in my monitrc but it seems monit is
>> > misreporting the usage.
>> >
>> > I have run several tests and it seems that monit is multiplying the actual
>> > usage by 10.
>> >
>> > I ran a process with top running in another shell and CPU usage for the 
>> > user
>> > was never above 10% yet monit informed me that there was 100% cpu usage.
>> >
>> > I have tried various configurations including the one that came with the
>> > default config for system cpu monitoring and all seem to demonstrate the
>> > same issue.
>> >
>> > Any advice welcomed on this
>> >
>> > Regards
>> >
>> > Wayne Lawrence
>> >
>> >
>> >
>> > --
>> > To unsubscribe:
>> > https://lists.nongnu.org/mailman/listinfo/monit-general
>> 
>> --
>> To unsubscribe:
>> https://lists.nongnu.org/mailman/listinfo/monit-general
>> 
>> --
>> To unsubscribe:
>> https://lists.nongnu.org/mailman/listinfo/monit-general
> 
> 
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general
> 
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general

--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general

Reply via email to