So, I've changed the thread title...

A few updates.

I didn't think it was load, so I tried running the Alert/MTR script *by 
hand/manually*, while smokeping and nagios are doing their thing - just to test 
what load was and what the effect was.

I ran about 15 alerts/MTR runs in quick succession - all while smokeping and 
nagios were also running doing their work.
Load does peak higher than I suspected - at ~2 for the 1 min average - but 
those queries complete fairly quickly and load drops back to around 0.3-0.4. 
[and this is way more load from the MTR script than should have been occurring 
in the automated runs I was doing before.]

However, even with the much higher load, there were no drops in writing the 
smokeping RRD's and Nagios doesn't complain about them.

So, I think it's safe to say that it's not a load issue - it's that for some 
reason when smokeping runs the "alert" script, that it has to wait for that 
script to finish before it goes on to do anything more - and this causes the 
other issues.

---
So, I also tried appending a "&" to the smokeping alert line in the config - in 
the hopes that it would run the process in the background. No luck. [I'd guess 
it places the "&" before the passed arguments and the script doesn't get any of 
the passed arguments it needs.]

I thought about creating a script that would run a second script and append the 
"&" to it, and run it.

e.g. 
"MTR-Create" [a (bash?) script] - would take the arguments it was passed from 
smokeping [you'd call MTR-Create from the smokeping alert]
MTR-Create would simply take it's arguments and call the "regular" MTR/Alert, 
passing along those arguments and appending "&" at the end to run it in the 
background.

I suspect I can struggle my way through doing that - but does any BASH guru 
know how best to do that, offhand. It could save me a lot of poking, trial and 
error! :)

TIA!

-Greg



Greg,

How many alerts are firing when your box starts to bog?  I have been running my 
fork of the mtr script for several months now with no issues.  Matter of fact, 
I am now working on an expanded version that will dump the mtr's into mysql for 
easy access for our NOC.  Currently, I just have the script appending a file in 
/var/log with each mtr.

Could you be pushing the box you are running from too hard?




On Wed, Jun 25, 2014 at 8:12 PM, Gregory Sloop <gr...@sloop.net> wrote:



FP> On 21.02.2014 06:42, Philip Wehunt wrote:
>> I could hackishly work around this in my python but I wanted to
>> identify if I am doing something wrong on the SP side or if it is a
>> bug. Mainly in the spirit of KISS. I don't like to let hackish
>> scripts linger.

FP> You probably found the same script on gist, but here's my version[1]
FP> which doesn't fail when the 6th arg is missing. It will not add "
FP> cleared" to the subject without the arg, but it will send you the report.

FP> [1]: https://git.server-speed.net/users/flo/bin/tree/smokemtr.py

FP> From the documentation in smokeping_config I'd say this is a bug, but
FP> given I get my mails I didn't bother fixing it yet.

Florian et.al.

First, thanks for the script. I've had to mod it a bit - my MTR isn't quite the 
same as yours and I want to use a non-local SMTP server and port - but those 
were easy mods. [MTR is in a different spot too, again easy mod.]

So, I'm very excited about the prospects of automated mtr stats when a 
smokeping alert gets triggered - however I run into a substantial snag.

I use a 60s poll in smokeping, and if I get a bunch of [smokeping] alerts that 
kick off, then, when each MTR takes a while to run, it stalls smokeping. 

This causes a ripple-effect, and a raft of nagios alerts...since I use a 
smokeping nagios plug-in.  When SP stalls [running the mtr's] the RRD's go dry, 
and then nagios starts alerting on an "unknown" target state. ["This RRD hasn't 
been written to in 180s" etc.]

So, is there some way I can fork off the mtr script, and allow smokeping to 
continue while the mtr stats are gathered and a report sent?

[This is something I'm woefully un-knowledgeable about...]

TIA
-Greg

_______________________________________________
smokeping-users mailing list
smokeping-users@lists.oetiker.ch
https://lists.oetiker.ch/cgi-bin/listinfo/smokeping-users


-- 
Gregory Sloop, Principal: Sloop Network & Computer Consulting
Voice: 503.251.0452 x82
EMail: gr...@sloop.net
http://www.sloop.net
---
_______________________________________________
smokeping-users mailing list
smokeping-users@lists.oetiker.ch
https://lists.oetiker.ch/cgi-bin/listinfo/smokeping-users

Reply via email to