Pfennig, Klaus (BIT B 3) wrote:

Hilist,

We're about to migrate from nagios to icinga. After dealing with some kind of trouble, we now have a problem, which we - until now - can't solve on our own. Before opening a new issue about the bug (?), I want to ask you, if there is an error in our setup, I've missed an email about this issue on the list or it's even a (new) bug in icinga:

Most of our host- and service-checks are being passively checked in a regular interval (distributed monitoring). But some of our (passive) checks receive no regular check results, they receive only a check result if the state of the particular host/service changes. In many cases, it is possible, that there is no state-change / check-result-update for about 2 weeks or even longer.

The ClassicUI displays everything correct: last check, duration, number of host-/service-checks etc. (In this case: last check was on 30-08-2011). All entries in status.dat and retention.dat are okay and up-to-date.


could i get (also in a private mail) get some insights on such an example?

If we activate the "glorious three" (idomod/ido2db/mysql) and do a '/etc/init-d/icinga start', the following scenario occurs: the retained information seems to be only transferred to the database if they're not too old (I guess, about a week or so and it matters both, host- and service-checks).


retained data is being read from the core (retention.dat) and then signalled via event broker callback to the idomod module, which then dumps the config onto the database at that moment (this is what we call core blocking by a config dump). although it would be interesting why some data rushes into and some does not. could you then turn on debuggong on idomod and ido2db (seperated files) and track the hosts/services which are dumped correctly, to get an idea what might happen over there?


So they're not written to our database (I've checked tables icinga_servicechecks and icinga_servicestatus; in icinga_objects everything is correct),


configs are written to the config tables, i.e. icinga_services. the icinga_servicechecks table is only populated if a check happens and the callback triggers that event in the event broker module. same goes for the status update.

hence icinga-web doesn't display them and calculates a wrong number of host-/service-checks.


are you speaking about "checks" meaning that a host / service shows up in the interface e.g. on the status cronk? is the object in the database flagged as is_active ?

And some more curious: if we submit passive service- and host-check-results via interface or command-line, the results will be displayed correct -- for about a week and then they're "gone" (although max_age for service- and host-checks is one day)...


what are you exactly talking about? state history? check history? some screenshots might be helpful.

At this time, we have a difference of about 800 service-checks between ClassicUI and Icinga-web (or better: between status.dat/retention.dat and ido) and this is the only problem that prevents us from migrating to icinga.

For a better view on our setup, I've added some extracts of our config:


i would be interested in the

- os
- icinga, classicui, idoutils version
- mysql version, any performance tweaks for innodb applied?
. icinga web version


max_servicechecks_age=1440

max_hostchecks_age=1440


on what purpose you are saving the check data?

max_eventhandlers_age=10080

max_externalcommands_age=0

max_logentries_age=0


this table may grow and grow and grow - i hope you are aware of that.

max_acknowledgements_age=0

clean_realtime_tables_on_core_startup=1

clean_config_tables_on_core_startup=1

trim_db_interval=60


this means every 60 seconds - the new default in recent release was changed to 3600s.

data_processing_options=-1


is there *any* logical reason to use -1 ?


config_output_options=2


so retained config ...

kind regards,
michael

Any ideas?

Thanks in advance and kind regards

Klaus


------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2dcopy1


_______________________________________________
icinga-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/icinga-users


--
DI (FH) Michael Friedrich

Vienna University Computer Center
Universitaetsstrasse 7 A-1010 Vienna, Austria

email:  [email protected]
phone:  +43 1 4277 14359
mobile: +43 664 60277 14359
fax:    +43 1 4277 14338
web:    http://www.univie.ac.at/zid
        http://www.aco.net

Icinga Core&  IDOUtils Developer
http://www.icinga.org

------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2dcopy1
_______________________________________________
icinga-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/icinga-users

Reply via email to