[Nagios-users] Nagios web GUI, contact groups, hosts and services
Dear list,

I'm presently working on a Nagios 3.2.0 setup that monitors approximately 1000 hosts and about 5000 services. The setup doesn't make use of Nagios' notification system; instead, people at a control center (NOC) watch the Nagios web interface and alert the appropriate people when a problem shows up.

The Nagios configuration is based on a set of host (group) templates, where services are assigned to host groups. For example: the 'check_swap' service definition is associated with the host group 'all unix hosts'.

The problem I'm confronted with is that the people at the NOC don't need to see all services on a given host. For example, they shouldn't call the sysadmin at night if an SSL certificate check goes into critical state because the certificate is only valid for ten more days. So we want to remove that service from their view. In the present situation contact groups (used to determine which servers are visible to which department) are added to each specific host, but according to the Nagios docs (http://nagios.sourceforge.net/docs/3_0/cgiauth.html) a contact group can see _all_ services on a given host if it is listed as a contact group for the host.
So I decided to remove the NOC contact group from the individual host definitions and to assign the NOC contact group only to specific services.

This leads me to another problem. Some service checks (i.e. host groups) are used in one department only. This works fine.
However, some other service checks (like check_swap for the 'all unix hosts' hostgroup) are shared by all departments, but some departments don't want the NOC to see check_swap alerts whereas others do want to pass these alerts to the NOC. It would be possible to make services with slightly different names (e.g. check_swap_dept1, check_swap_dept2), each with the correct contact group. However, that seems to be a needless increase in complexity.
Another approach would be to make host groups for each department and somehow change the service contact group for each host group. Unfortunately I haven't been able to get that to work.

Any suggestions would be highly appreciated.

Lennart Karssen.

--
The Planet: dedicated and managed hosting, cloud storage, colocation
Stay online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
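
The "host groups per department" idea in the message above might be sketched roughly as follows. This is only an illustration, not a tested answer from the thread: the host group names (dept1-unix-hosts, dept2-unix-hosts), the contact group names (dept1-admins, dept2-admins, noc) and the generic-service template are all hypothetical, and the template is assumed to supply the remaining required service directives. Nagios identifies a service by its host plus its service_description, so the same description can appear in separate definitions as long as no host is a member of both host groups.

```
# Hypothetical sketch: one check_swap definition per department host group.
# All group, contact group and template names below are assumptions.

define service {
    use                  generic-service     ; assumed template with the other required directives
    hostgroup_name       dept1-unix-hosts    ; department 1 wants the NOC to see swap alerts
    service_description  check_swap
    check_command        check_swap
    contact_groups       dept1-admins,noc
}

define service {
    use                  generic-service
    hostgroup_name       dept2-unix-hosts    ; department 2 keeps swap alerts to itself
    service_description  check_swap
    check_command        check_swap
    contact_groups       dept2-admins        ; noc deliberately left out
}
```

The service keeps a single name in the web interface, so no check_swap_dept1 / check_swap_dept2 variants are needed; the cost is one definition per department for each shared check. A host that belongs to both host groups would end up with two conflicting definitions of the same service, so the sketch only fits if departments do not share hosts.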
Re: [Nagios-users] Nagios web GUI, contact groups, hosts and services
Karssen,

Your problem appears to be one of process, not of the tool. I am not suggesting how to conduct business in your organization, but here is my suggestion.

I would reach out to NOC management and put together a wiki or how-to for each unique possible critical alert scenario. Then I would add the "notes_url" parameter of Nagios to point to that particular wiki page.

The only thing the NOC has to do is click on the notes_url link, and they will know exactly what to do with the alert.

Thanks

On Mon, Feb 8, 2010 at 3:40 AM, L.C. Karssen wrote:
> The problem I'm confronted with is that the people at the NOC don't need
> to see all services on a given host. For example, they shouldn't call the
> sysadmin at night if an SSL certificate check goes into critical state
> because the certificate is only valid for ten more days.
> [...]

--
Cordially,
Shadhin Rahman
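
For readers unfamiliar with the directive, the notes_url suggestion in the reply above could look something like this in a service definition. It is a minimal sketch under stated assumptions: the wiki address, the host group name and the generic-service template are placeholders, not values from the thread.

```
# Hypothetical sketch: attach a per-check runbook link for the NOC.

define service {
    use                  generic-service     ; assumed template name
    hostgroup_name       all-unix-hosts      ; placeholder for the 'all unix hosts' group
    service_description  check_swap
    check_command        check_swap
    notes_url            http://wiki.example.com/noc/check_swap   ; placeholder wiki page
}
```

The link then shows up next to the service in the web interface, so the NOC operator can open the page that says whether, and whom, to call for that alert. This leaves the service visible to the NOC; it changes what they do with it rather than what they see.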
Re: [Nagios-users] Nagios web GUI, contact groups, hosts and services
Hi Shadhin Rahman,

Thanks for your reply. I hadn't mentioned it in my e-mail, but adding the notes_url is indeed the option I had in mind as a last resort. I like your suggestion of linking to a wiki. If no other solution comes up, I guess that's what we'll do.

Regards,

Lennart Karssen.

On Mon, 2010-02-08 at 10:41 -0500, shadih rahman wrote:
> I would reach out to noc management and put together a wiki or how
> to for each unique possible critical alert scenario. Then I would add
> the "notes_url" parameter of Nagios to point to that particular wiki
> page.
> [...]