Hello All,

 

I'm testing Zenoss in our primarily Windows environment.  I have
zenoss-1.1.2 installed on a CentOS 5 (RHEL 5) box, monitoring 10 Windows
servers.  I've been really impressed with the product.  I began by
monitoring 5 servers with just SNMP, and 2 days ago installed ZenWin to
gather service/log info from our Windows boxes.

 

That install seemed pretty straightforward: I worked through a few kinks
and had it working great.  I then added another 5 Windows servers to
monitor, and everything seemed to be working well.  This morning I
noticed that 2 of my servers were not reporting WMI info.  I restarted
the 3 Zenoss services on the Windows server running ZenWin, but the
situation stayed the same.

 

I decided to restart the server running ZenWin (it's just on a Windows
Server 2003 Standard Ed. development box).  When the server came back up,
there were 5 servers not reporting.  All of these servers have worked in
the past, and the servers failing now were different from the ones
failing before the restart.

 

Upon restart, the zeneventlog.log showed:

 

2007-05-09 13:34:14 INFO zen.SendEvent: SendEvent thread started.
2007-05-09 13:34:14 INFO zen.zeneventlog: reloading configuration
2007-05-09 13:34:15 WARNING zen.zeneventlog: skipping srv105.domain.lcl has bad wmi state
2007-05-09 13:34:15 WARNING zen.zeneventlog: skipping srv106.domain.lcl has bad wmi state
2007-05-09 13:34:15 WARNING zen.zeneventlog: skipping srv107.domain.lcl has bad wmi state
2007-05-09 13:34:15 WARNING zen.zeneventlog: skipping srv108.domain.lcl has bad wmi state
2007-05-09 13:34:15 INFO zen.zeneventlog: Com InterfaceCount: 24
2007-05-09 13:34:15 INFO zen.zeneventlog: Com GatewayCount: 0
2007-05-09 13:34:15 INFO zen.zeneventlog: tested 6 devices in 0.86 seconds
2007-05-09 13:35:14 INFO zen.zeneventlog: Com InterfaceCount: 24
2007-05-09 13:35:14 INFO zen.zeneventlog: Com GatewayCount: 0
... etc.

 

zenwinmodeler.log shows:

2007-05-09 13:24:27 INFO zen.SendEvent: SendEvent thread started.
2007-05-09 13:24:27 INFO zen.zenwinmodeler: reloading configuration
2007-05-09 13:24:27 INFO zen.zenwinmodeler: collecting from srv001.domain.lcl using user domain\zenoss
2007-05-09 13:24:30 INFO zen.zenwinmodeler: collecting from srv002.domain.lcl using user domain\zenoss
2007-05-09 13:24:32 INFO zen.zenwinmodeler: collecting from srv004.domain.lcl using user domain\zenoss
2007-05-09 13:24:35 INFO zen.zenwinmodeler: collecting from srv101.domain.lcl using user domain\zenoss
2007-05-09 13:24:38 INFO zen.zenwinmodeler: collecting from srv102.domain.lcl using user domain\zenoss
2007-05-09 13:24:41 INFO zen.zenwinmodeler: collecting from srv103.domain.lcl using user domain\zenoss
2007-05-09 13:24:44 WARNING zen.zenwinmodeler: skipping srv105.domain.lcl has bad wmi state
2007-05-09 13:24:44 WARNING zen.zenwinmodeler: skipping srv106.domain.lcl has bad wmi state
2007-05-09 13:24:44 WARNING zen.zenwinmodeler: skipping srv107.domain.lcl has bad wmi state
2007-05-09 13:24:44 WARNING zen.zenwinmodeler: skipping srv108.domain.lcl has bad wmi state
2007-05-09 13:24:44 INFO zen.zenwinmodeler: tested 10 devices in 16.69 seconds
2007-05-09 13:24:44 INFO zen.zenwinmodeler: reloading configuration
2007-05-09 13:24:44 INFO zen.zenwinmodeler: tested 0 devices in 0.05 seconds
2007-05-09 13:24:45 INFO zen.zenwinmodeler: reloading configuration
... etc.
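
Since the set of failing servers changes between restarts, it may help to
pull the host names out of both logs after each restart and compare them.
The following is only a minimal sketch in Python: it assumes the two message
formats shown in the excerpts above ("collecting from <host>" and
"skipping <host> has bad wmi state"), and the names summarize and report are
hypothetical helpers, not part of Zenoss or ZenWin.

```python
import re
from collections import defaultdict

# \s+ between words lets a pattern match even where the mail client
# wrapped a log entry onto a second line.
SKIPPED = re.compile(
    r"WARNING\s+zen\.(\w+):\s+skipping\s+(\S+)\s+has\s+bad\s+wmi\s+state"
)
COLLECTED = re.compile(r"INFO\s+zen\.(\w+):\s+collecting\s+from\s+(\S+)")


def summarize(log_text):
    """Map each daemon name to the hosts it collected from and skipped."""
    summary = defaultdict(lambda: {"collected": set(), "skipped": set()})
    for daemon, host in COLLECTED.findall(log_text):
        summary[daemon]["collected"].add(host)
    for daemon, host in SKIPPED.findall(log_text):
        summary[daemon]["skipped"].add(host)
    return dict(summary)


def report(log_text):
    """Return one human-readable line per daemon and category."""
    lines = []
    for daemon, hosts in sorted(summarize(log_text).items()):
        for category in ("collected", "skipped"):
            names = ", ".join(sorted(hosts[category])) or "(none)"
            lines.append(f"{daemon} {category}: {names}")
    return lines


# Two entries copied from the zenwinmodeler.log excerpt, wrapped as posted.
SAMPLE = """\
2007-05-09 13:24:41 INFO zen.zenwinmodeler: collecting from
srv103.domain.lcl using user domain\\zenoss
2007-05-09 13:24:44 WARNING zen.zenwinmodeler: skipping
srv105.domain.lcl has bad wmi state
"""

for line in report(SAMPLE):
    print(line)
```

Passing the full contents of zeneventlog.log or zenwinmodeler.log to report
in place of SAMPLE would list the skipped hosts per daemon, which makes it
easier to see whether the same servers fail after every restart.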

 

I have successfully monitored all of these servers (prior to adding the
additional 5), so it should not be a permissions issue.  I have restarted
the "Windows Management Instrumentation" service on one of the failing
servers, but it made no difference.  The servers are a mix of Windows
2000 Server SP4, Windows Server 2003 Standard Ed. SP1, and Windows
Server 2003 Standard Ed. R2.  The server hosting the ZenWin processes
is an R2 server.  I've replaced the domain and computer names in the
logs above.

 

It almost seems as if ZenWin gets too many WMI queries going at once and
the subsequent ones fail.  Any thoughts?  Are any of you using ZenWin to
monitor many Windows servers?

 

Any help is greatly appreciated.  Getting ZenWin working will be
critical to using Zenoss in production.

 

Kindest Regards,

 

Chris Hillman 

Systems Administrator
Clearwater Research, Inc. 



 

 

_______________________________________________
zenoss-users mailing list
[email protected]
http://lists.zenoss.org/mailman/listinfo/zenoss-users
