We're starting to grow our ZFS environment and really need to start
standardizing our monitoring procedures.
The OS tools are great for spot troubleshooting, and sar can be used for
some trending, but we'd really like to tie this into an SNMP-based
system that can generate graphs for us (via RRD or similar).
I don't really care whether we do this via our standard enterprise
monitoring tool or with some custom scripts, but I do have the
following questions:
- What metrics are you guys tracking? I'm thinking:
    - IOPS
    - ZIL statistics
    - L2ARC hit ratio
    - Throughput
    - "IO Wait" (I know there's probably a better term here)
- How do you gather this information? Some, but not all, of it is
  available via SNMP. Has anyone written a ZFS-specific MIB or
  plugin to make the info available via the standard Solaris SNMP
  daemon? What information is available only via zdb/mdb?
- Does anyone have an RRD-based setup for monitoring their ZFS
  environment that they'd be willing to share or talk about?
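To make the "custom scripts" option a bit more concrete, here is a rough
sketch of the kind of thing I have in mind for the L2ARC hit ratio. It
assumes the counters come from `kstat -p zfs:0:arcstats` and that the
relevant statistics are named l2_hits and l2_misses; the function names and
the sample numbers are made up purely for illustration, and the ratio is
computed over the interval between two samples rather than since boot.

```python
# Sketch only: assumes input in `kstat -p` format
# ("module:instance:name:statistic<TAB>value") and that the ARC kstats
# expose l2_hits / l2_misses. The sample values below are invented.

def parse_kstat(text):
    """Turn `kstat -p` style text into {statistic: int_value}."""
    stats = {}
    for line in text.splitlines():
        parts = line.split(None, 1)
        if len(parts) != 2:
            continue  # blank or malformed line
        key, value = parts
        stat = key.rsplit(":", 1)[-1]
        try:
            stats[stat] = int(value)
        except ValueError:
            continue  # skip non-integer fields such as "class" or "crtime"
    return stats


def l2arc_hit_ratio(prev, curr):
    """Hit ratio for the interval between two samples (None if no lookups)."""
    hits = curr["l2_hits"] - prev["l2_hits"]
    misses = curr["l2_misses"] - prev["l2_misses"]
    total = hits + misses
    return hits / total if total > 0 else None


# Two hypothetical samples taken some interval apart.
SAMPLE_PREV = "zfs:0:arcstats:class\tmisc\nzfs:0:arcstats:l2_hits\t1000\nzfs:0:arcstats:l2_misses\t500\n"
SAMPLE_CURR = "zfs:0:arcstats:class\tmisc\nzfs:0:arcstats:l2_hits\t1300\nzfs:0:arcstats:l2_misses\t600\n"

if __name__ == "__main__":
    ratio = l2arc_hit_ratio(parse_kstat(SAMPLE_PREV), parse_kstat(SAMPLE_CURR))
    print("L2ARC hit ratio over interval:", ratio)
```

In a real setup the two samples would come from running kstat on a timer,
and the resulting ratio would be fed to RRD or exposed through the SNMP
daemon, which is exactly the part I'm asking about.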
Thanks in advance,
Ray