I think it is time to create an operations/ops guide for GlusterFS.
Operating guide should address issues which administrators face while
running/maintaining GlusterFS storage nodes. Openstack project has an
operating guide [2] which try to address similar issues and it is pretty
cool.
IMO these are the typical/example questions which operating guide should
try to address.
* /Maintenance, Failures, and Debugging/
* What are steps for planned maintenance for GlusterFS node?
* Steps for replacing a failed node?
* Steps to decommission a brick?
* /Logging and Monitoring/
* Where are the log files?
* How to find-out if self-heal is working properly?
* Which log files to monitor for detecting failures?
Operating guide needs good amount of work, hence we all need to come
together for this. You can contribute for this by either of the following
* Let know others about the questions you want to get answered in
the operating guide. ( I have set-up a etherpad for this [1])
* Answer the questions/issues raised by others.
Comments, suggestions?
Should this be part of gluster code base i.e. /doc or somewhere else?
[1] http://titanpad.com/op-guide
[2] http://docs.openstack.org/ops/oreilly-openstack-ops-guide.pdf
Thanks,
Lala
#lalatenduM on freenode
_______________________________________________
Gluster-users mailing list
Gluster-users@gluster.org
http://supercolony.gluster.org/mailman/listinfo/gluster-users