Jakob Praher schrieb:
hi all,

i would like to ask some questions regarding best practices for zfs
recovery if disk errors occur.

currently i have zfs boot (nv62) and the following setup:

2 si3224 controllers (each 4 sata disks)
8 sata disks, same size, same type

i have two pools:
a) rootpool
b) datapool

the rootpool is a mirrored pool, where every disk has a slice (the s0,
which is 5 % of the whole disk) and this is devoted to the rootpool,
just for mirroring.

the rest of the disk (s1) is added to the datapool which is raidz.

my idea is that if any disk is corrupt i am still be able to boot.

now I have some questions:

a) if i want to boot from every disk in case of error, i have to setup
grub on every disk, such that if the controller sets this disk as the
booting, the rootpool is able to be loaded from that.

b) what is the best way to as fast as possible replace a disk.
adding a disk as hotspare for the raidz is a good idea. but i also would
like to replace the disk during runtime as simple as possible.

the problem is that for the root pool the disks are labeled (the slices
thingy). So I cannot simply  detach the volumes and replace the disk and
attach them again, but I have to format the disk such that the slicing
exists. Is there some clever way to automatically re-label a replacement
disk?


i found out that storing or getting the label information from another disk should work:

prtvtoc /dev/rdsk/xxxxs2 | fmthard -s - /dev/rdsk/xxxxs2

for instance i could simply store the label of all disks on the root pool, which should be available as long as any of the 8 disks is still availabe. So in case of repair i simply have to fmthard -s <disknumber> before attaching the replaced disk.

c) si 3224 related question: is it possible to simply hot swap the disk
(i have the disks in special hot-swappable units, but have no experience
in hotswapping under solaris, such that i want to have some echo).

d) do you have best practices for systems like that above? what are the
best resources on the web for learning about monitoring the health of
the zfs system (like email notifications in case of disk failures...)

thannks in advance
-- Jakob

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to