I'm not an authority, but on my 'vanilla' filer, using the same controller chipset as the thumper, I've been in really good shape since moving to zfs boot in 10/08 and doing 'zpool upgrade' and 'zfs upgrade' to all my mirrors (3 3-way). I'd been having similar troubles to yours in the past.
My system is pretty puny next to yours, but it's been reliable now for slightly over a month. On Tue, Jan 27, 2009 at 12:19 AM, Jorgen Lundman <lund...@gmo.jp> wrote: > > The vendor wanted to come in and replace an HDD in the 2nd X4500, as it > was "constantly busy", and since our x4500 has always died miserably in > the past when a HDD dies, they wanted to replace it before the HDD > actually died. > > The usual was done, HDD replaced, resilvering started and ran for about > 50 minutes. Then the system hung, same as always, all ZFS related > commands would just hang and do nothing. System is otherwise fine and > completely idle. > > The vendor for some reason decided to fsck root-fs, not sure why as it > is mounted with "logging", and also decided it would be best to do so > from a CDRom boot. > > Anyway, that was 12 hours ago and the x4500 is still down. I think they > have it at single-user prompt resilvering again. (I also noticed they'd > decided to break the mirror of the root disks for some very strange > reason). It still shows: > > raidz1 DEGRADED 0 0 0 > c0t1d0 ONLINE 0 0 0 > replacing UNAVAIL 0 0 0 insufficient replicas > c1t1d0s0/o OFFLINE 0 0 0 > c1t1d0 UNAVAIL 0 0 0 cannot open > > So I am pretty sure it'll hang again sometime soon. What is interesting > though is that this is on x4500-02, and all our previous troubles mailed > to the list was regarding our first x4500. The hardware is all > different, but identical. Solaris 10 5/08. > > Anyway, I think they want to boot CDrom to fsck root again for some > reason, but since customers have been without their mail for 12 hours, > they can go a little longer, I guess. > > What I was really wondering, has there been any progress or patches > regarding the system always hanging whenever a HDD dies (or is replaced > it seems). It really is rather frustrating. > > Lund > > -- > Jorgen Lundman | <lund...@lundman.net> > Unix Administrator | +81 (0)3 -5456-2687 ext 1017 (work) > Shibuya-ku, Tokyo | +81 (0)90-5578-8500 (cell) > Japan | +81 (0)3 -3375-1767 (home) > _______________________________________________ > zfs-discuss mailing list > zfs-discuss@opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss