Re: [zfs-discuss] help diagnosing system hang

2008-12-06 Thread Ethan Erchinger
Ethan Erchinger wrote:
> Here is a sample set of messages at that time. It looks like timeouts
> on the SSD for various requested blocks. Maybe I need to talk with
> Intel about this issue.
>
Keeping everyone up-to-date, for those who care, I've RMA'd the Intel drive, and will retest when

Re: [zfs-discuss] help diagnosing system hang

2008-12-05 Thread Ethan Erchinger
Richard Elling wrote:
> The answer may lie in the /var/adm/messages file which should report
> if a reset was received or sent.
Here is a sample set of messages at that time. It looks like timeouts on the SSD for various requested blocks. Maybe I need to talk with Intel about this issue.
Ethan

Re: [zfs-discuss] help diagnosing system hang

2008-12-05 Thread Richard Elling
Ethan Erchinger wrote:
> Richard Elling wrote:
>>>
>>> asc = 0x29
>>> ascq = 0x0
>>
>> ASC/ASCQ 29/00 is POWER ON, RESET, OR BUS DEVICE RESET OCCURRED
>> http://www.t10.org/lists/asc-num.htm#ASC_29
>>
>> [this should be more descriptive as the codes are, more-or-less,
>> standardiz
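For anyone reading the raw asc/ascq values out of their own logs, the decode quoted above can be sketched as a small lookup. This is only an illustration: the table name `ASC_ASCQ` and the function `describe_sense` are made up for this example, and the table carries just the one 29/00 entry quoted in this thread. Anything else should be looked up in the T10 list linked above.

```python
# Hypothetical lookup table; the only entry is the 29/00 description
# quoted in this thread. Extend it from the T10 ASC/ASCQ list as needed.
ASC_ASCQ = {
    (0x29, 0x00): "POWER ON, RESET, OR BUS DEVICE RESET OCCURRED",
}


def describe_sense(asc: int, ascq: int) -> str:
    """Return a readable description for an ASC/ASCQ pair."""
    text = ASC_ASCQ.get((asc, ascq), "unknown (see the T10 ASC/ASCQ list)")
    # Codes are conventionally written as two-digit hex, e.g. 29/00.
    return f"ASC/ASCQ {asc:02X}/{ascq:02X}: {text}"


if __name__ == "__main__":
    # Values as they appear in the quoted output: asc = 0x29, ascq = 0x0
    print(describe_sense(0x29, 0x0))
```

Pairs that are not in the table come back as "unknown" rather than a guessed description.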

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Ethan Erchinger
Richard Elling wrote:
>>
>> asc = 0x29
>> ascq = 0x0
>
> ASC/ASCQ 29/00 is POWER ON, RESET, OR BUS DEVICE RESET OCCURRED
> http://www.t10.org/lists/asc-num.htm#ASC_29
>
> [this should be more descriptive as the codes are, more-or-less,
> standardized, I'll try to file an RFE, unless

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Richard Elling
Ethan Erchinger wrote:
> Richard Elling wrote:
>>
>> I've seen these symptoms when a large number of errors were reported
>> in a short period of time and memory was low. What does "fmdump -eV"
>> show?
>>
> fmdump -eV shows lots of messages like this, and yea, I believe that
> to be sd16 whi

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Ethan Erchinger
Tim wrote:
> Are you leaving ANY RAM for ZFS to do its thing? If you're consuming
> ALL system memory for just this file/application, I would expect the
> system to fall over and die.
>
Hmm. I believe that the kernel should manage that relationship for me. If the system cannot manage swa

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Tim
On Thu, Dec 4, 2008 at 11:55 AM, Ethan Erchinger <[EMAIL PROTECTED]> wrote:
> Ross wrote:
> > I'm no expert, but the first thing I'd ask is whether you could repeat
> > that test without using compression? I'd be quite worried about how a
> > system is going to perform when it's basically running

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Ethan Erchinger
Ross wrote:
> I'm no expert, but the first thing I'd ask is whether you could repeat that
> test without using compression? I'd be quite worried about how a system is
> going to perform when it's basically running off a 50GB compressed file.
>
Yes, this does occur with compression off, but

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Ethan Erchinger
Richard Elling wrote:
> I've seen these symptoms when a large number of errors were reported
> in a short period of time and memory was low. What does "fmdump -eV"
> show?
>
fmdump -eV shows lots of messages like this, and yea, I believe that to be sd16 which is the SSD:
Dec 03 2008 08:31:11

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Richard Elling
Ethan Erchinger wrote:
> Hi all,
>
> First, I'll say my intent is not to spam a bunch of lists, but after
> posting to opensolaris-discuss I had someone communicate with me offline
> that these lists would possibly be a better place to start. So here we
> are. For those on all three lists, sorr

Re: [zfs-discuss] help diagnosing system hang

2008-12-04 Thread Ross
I'm no expert, but the first thing I'd ask is whether you could repeat that test without using compression? I'd be quite worried about how a system is going to perform when it's basically running off a 50GB compressed file. There seem to be a lot of variables here, on quite a few new systems, a

[zfs-discuss] help diagnosing system hang

2008-12-03 Thread Ethan Erchinger
Hi all, First, I'll say my intent is not to spam a bunch of lists, but after posting to opensolaris-discuss I had someone communicate with me offline that these lists would possibly be a better place to start. So here we are. For those on all three lists, sorry for the repetition. Second, thi