Re: [zfs-discuss] Troubleshooting help on ZFS

2011-01-23 Thread Steve Kellam
Thanks to everyone who replied to my call for help.  I really appreciate it.  
I turned the server off for a few hours and rebooted again.  This time many of 
the problems seemed to resolve.  It has been running smoothly now for a day.

I took the advice of one post and updated to OpenIndiana build 148, as that 
seems the most promising upgrade path at this point, and it has a more recent 
version of ZFS, which will hopefully be more stable.

Thanks again.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Troubleshooting help on ZFS

2011-01-20 Thread Michael Schuster
On Thu, Jan 20, 2011 at 01:47, Steve Kellam
opensolaris-sjksn...@sneakemail.com wrote:
> I have a home media server set up using OpenSolaris.  All my experience with
> OpenSolaris has been through setting up and maintaining this server, so it is
> rather limited.  I have run into some problems recently and I am not sure of
> the best way to troubleshoot them.  I was hoping to get some feedback on
> possible fixes.
>
> I am running SunOS 5.11 snv_134.  It is running on a tower with 6 HDDs
> configured as a raidz2 array.  Motherboard: ECS 945GCD-M(1.0) Intel Atom 330
> Intel 945GC Micro ATX Motherboard/CPU Combo.  Memory: 4GB.
>
> I set this up about a year ago and have had very few problems.  I was
> streaming a movie off the server a few days ago and all of a sudden lost
> connectivity with the server.  When I checked the server, there was no output
> on the display, but the power supply seemed to be running and the fans were
> going.
> The next day it started working again and I was able to log in.  The SMB and
> NFS file servers were connecting without problems.
>
> Now I am able to connect remotely via SSH.  I am able to bring up a zpool
> status screen that shows no problems.  It reports no known data errors.  I am
> able to go to the top-level data directories, but when I cd into the
> sub-directories the SSH connection freezes.
>
> I have tried to do a ZFS scrub on the pool and it only gets to 0.02% and
> never gets beyond that, but it does not report any errors.  Now, also, I am
> unable to stop the scrub.  I use the zpool scrub -s command but this freezes
> the SSH connection.
> When I reboot, it is still trying to scrub but not making progress.
>
> I have the system connected to a battery backup with surge protection and I'm
> not aware of any spikes in electricity recently.  I have not made any
> modifications to the system.  All the drives were run through SpinRite
> less than a couple of months ago without any data errors.
>
> I can't figure out how this happened all of a sudden or how best to
> troubleshoot it.
>
> If you have any help or technical wisdom to offer, I'd appreciate it, as this
> has been frustrating.

Look in /var/adm/messages (and the rotated /var/adm/messages.* files) to see
whether there's anything interesting around the time you saw the loss of
connectivity, and since then, and take it from there.
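
A minimal sketch of that log check, assuming the usual syslog line format that
starts with a "Mon DD" timestamp.  The helper name messages_around and the
date in the usage comment are illustrative assumptions, not part of Solaris or
of the original report; substitute the day the connection was actually lost.

```shell
# Hypothetical helper (not a Solaris tool): print the lines from one or more
# syslog-style files whose timestamp begins with the given "Mon DD" prefix.
# Note that syslog pads single-digit days with a space, e.g. "Jan  8".
messages_around() {
    day="$1"
    shift
    # -h drops the file-name prefix so current and rotated logs read as one
    # stream; output follows the order of the file arguments, not time order.
    grep -h "^${day} " "$@"
}

# Usage (the date is an assumption):
#   messages_around "Jan 18" /var/adm/messages /var/adm/messages.*
```

Anything from the disk or network drivers on that day would be the first
thing to read closely.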

HTH
Michael
-- 
regards/mit freundlichen Grüssen
Michael Schuster


Re: [zfs-discuss] Troubleshooting help on ZFS

2011-01-20 Thread Remco Lengers

Hi Steve,

Anything in:

cat /var/adm/messages
fmdump -ev

?

..Remco



Re: [zfs-discuss] Troubleshooting help on ZFS

2011-01-20 Thread Brandon High
On Wed, Jan 19, 2011 at 4:47 PM, Steve Kellam
opensolaris-sjksn...@sneakemail.com wrote:
> I set this up about a year ago and have had very few problems.  I was
> streaming a movie off the server a few days ago and all of a sudden lost
> connectivity with the server.  When I checked the server, there was no output
> on the display, but the power supply seemed to be running and the fans were
> going.
> The next day it started working again and I was able to log in.  The SMB and
> NFS file servers were connecting without problems.

What NIC are you using? This sounds exactly like the problem that I
had with the Realtek controller on a D945GCLF2.

Look into using the gani drivers instead of the shipped Realtek drivers.

-B

-- 
Brandon High : bh...@freaks.com