Re: [zfs-discuss] Freeing unused space in thin provisioned zvols
Darren,

On 02/12/2013 11:25 AM, Darren J Moffat wrote:
> On 02/10/13 12:01, Koopmann, Jan-Peter wrote:
>> Why should it? Unless you do a shrink on the vmdk and use a ZFS variant with
>> SCSI UNMAP support (I believe currently only Nexenta, but correct me if I am
>> wrong) the blocks will not be freed, will they?
>
> Solaris 11.1 has ZFS with SCSI UNMAP support.

Seems I skipped that one... Are there any related tools, e.g. to release all-zero blocks or the like? Of course it's then up to the admin to know what all this is about, or to wreck the data.

Thomas
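For anyone wanting to experiment while waiting for end-to-end UNMAP support, a rough sketch of the usual workaround: fill the guest's free space with zeros, delete the file, and rely on compression on the backing zvol (or a later vmdk shrink) to give the space back. The paths and dataset name below are made up, and whether blocks are actually returned to the pool depends on the guest, the hypervisor and the storage stack:

  # on the storage server: make all-zero blocks compress away to holes (hypothetical dataset)
  zfs set compression=on tank/zvols/vm01

  # inside the guest whose virtual disk lives on that zvol (hypothetical mount point)
  dd if=/dev/zero of=/some/guest/fs/zerofill bs=1M   # fill free space with zeros
  rm /some/guest/fs/zerofill
  sync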
Re: [zfs-discuss] iSCSI access patterns and possible improvements?
Thanks for all the answers, more inline.

On 01/18/2013 02:42 AM, Richard Elling wrote:
> On Jan 17, 2013, at 7:04 AM, Bob Friesenhahn <bfrie...@simple.dallas.tx.us> wrote:
>> On Wed, 16 Jan 2013, Thomas Nau wrote:
>>> Dear all
>>> I've a question concerning possible performance tuning for both iSCSI access
>>> and replicating a ZVOL through zfs send/receive. We export ZVOLs with the
>>> default volblocksize of 8k to a bunch of Citrix Xen Servers through iSCSI.
>>> The pool is made of SAS2 disks (11 x 3-way mirrored) plus mirrored STEC RAM
>>> ZIL SSDs and 128G of main memory. The iSCSI access pattern (1 hour daytime
>>> average) looks like the following (thanks to Richard Elling for the dtrace script)
>>
>> If almost all of the I/Os are 4K, maybe your ZVOLs should use a volblocksize
>> of 4K? This seems like the most obvious improvement.
>
> 4k might be a little small. 8k will have less metadata overhead. In some cases
> we've seen good performance on these workloads up through 32k. Real pain is
> felt at 128k :-)

My only pain so far is the time a send/receive takes without really loading the network at all. VM performance is nothing I worry about at all as it's pretty good. So the key question for me is whether going from 8k to 16k or even 32k would have some benefit for that problem.

[ stuff removed ]

>>> For disaster recovery we plan to sync the pool as often as possible to a
>>> remote location. Running send/receive after a day or so seems to take a
>>> significant amount of time wading through all the blocks and we hardly see
>>> average network traffic going over 45MB/s (almost idle 1G link). So here's
>>> the question: would increasing/decreasing the volblocksize improve the
>>> send/receive operation and what influence might show for the iSCSI side?
>>
>> Matching the volume block size to what the clients are actually using (due to
>> their filesystem configuration) should improve performance during normal
>> operations and should reduce the number of blocks which need to be sent in
>> the backup by reducing write amplification due to overlap blocks.
>
> compression is a good win, too

Thanks for that. I'll use the tools you mentioned to drill down.

Thomas

> --
> richard.ell...@richardelling.com +1-760-896-4422
[zfs-discuss] iSCSI access patterns and possible improvements?
Dear all

I've a question concerning possible performance tuning for both iSCSI access and replicating a ZVOL through zfs send/receive. We export ZVOLs with the default volblocksize of 8k to a bunch of Citrix Xen Servers through iSCSI. The pool is made of SAS2 disks (11 x 3-way mirrored) plus mirrored STEC RAM ZIL SSDs and 128G of main memory.

The iSCSI access pattern (1 hour daytime average) looks like the following (thanks to Richard Elling for the dtrace script), I/O size buckets and counts:

  R    value      count
       256            0
       512        22980
       1024         663
       2048        1075
       4096      433819
       8192       40876
       16384      37218
       32768      82584
       65536      34784
       131072     25968
       262144     14884
       524288        69
       1048576        0

  W    value      count
       256            0
       512        35961
       1024       25108
       2048       10222
       4096     1243634
       8192      521519
       16384     218932
       32768     146519
       65536        112
       131072        15
       262144        78
       524288         0

For disaster recovery we plan to sync the pool as often as possible to a remote location. Running send/receive after a day or so seems to take a significant amount of time wading through all the blocks and we hardly see average network traffic going over 45MB/s (almost idle 1G link). So here's the question: would increasing/decreasing the volblocksize improve the send/receive operation, and what influence might it have on the iSCSI side?

Thanks for any help
Thomas
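One practical note if the 4k route is tried: volblocksize is fixed when a zvol is created, so the usual approach is to create a new zvol with the desired block size and copy the data across at the block level (a plain zfs send/receive would carry the old 8k volblocksize along). A minimal sketch with hypothetical names and sizes, assuming the iSCSI target is quiesced during the copy:

  # create a new zvol with the smaller block size (names/sizes are made up)
  zfs create -V 500G -o volblocksize=4k tank/vm-store-4k
  # block-copy the old volume into it
  dd if=/dev/zvol/rdsk/tank/vm-store of=/dev/zvol/rdsk/tank/vm-store-4k bs=1M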
Re: [zfs-discuss] Solaris 11 System Reboots Continuously Because of a ZFS-Related Panic (7191375)
Jamie,

We ran into the same and had to migrate the pool while imported read-only. On top of that we were advised NOT to use an L2ARC. Maybe you should consider that as well.

Thomas

On 12.12.2012 at 19:21, Jamie Krier <jamie.kr...@gmail.com> wrote:
> I've hit this bug on four of my Solaris 11 servers. Looking for anyone else
> who has seen it, as well as comments/speculation on cause.
>
> This bug is pretty bad. If you are lucky you can import the pool read-only and
> migrate it elsewhere. I've also tried setting zfs:zfs_recover=1,aok=1 with
> varying results.
>
> http://docs.oracle.com/cd/E26502_01/html/E28978/gmkgj.html#scrolltoc
>
> Hardware platform:
>   Supermicro X8DAH, 144GB RAM
>   Supermicro SAS2 JBODs
>   LSI 9200-8e controllers (Phase 13 fw)
>   ZeusRAM log
>   ZeusIOPS SAS L2ARC
>   Seagate ST33000650SS SAS drives
>
> All four servers are running the same hardware, so at first I suspected a
> problem there. I opened a ticket with Oracle which ended with this email:
>
> - We strongly expect that this is a software issue because this problem does
> not happen on Solaris 10. On Solaris 11, it happens with both the SPARC and
> the X64 versions of Solaris. We have quite a few customers who have seen this
> issue and we are in the process of working on a fix. Because we do not know
> the source of the problem yet, I cannot speculate on the time to fix. This
> particular portion of Solaris 11 (the virtual memory sub-system) is quite
> different than in Solaris 10. We re-wrote the memory management in order to
> get ready for systems with much more memory than Solaris 10 was designed to
> handle. Because this is the memory management system, there is not expected to
> be any work-around. Depending on your company's requirements, one possibility
> is to use Solaris 10 until this issue is resolved. I apologize for any
> inconvenience that this bug may cause. We are working on it as a Sev 1
> Priority 1 in sustaining engineering. -
>
> I am thinking about switching to an Illumos distro, but wondering if this
> problem may be present there as well.
>
> Thanks
> - Jamie
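For reference, the settings mentioned above go into /etc/system and take effect after a reboot; treat them as a last-resort recovery aid (with good backups) rather than a fix:

  * /etc/system fragment, as referenced in the thread
  set zfs:zfs_recover=1
  set aok=1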
[zfs-discuss] Changing a VDEV GUID?
Hi ZFS fellows,

I have already seen in the archives of this list some of you changing the GUID of a pool's VDEV, to allow a cloned disk to be imported on the same system as the source. Can someone explain in detail how to achieve that? Has someone already invented the wheel, so I would not have to rewrite a tool to do it?

Subsidiary question: is there an official response from Oracle for such a case? How do they officially deal with binary-copied disks, as it's common to do such copies with UFS to clone SAP environments or databases...

Thanks in advance,
Thomas
Re: [zfs-discuss] ZFS best practice for FreeBSD?
On Thu, 11 Oct 2012, Freddie Cash wrote:
> On Thu, Oct 11, 2012 at 2:47 PM, andy thomas <a...@time-domain.co.uk> wrote:
>> According to a Sun document called something like 'ZFS best practice' I read
>> some time ago, best practice was to use the entire disk for ZFS and not to
>> partition or slice it in any way. Does this advice hold good for FreeBSD as well?
>
> Solaris disabled the disk cache if the disk was partitioned, thus the
> recommendation to always use the entire disk with ZFS. FreeBSD's GEOM
> architecture allows the disk cache to be enabled whether you use the full disk
> or partition it.
>
> Personally, I find it nicer to use GPT partitions on the disk. That way, you
> can start the partition at 1 MB (gpart add -b 2048 on 512B disks, or
> gpart add -b 512 on 4K disks), leave a little wiggle-room at the end of the
> disk, and use GPT labels to identify the disk (using gpt/label-name for the
> device when adding to the pool).

This is apparently what had been done in this case:

  gpart add -b 34 -s 600 -t freebsd-swap da0
  gpart add -b 634 -s 1947525101 -t freebsd-zfs da1

gpart show (stuff relating to a compact flash/SATA boot disk deleted):

  =>  34  1953525101  da0  GPT  (932G)
      34         600    1  freebsd-swap  (2.9G)
     634  1947525101    2  freebsd-zfs   (929G)

  =>  34  1953525101  da2  GPT  (932G)
      34         600    1  freebsd-swap  (2.9G)
     634  1947525101    2  freebsd-zfs   (929G)

  =>  34  1953525101  da1  GPT  (932G)
      34         600    1  freebsd-swap  (2.9G)
     634  1947525101    2  freebsd-zfs   (929G)

Is this a good scheme? The server has 12 GB of memory (upped from 4 GB last year after it kept crashing with out-of-memory reports on the console screen) so I doubt the swap would actually be used very often.

Running Bonnie++ on this pool comes up with some very good results for sequential disk writes, but the latency of over 43 seconds for block reads is terrible and is obviously impacting performance as a mail server, as shown here:

  Version 1.96          ------Sequential Output------  --Sequential Input-  --Random-
  Concurrency   1       -Per Chr- --Block-- -Rewrite-  -Per Chr- --Block--  --Seeks--
  Machine          Size K/sec %CP K/sec %CP K/sec %CP  K/sec %CP K/sec %CP   /sec %CP
  hsl-main.hsl.of   24G    63  67 80584  20 70568  17    314  98 554226 60  410.1  13
  Latency              77140us    43145ms   28872ms      171ms     212ms     232ms

  Version 1.96          ------Sequential Create------  --------Random Create--------
  hsl-main.hsl.office   -Create-- --Read--- -Delete--  -Create-- --Read--- -Delete--
                files    /sec %CP  /sec %CP  /sec %CP   /sec %CP  /sec %CP  /sec %CP
                   16   19261  93 +++++ +++ 18491  97  21542  92 +++++ +++ 20691  94
  Latency              15399us     488us     226us     27733us    103us     138us

The other issue with this server is it needs to be rebooted every 8-10 weeks as disk I/O slows to a crawl over time and the server becomes unusable. After a reboot, it's fine again. I'm told ZFS 13 on FreeBSD 8.0 has a lot of problems so I was planning to rebuild the server with FreeBSD 9.0 and ZFS 28, but I didn't want to make any basic design mistakes in doing this.

Another point about the Sun ZFS paper - it mentioned optimum performance would be obtained with RAIDz pools if the number of disks was between 3 and 9. So I've always limited my pools to a maximum of 9 active disks plus spares, but the other day someone here was talking of seeing hundreds of disks in a single pool! So what is the current advice for ZFS in Solaris and FreeBSD?

> You can have multiple disks in a vdev. And you can have multiple vdevs in a
> pool. Thus, you can have hundreds of disks in a pool. :) Just split the disks
> up into multiple vdevs, where each vdev is under 9 disks each. :)
>
> For example, we have 25 disks in the following pool, but only 6 disks in each
> vdev (plus log/cache):
>
> [root@alphadrive ~]# zpool list -v
> NAME             SIZE  ALLOC   FREE    CAP  DEDUP    HEALTH  ALTROOT
> storage         24.5T  20.7T  3.76T    84%  3.88x  DEGRADED  -
>   raidz2        8.12T  6.78T  1.34T      -
>     gpt/disk-a1     -      -      -      -
>     gpt/disk-a2     -      -      -      -
>     gpt/disk-a3     -      -      -      -
>     gpt/disk-a4     -      -      -      -
>     gpt/disk-a5     -      -      -      -
>     gpt/disk-a6     -      -      -      -
>   raidz2        5.44T  4.57T   888G      -
>     gpt/disk-b1     -      -      -      -
>     gpt/disk-b2     -      -      -      -
>     gpt/disk-b3     -      -      -      -
>     gpt/disk-b4     -      -      -      -
>     gpt/disk-b5     -      -      -      -
>     gpt/disk-b6     -      -      -      -
>   raidz2        5.44T  4.60T   863G      -
>     gpt/disk-c1
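A minimal sketch of the GPT layout Freddie describes, for a new disk; the device name and labels are hypothetical, and -b 2048 gives 1 MB alignment on 512B-sector disks:

  gpart create -s gpt da3                            # hypothetical new disk
  gpart add -b 2048 -t freebsd-zfs -l disk-d1 da3    # 1 MB-aligned, GPT-labelled partition
  zpool create tank mirror gpt/disk-d1 gpt/disk-d2   # refer to disks by label, not device node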
Re: [zfs-discuss] ZFS best practice for FreeBSD?
On Thu, 11 Oct 2012, Richard Elling wrote:
> On Oct 11, 2012, at 2:58 PM, Phillip Wagstrom <phillip.wagst...@gmail.com> wrote:
>> On Oct 11, 2012, at 4:47 PM, andy thomas wrote:
>>> According to a Sun document called something like 'ZFS best practice' I read
>>> some time ago, best practice was to use the entire disk for ZFS and not to
>>> partition or slice it in any way. Does this advice hold good for FreeBSD as well?
>>
>> My understanding of the best practice was that with Solaris prior to ZFS, it
>> disabled the volatile disk cache.
>
> This is not quite correct. If you use the whole disk, ZFS will attempt to
> enable the write cache. To understand why, remember that UFS (and ext, by
> default) can die a horrible death (+fsck) if there is a power outage and
> cached data is not flushed to disk. So by default, Sun shipped some disks with
> write cache disabled by default. For non-Sun disks, they are most often
> shipped with write cache enabled and the most popular file systems (NTFS)
> properly issue cache flush requests as needed (for the same reason ZFS issues
> cache flush requests).

Out of interest, how do you enable the write cache on a disk? I recently replaced a failing Dell-branded disk on a Dell server with an HP-branded disk (both disks were the identical Seagate model) and on running the EFI diagnostics just to check all was well, it reported the write cache was disabled on the new HP disk but enabled on the remaining Dell disks in the server. I couldn't see any way of enabling the cache from the EFI diags so I left it as it was - probably not ideal.

>> With ZFS, the disk cache is used, but after every transaction a cache-flush
>> command is issued to ensure that the data made it to the platters.
>
> Write cache is flushed after uberblock updates and for ZIL writes. This is
> important for uberblock updates, so the uberblock doesn't point to a garbaged
> MOS. It is important for ZIL writes, because they must be guaranteed written
> to media before ack.

Thanks for the explanation, that all makes sense now.

Andy

>> If you slice the disk, enabling the disk cache for the whole disk is
>> dangerous because other file systems (meaning UFS) wouldn't do the
>> cache-flush and there was a risk of data loss should the cache fail due to,
>> say, a power outage. Can't speak to how BSD deals with the disk cache.
>>
>>> I looked at a server earlier this week that was running FreeBSD 8.0 and had
>>> 2 x 1 TB SAS disks in a ZFS 13 mirror with a third identical disk as a
>>> spare. Large file I/O throughput was OK but the mail jail it hosted had
>>> periods when it was very slow with accessing lots of small files. All three
>>> disks (the two in the ZFS mirror plus the spare) had been partitioned with
>>> gpart so that partition 1 was a 6 GB swap and partition 2 filled the rest of
>>> the disk and had a 'freebsd-zfs' partition on it. It was these second
>>> partitions that were part of the mirror. This doesn't sound like a very good
>>> idea to me as surely disk seeks for swap and for ZFS file I/O are bound to
>>> clash, aren't they?
>
> It surely would make a slow, memory-starved swapping system even slower. :)
>
>>> Another point about the Sun ZFS paper - it mentioned optimum performance
>>> would be obtained with RAIDz pools if the number of disks was between 3 and
>>> 9. So I've always limited my pools to a maximum of 9 active disks plus
>>> spares but the other day someone here was talking of seeing hundreds of
>>> disks in a single pool! So what is the current advice for ZFS in Solaris and
>>> FreeBSD?
>>
>> That number was drives per vdev, not per pool.
>>
>> -Phil
>
> --
> richard.ell...@richardelling.com +1-760-896-4422

-
Andy Thomas, Time Domain Systems
Tel: +44 (0)7866 556626
Fax: +44 (0)20 8372 2582
http://www.time-domain.co.uk
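To the write-cache question, at least on the Solaris side: the per-disk write cache can usually be inspected and toggled from the expert mode of format(1M). This is a rough outline of the menu path rather than a script, and whether the setting survives a power cycle depends on the drive:

  # Solaris, as root; menu-driven
  format -e
  #  -> select the disk
  #  -> cache -> write_cache -> display   (show the current state)
  #  -> cache -> write_cache -> enable    (turn it on)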
[zfs-discuss] ZFS best practice for FreeBSD?
According to a Sun document called something like 'ZFS best practice' I read some time ago, best practice was to use the entire disk for ZFS and not to partition or slice it in any way. Does this advice hold good for FreeBSD as well?

I looked at a server earlier this week that was running FreeBSD 8.0 and had 2 x 1 TB SAS disks in a ZFS 13 mirror with a third identical disk as a spare. Large file I/O throughput was OK but the mail jail it hosted had periods when it was very slow with accessing lots of small files. All three disks (the two in the ZFS mirror plus the spare) had been partitioned with gpart so that partition 1 was a 6 GB swap and partition 2 filled the rest of the disk and had a 'freebsd-zfs' partition on it. It was these second partitions that were part of the mirror. This doesn't sound like a very good idea to me as surely disk seeks for swap and for ZFS file I/O are bound to clash, aren't they?

Another point about the Sun ZFS paper - it mentioned optimum performance would be obtained with RAIDz pools if the number of disks was between 3 and 9. So I've always limited my pools to a maximum of 9 active disks plus spares, but the other day someone here was talking of seeing hundreds of disks in a single pool! So what is the current advice for ZFS in Solaris and FreeBSD?

Andy
[zfs-discuss] Question about ZFS snapshots
I have a ZFS filesystem and create weekly snapshots over a period of 5 weeks, called week01, week02, week03, week04 and week05 respectively. My question is: how do the snapshots relate to each other - does week03 contain the changes made since week02, or does it contain all the changes made since the first snapshot, week01, and therefore include those in week02?

To roll back to week03, it's necessary to delete snapshots week04 and week05 first, but what if week01 and week02 have also been deleted - will the rollback still work, or is it necessary to keep earlier snapshots?

Andy
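On the mechanics behind the question: each snapshot only stores references to the blocks that changed since the previous one, but logically every snapshot is a complete point-in-time view of the filesystem, so destroying week01/week02 does not affect a later rollback to week03. A small sketch, with hypothetical dataset and snapshot names:

  zfs snapshot tank/home@week06        # take a new weekly snapshot
  zfs list -t snapshot -r tank/home    # list snapshots and the space each one holds
  zfs rollback -r tank/home@week03     # -r destroys the newer week04/week05 snapshots for you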
Re: [zfs-discuss] Migrating 512 byte block zfs root pool to 4k disks
I tried to use cylinder 0 for root on x86, back in the UFS days, and I lost the vtoc on both mirrored disks. The installer had selected cylinder 1 as the starting cylinder for the first disk, and I thought I should be able to use cylinder 0 as well, so for the mirror I partitioned it to start from 0. I then removed the first disk, changed the starting cylinder to 0, and added it back. When I later tried to reboot the system, both vtocs were lost. I had to whip up a program that scanned the disk to find my UFS filesystems so that I could put a proper vtoc back, boot the system and then change it back to start at cylinder 1. I have always left cylinder 0 alone since then.

Thomas

On 2012-06-16 18:23, Richard Elling wrote:
> On Jun 15, 2012, at 7:37 AM, Hung-Sheng Tsao Ph.D. wrote:
>> by the way, when you format, start with cylinder 1, do not use 0
>
> There is no requirement for skipping cylinder 0 for root on Solaris, and there
> never has been.
> -- richard
Re: [zfs-discuss] ZFS error accessing past end of object
Dear all

I'm about to answer my own question with some really useful hints from Steve, thanks for that!!!

On 03/02/2012 07:43 AM, Thomas Nau wrote:
> Dear all
> I asked before but without much feedback. As the issue is persistent I want to
> give it another try. We disabled panicing for such kind of error in /etc/system
> but still see messages such as
>
>   zfs: accessing past end of object 5b1aa/21a8008 (size=60416 access=32603+32768)
>
> in the logs. Is there any way to identify which object (file?) causes this?

In case your system crashed, there's an option in /etc/system to return an I/O error instead of panicing; just add

  set zfs:zfs_recover=1

After rebooting the system will issue a warning such as

  Mar 4 15:04:16 neo genunix: [ID 762447 kern.warning] WARNING: zfs: accessing
  past end of object bab/21a8008 (size=60416 access=32603+32768)

to syslog whenever the problem shows. The important numbers are

  dataset ID: bab
  object ID:  21a8008

Those would also show up in the kernel panic message. Assuming the ZFS datasets still exist and the file is also unaltered, we don't need crash dump analysis but can use zdb instead. I'm running the latest S11 bits by the way.

First use some bash magic to turn the hex numbers into decimals, as zdb deals with those:

  # printf "%d %d\n" 0xbab 0x21a8008
  2987 35291144

Now look up the dataset; I assume we already have a pretty good idea about which pool to check:

  # zdb -d -r pool1 | grep "ID 2987"
  Dataset pool1/.../backup-clone [ZPL], ID 2987, cr_txg 190496, 1.36T, 15590871 objects

Now look up the actual object. Add more -v to get even more data:

  # zdb -vvv pool1/backup/nfs/home/student1/backup-clone 35291144
  Dataset pool1/.../backup-clone [ZPL], ID 2987, cr_txg 190496, 1.36T, 15590871 objects,
  rootbp DVA[0]=2:2a000cada00:c00:RZM:4 [L0 DMU objset] fletcher4 lzjb LE contiguous
  unique unencrypted 4-copy size=800L/200P birth=190500L/190500P fill=15590871
  cksum=1abc142ec9:88286e5b5e3:18d73114fd4d4:34d14f5e348c05

      Object  lvl  iblk   dblk  dsize  lsize   %full  type
    35291144    1   16K  59.0K    32K  59.0K  100.00  ZFS plain file
                                        168   bonus  System attributes
      dnode flags: USED_BYTES USERUSED_ACCOUNTED
      dnode maxblkid: 0
      path    /zep13/.mozilla/firefox/jlonp9fm.default/cookies.sqlite
      uid     63883
      gid     400
      atime   Tue Jan 10 14:15:34 2012
      mtime   Tue Jan 10 14:23:01 2012
      ctime   Tue Jan 10 14:23:01 2012
      crtime  Wed Oct 19 09:43:54 2011
      gen     15760229
      mode    100644
      size    2228224
      parent  34303712
      links   1
      pflags  4080004

So here comes the funny stuff: according to the object data the size is 2228224 bytes, which of course matches the ls -l output. On the other hand the ZFS read complained after about 32k, which fits the dsize/lsize columns as we use compression. Strange, isn't it? But wait, it gets even more confusing...

The initial panic, now turned into warnings, was caused by the TSM backup client trying to back up the file. We use ZFS clones to get a consistent backup as much as possible. Let's truss the client (cut some path elements):

  access("/backup/pool1/.../zep13/.mozilla/firefox/jlonp9fm.default/cookies.sqlite", R_OK) = 0
  open64("/backup/pool1/.../zep13/.mozilla/firefox/jlonp9fm.default/cookies.sqlite", O_RDONLY|O_NONBLOCK) = 6
  acl("/backup/pool1/.../zep13/.mozilla/firefox/jlonp9fm.default/cookies.sqlite", ACE_GETACL, 1024, 0x0846D780) = 3
  read(6, " S Q L i t e   f o r m a".., 32603)  = 32603
  ...
  read(6, 0x086B0C98, 32768)                    Err#5 EIO

Now let's just cat the file and see what happens:

  # cat /backup/pool1/.../zep13/.mozilla/firefox/jlonp9fm.default/cookies.sqlite > TEST
  # ls -l TEST
  -rw-r--r-- 1 root root 2228224 Mar 4 15:50 TEST

No complaints. Observing the appropriate routine through

  # dtrace -n '::zfs_panic_recover:entry { stack(); }'

does not trigger. Checking the backup client again... no more errors, as truss also confirms:

  open64("/backup/pool1/.../zep13/.mozilla/firefox/jlonp9fm.default/cookies.sqlite", O_RDONLY|O_NONBLOCK) = 6
  read(6, " S Q L i t e   f o r m a".., 32603)  = 32603
  read(6, " 3 4 1 0 8 . 2 . 2 . u t".., 32768)  = 32768
  read(6, "\0\0\0\0\0\0\0\0\0\0\0\0".., 32768)  = 32768
  ...

Double-checking with zdb and ls -i shows the same object ID. I'm really puzzled!!! Any more ideas what's going on?

Thomas
Re: [zfs-discuss] Server upgrade
On Thu, 16 Feb 2012, Edward Ned Harvey wrote:
>> From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss-boun...@opensolaris.org]
>> On Behalf Of andy thomas
>>
>> One of my most vital servers is a Netra 150 dating from 1997 - still going
>> strong, crammed with 12 x 300 Gb disks and running Solaris 9. I think one
>> ought to have more faith in Sun hardware.
>
> If it's one of your most vital, I think you should have less faith in Sun
> hardware. If it's one of your "nobody really cares, I can easily replace it"
> servers then... sounds good. Keep it on as long as it's alive.

Well, it's used as an off-site backup server whose content is in addition mirrored to another Linux server internally, and as all the Netra's disks are UFS, if I ever had a problem with it I'd just pull them all out, transfer them to an E450 and power that on in its place.

Andy

-
Andy Thomas, Time Domain Systems
Tel: +44 (0)7866 556626
Fax: +44 (0)20 8372 2582
http://www.time-domain.co.uk
Re: [zfs-discuss] Server upgrade
On Wed, 15 Feb 2012, David Dyer-Bennet wrote:
> While I'm not in need of upgrading my server at an emergency level, I'm
> starting to think about it -- to be prepared (and an upgrade could be
> triggered by a failure at this point; my server dates to 2006).

One of my most vital servers is a Netra 150 dating from 1997 - still going strong, crammed with 12 x 300 Gb disks and running Solaris 9. I think one ought to have more faith in Sun hardware.

Andy
[zfs-discuss] Failing disk(s) or controller in ZFS pool?
On one of our servers, we have a RAIDz1 ZFS pool called 'maths2' consisting of 7 x 300 Gb disks, which in turn contains a single ZFS filesystem called 'home'. Yesterday, using the 'ls' command to list the directories within this pool caused the command to hang for a long period, followed by an 'i/o error' message. 'zpool status -x maths2' reports the pool is healthy but 'iostat -en' shows a rather different story:

  root@e450:~# iostat -en
    ---- errors ---
    s/w  h/w  trn  tot device
      0    0    0    0 fd0
      0    0    0    0 c2t3d0
      0    0    0    0 c2t0d0
      0    0    0    0 c2t1d0
      0    0    0    0 c5t3d0
      0    0    0    0 c4t0d0
      0    0    0    0 c4t1d0
      0    0    0    0 c2t2d0
      0    0    0    0 c4t2d0
      0    0    0    0 c4t3d0
      0    0    0    0 c5t0d0
      0    0    0    0 c5t1d0
      0    0    0    0 c8t0d0
      0    0    0    0 c8t1d0
      0    0    0    0 c8t2d0
      0  503 1658 2161 c9t0d0
      0 2515 6260 8775 c9t1d0
      0    0    0    0 c8t3d0
      0  492 2024 2516 c9t2d0
      0  444 1810 2254 c9t3d0
      0    0    0    0 c5t2d0
      0    1    0    1 rmt/2

Obviously it looks like controller c9, or the cabling associated with it, is in trouble (the server is an Enterprise 450 with multiple disk controllers). On taking the server down and running the 'probe-scsi-all' command from the OBP, one disk, c9t1d0, was reported as being faulty (no media present) but the others seemed fine.

After booting back up, I started scrubbing the maths2 pool and for a long time only disk c9t1d0 reported it was being repaired. After a few hours, another disk on this controller reported being repaired:

        NAME        STATE     READ WRITE CKSUM
        maths2      ONLINE       0     0     0
          raidz1-0  ONLINE       0     0     0
            c5t2d0  ONLINE       0     0     0
            c5t3d0  ONLINE       0     0     0
            c8t3d0  ONLINE       0     0     0
            c9t0d0  ONLINE       0     0     0  21K repaired
            c9t1d0  ONLINE       0     0     0  938K repaired
            c9t2d0  ONLINE       0     0     0
            c9t3d0  ONLINE       0     0     0

  errors: No known data errors

Now, does this point to a controller/cabling/backplane problem, or could all 4 disks on this controller have been corrupted in some way? The O/S is OSOL snv_134 for SPARC and the server has been up and running for nearly a year with no problems to date - there are two other RAIDz1 pools on this server but these are working fine.

Andy

-
Andy Thomas, Time Domain Systems
Tel: +44 (0)7866 556626
Fax: +44 (0)20 8372 2582
http://www.time-domain.co.uk
Re: [zfs-discuss] Failing disk(s) or controller in ZFS pool?
On Tue, 14 Feb 2012, Richard Elling wrote:
> Hi Andy
>
> On Feb 14, 2012, at 10:37 AM, andy thomas wrote:
>> On one of our servers, we have a RAIDz1 ZFS pool called 'maths2' consisting
>> of 7 x 300 Gb disks, which in turn contains a single ZFS filesystem called
>> 'home'. Yesterday, using the 'ls' command to list the directories within this
>> pool caused the command to hang for a long period, followed by an 'i/o error'
>> message. 'zpool status -x maths2' reports the pool is healthy but 'iostat -en'
>> shows a rather different story:
>>
>>   root@e450:~# iostat -en
>>     ---- errors ---
>>     s/w  h/w  trn  tot device
>>       0    0    0    0 fd0
>>       0    0    0    0 c2t3d0
>>       0    0    0    0 c2t0d0
>>       0    0    0    0 c2t1d0
>>       0    0    0    0 c5t3d0
>>       0    0    0    0 c4t0d0
>>       0    0    0    0 c4t1d0
>>       0    0    0    0 c2t2d0
>>       0    0    0    0 c4t2d0
>>       0    0    0    0 c4t3d0
>>       0    0    0    0 c5t0d0
>>       0    0    0    0 c5t1d0
>>       0    0    0    0 c8t0d0
>>       0    0    0    0 c8t1d0
>>       0    0    0    0 c8t2d0
>>       0  503 1658 2161 c9t0d0
>>       0 2515 6260 8775 c9t1d0
>>       0    0    0    0 c8t3d0
>>       0  492 2024 2516 c9t2d0
>>       0  444 1810 2254 c9t3d0
>>       0    0    0    0 c5t2d0
>>       0    1    0    1 rmt/2
>>
>> Obviously it looks like controller c9, or the cabling associated with it, is
>> in trouble (the server is an Enterprise 450 with multiple disk controllers).
>> On taking the server down and running the 'probe-scsi-all' command from the
>> OBP, one disk c9t1d0 was reported as being faulty (no media present) but the
>> others seemed fine.
>
> We see similar symptoms when a misbehaving disk (usually SATA) disrupts the
> other disks in the same fault zone.

OK, I will replace the disk.

>> After booting back up, I started scrubbing the maths2 pool and for a long
>> time, only disk c9t1d0 reported it was being repaired. After a few hours,
>> another disk on this controller reported being repaired:
>>
>>        NAME        STATE     READ WRITE CKSUM
>>        maths2      ONLINE       0     0     0
>>          raidz1-0  ONLINE       0     0     0
>>            c5t2d0  ONLINE       0     0     0
>>            c5t3d0  ONLINE       0     0     0
>>            c8t3d0  ONLINE       0     0     0
>>            c9t0d0  ONLINE       0     0     0  21K repaired
>>            c9t1d0  ONLINE       0     0     0  938K repaired
>>            c9t2d0  ONLINE       0     0     0
>>            c9t3d0  ONLINE       0     0     0
>>
>>   errors: No known data errors
>>
>> Now, does this point to a controller/cabling/backplane problem or could all 4
>> disks on this controller have been corrupted in some way? The O/S is OSOL
>> snv_134 for SPARC and the server has been up and running for nearly a year
>> with no problems to date - there are two other RAIDz1 pools on this server
>> but these are working fine.
>
> Not likely. More likely the faulty disk causing issues elsewhere.

It seems odd that 'zpool status' is not reporting a degraded status and 'zpool status -x' is still saying all pools are healthy. This is a little worrying as I use remote monitoring to keep an eye on all the servers I admin (many of which run Solaris, OpenIndiana and FreeBSD) and one thing that is checked every 15 minutes is the pool status using 'zpool status -x'. But this seems to result in a false sense of security and I could be blissfully unaware that half a pool has dropped out!

> NB, for file and RAID systems that do not use checksums, such corruptions can
> be catastrophic. Yea ZFS!

Yes indeed!

cheers, Andy

-
Andy Thomas, Time Domain Systems
Tel: +44 (0)7866 556626
Fax: +44 (0)20 8372 2582
http://www.time-domain.co.uk
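Since 'zpool status -x' alone did not catch this, a cron check that also looks at the raw driver error counters may help. A naive sketch only; the mail address is made up and the awk column index assumes the 'iostat -en' layout shown above:

  #!/bin/sh
  # report pools that are not healthy plus any device with a non-zero total error count
  OUT=$( zpool status -x | grep -v 'all pools are healthy'
         iostat -en | awk 'NR > 2 && $4 > 0' )
  [ -n "$OUT" ] && echo "$OUT" | mailx -s "disk/pool warning on `hostname`" admin@example.com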
Re: [zfs-discuss] need hint on pool setup
Bob,

On 01/31/2012 09:54 PM, Bob Friesenhahn wrote:
> On Tue, 31 Jan 2012, Thomas Nau wrote:
>> Dear all
>> We have two JBODs with 20 or 21 drives available per JBOD hooked up to a
>> server. We are considering the following setups:
>>
>>   RAIDZ2 made of 4 drives
>>   RAIDZ2 made of 6 drives
>>
>> The first option wastes more disk space but can survive a JBOD failure,
>> whereas the second is more space-effective but the system goes down when a
>> JBOD goes down. Each of the JBODs comes with dual controllers, redundant fans
>> and power supplies, so do I need to be paranoid and use option #1? Of course
>> it also gives us more IOPS, but high-end logging devices should take care of
>> that.
>
> I think that the answer depends on the impact to your business if data is
> temporarily not available. If your business can not survive data being
> temporarily not available (for hours or even a week) then the more
> conservative approach may be warranted.

We are talking about home directories at a university, so some downtime is OK, but for sure not hours or even days. We do regular backups plus snapshot send/receive to a remote location. The main thing I was wondering about is whether it's better to have a downtime if a JBOD fails (rare, I assume) or to keep going without any redundancy left.

> If you have a service contract which assures that a service tech will show up
> quickly with replacement hardware in hand, then this may also influence the
> decision which should be made.

The replacement hardware is kind of on-site, as we use it for disaster recovery at the remote location.

> Another consideration is that since these JBODs connect to a server, the data
> will also be unavailable when the server is down. The server being down may in
> fact be a more significant factor than a JBOD being down.

I skipped that, sorry. Of course all JBODs are connected through multiple SAS HBAs to two servers, so server failure is easy to handle.

Thanks for the thoughts
Thomas
[zfs-discuss] need hint on pool setup
Dear all

We have two JBODs with 20 or 21 drives available per JBOD hooked up to a server. We are considering the following setups:

  RAIDZ2 vdevs made of 4 drives
  RAIDZ2 vdevs made of 6 drives

The first option wastes more disk space but can survive a JBOD failure, whereas the second is more space-effective but the system goes down when a JBOD goes down. Each of the JBODs comes with dual controllers, redundant fans and power supplies, so do I need to be paranoid and use option #1? Of course it also gives us more IOPS, but high-end logging devices should take care of that.

Thanks for any hint
Thomas
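To make option #1 concrete: with 4-disk raidz2 vdevs and two disks of each vdev in each JBOD, losing a whole JBOD removes exactly two disks per vdev, which raidz2 tolerates. A sketch with hypothetical controller/device names (c1 = JBOD A, c2 = JBOD B), showing only the first few vdevs:

  zpool create tank \
    raidz2 c1t0d0 c1t1d0 c2t0d0 c2t1d0 \
    raidz2 c1t2d0 c1t3d0 c2t2d0 c2t3d0 \
    raidz2 c1t4d0 c1t5d0 c2t4d0 c2t5d0
  # ...and so on for the remaining drives, plus log/spare devices as needed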
Re: [zfs-discuss] Stress test zfs
Hi Grant

On 01/06/2012 04:50 PM, Richard Elling wrote:
> Hi Grant,
>
> On Jan 4, 2012, at 2:59 PM, grant lowe wrote:
>> Hi all,
>>
>> I've got a solaris 10 running 9/10 on a T3. It's an oracle box with 128GB
>> memory. Right now oracle... I've been trying to load test the box with
>> bonnie++. I can seem to get 80 to 90 K writes, but can't seem to get more
>> than a couple K for writes. Any suggestions? Or should I take this to a
>> bonnie++ mailing list? Any help is appreciated. I'm kinda new to load testing.
>
> I was hoping Roch (from Oracle) would respond, but perhaps he's not hanging
> out on zfs-discuss anymore? Bonnie++ sux as a benchmark. The best analysis of
> this was done by Roch and published online in the seminal blog post:
> http://137.254.16.27/roch/entry/decoding_bonnie
>
> I suggest you find a benchmark that more closely resembles your expected
> workload and do not rely on benchmarks that provide a summary metric.
> -- richard

I had good experience with filebench. It resembles your workload as well as you are able to describe it, but takes some time to get things set up if you cannot find your workload in one of the many provided examples.

Thomas
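For anyone who has not used it: filebench ships with canned workload personalities and is driven from a small interactive shell. A minimal sketch; the target directory and run time are made up, and option names may differ slightly between filebench versions:

  # filebench
  filebench> load fileserver          # pick a shipped workload personality
  filebench> set $dir=/tank/fbtest    # hypothetical test directory
  filebench> run 60                   # run for 60 seconds and print the per-op summary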
[zfs-discuss] SUNWsmbs SUNWsmbskr for Sparc OSOL snv_134?
Does anyone know where I can still find the SUNWsmbs and SUNWsmbskr packages for the SPARC version of OpenSolaris? I wanted to experiment with ZFS/CIFS on my SPARC server but the ZFS share command fails with:

  # zfs set sharesmb=on tank1/windows
  cannot share 'tank1/windows': smb add share failed

modinfo reports that the nsmb driver is loaded but I think smbsrv also needs to be loaded. The available documentation suggests that SUNWsmbs and SUNWsmbskr need to be installed. My system has SUNWsmbfskr installed, and according to pkginfo this provides 'SMB/CIFS File System client support (Kernel)' - is this the same package as SUNWsmbskr?

Thanks in advance for any suggestions,

Andy
[zfs-discuss] does log device (ZIL) require a mirror setup?
Dear all

We use a STEC ZeusRAM as a log device for a 200TB RAID-Z2 pool. As log devices are supposed to be read only after a crash or when booting, and those nice things are pretty expensive, I'm wondering if mirroring the log devices is a must / highly recommended.

Thomas
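If the decision later falls in favour of mirroring, a single existing log device can be turned into a mirror in place. A minimal sketch with hypothetical pool and device names:

  # attach a second SSD to the existing log device, making it a mirrored log
  zpool attach tank c4t0d0 c4t1d0
  # or, when building the pool from scratch:
  # zpool create tank raidz2 <disks...> log mirror c4t0d0 c4t1d0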
Re: [zfs-discuss] ZFS performance question over NFS
Hi Bob

> I don't know what the request pattern from filebench looks like but it seems
> like your ZEUS RAM devices are not keeping up or else many requests are
> bypassing the ZEUS RAM devices. Note that very large synchronous writes will
> bypass your ZEUS RAM device and go directly to a log in the main store. Small
> (<= 128K) writes should directly benefit from the dedicated zil device.
>
> Find a copy of zilstat.ksh and run it while filebench is running in order to
> understand more about what is going on.
>
> Bob

The pattern looks like:

   N-Bytes  N-Bytes/s  N-Max-Rate   B-Bytes  B-Bytes/s  B-Max-Rate  ops  <=4kB  4-32kB  >=32kB
   9588656    9588656     9588656  88399872   88399872    88399872   90      0       0      90
   6662280    6662280     6662280  87031808   87031808    87031808   83      0       0      83
   6366728    6366728     6366728  72790016   72790016    72790016   79      0       0      79
   6316352    6316352     6316352  83886080   83886080    83886080   80      0       0      80
   6687616    6687616     6687616  84594688   84594688    84594688   92      0       0      92
   4909048    4909048     4909048  69238784   69238784    69238784   73      0       0      73
   6605280    6605280     6605280  81924096   81924096    81924096   79      0       0      79
   6895336    6895336     6895336  81625088   81625088    81625088   85      0       0      85
   6532128    6532128     6532128  87486464   87486464    87486464   90      0       0      90
   6925136    6925136     6925136  86118400   86118400    86118400   83      0       0      83

So does it look good, bad or ugly ;)

Thomas
Re: [zfs-discuss] Kernel panic on zpool import. 200G of data inaccessible!
You're probably hitting bug 7056738 - http://wesunsolve.net/bugid/id/7056738

Looks like it's not fixed yet @ oracle anyway...

Were you using crypto on your datasets?

Regards,
Thomas

On Tue, 16 Aug 2011 09:33:34 -0700 (PDT), Stu Whitefish <swhitef...@yahoo.com> wrote:
> ----- Original Message -----
>> From: Alexander Lesle <gro...@tierarzt-mueller.de>
>> To: zfs-discuss@opensolaris.org
>> Sent: Monday, August 15, 2011 8:37:42 PM
>> Subject: Re: [zfs-discuss] Kernel panic on zpool import. 200G of data inaccessible!
>>
>> Hello Stu Whitefish and List,
>>
>> On August, 15 2011, 21:17 Stu Whitefish wrote in [1]:
>>> 7. cannot import old rpool (c0t2d0s0 c0t3d0s0), any attempt causes a kernel
>>> panic, even when booted from different OS versions
>
> Right. I have tried OpenIndiana 151 and Solaris 11 Express (latest from
> Oracle) several times each, as well as 2 new installs of Update 8.
>
>> When I understand you right, your primary interest is to recover your data on
>> the tank pool. Have you checked the way to boot from a Live-DVD, mount your
>> safe place and copy the data to another machine?
>
> Hi Alexander,
>
> Yes of course... the problem is no version of Solaris can import the pool.
> Please refer to the first message in the thread.
>
> Thanks,
> Jim
[zfs-discuss] ZFS performance question over NFS
Dear all.

We finally got all the parts for our new fileserver, following several recommendations we got over this list. We use:

  Dell R715, 96GB RAM, dual 8-core Opterons
  1 10GE Intel dual-port NIC
  2 LSI 9205-8e SAS controllers
  2 DataON DNS-1600 JBOD chassis
  46 Seagate Constellation SAS drives
  2 STEC ZEUS RAM

The base zpool config utilizes 42 drives plus the STECs as mirrored log devices. The Seagates are set up as a stripe of 7 times 6-drive RAIDZ2 chunks, plus as said a dedicated ZIL made of the mirrored STECs. As a quick'n'dirty check we ran filebench with the fileserver workload.

Running locally we get:

  statfile1         5476 ops/s    0.0 mb/s    0.6 ms/op    179 us/op-cpu
  deletefile1       5476 ops/s    0.0 mb/s    1.0 ms/op    454 us/op-cpu
  closefile3        5476 ops/s    0.0 mb/s    0.0 ms/op      5 us/op-cpu
  readfile1         5476 ops/s  729.5 mb/s    0.2 ms/op    128 us/op-cpu
  openfile2         5477 ops/s    0.0 mb/s    0.8 ms/op    204 us/op-cpu
  closefile2        5477 ops/s    0.0 mb/s    0.0 ms/op      5 us/op-cpu
  appendfilerand1   5477 ops/s   42.8 mb/s    0.3 ms/op    184 us/op-cpu
  openfile1         5477 ops/s    0.0 mb/s    0.9 ms/op    209 us/op-cpu
  closefile1        5477 ops/s    0.0 mb/s    0.0 ms/op      6 us/op-cpu
  wrtfile1          5477 ops/s  688.4 mb/s    0.4 ms/op    220 us/op-cpu
  createfile1       5477 ops/s    0.0 mb/s    2.7 ms/op   1068 us/op-cpu

with a single remote client (similar Dell system) using NFS:

  statfile1           90 ops/s    0.0 mb/s   27.6 ms/op    145 us/op-cpu
  deletefile1         90 ops/s    0.0 mb/s   64.5 ms/op    401 us/op-cpu
  closefile3          90 ops/s    0.0 mb/s   25.8 ms/op     40 us/op-cpu
  readfile1           90 ops/s   11.4 mb/s    3.1 ms/op    363 us/op-cpu
  openfile2           90 ops/s    0.0 mb/s   66.0 ms/op    263 us/op-cpu
  closefile2          90 ops/s    0.0 mb/s   22.6 ms/op    124 us/op-cpu
  appendfilerand1     90 ops/s    0.7 mb/s    0.5 ms/op    101 us/op-cpu
  openfile1           90 ops/s    0.0 mb/s   72.6 ms/op    269 us/op-cpu
  closefile1          90 ops/s    0.0 mb/s   43.6 ms/op    189 us/op-cpu
  wrtfile1            90 ops/s   11.2 mb/s    0.2 ms/op    211 us/op-cpu
  createfile1         90 ops/s    0.0 mb/s  226.5 ms/op    709 us/op-cpu

and the same remote client with sync disabled on the server:

  statfile1          479 ops/s    0.0 mb/s    6.2 ms/op    130 us/op-cpu
  deletefile1        479 ops/s    0.0 mb/s   13.0 ms/op    351 us/op-cpu
  closefile3         480 ops/s    0.0 mb/s    3.0 ms/op     37 us/op-cpu
  readfile1          480 ops/s   62.7 mb/s    0.8 ms/op    174 us/op-cpu
  openfile2          480 ops/s    0.0 mb/s   14.1 ms/op    235 us/op-cpu
  closefile2         480 ops/s    0.0 mb/s    6.0 ms/op    123 us/op-cpu
  appendfilerand1    480 ops/s    3.7 mb/s    0.2 ms/op     53 us/op-cpu
  openfile1          480 ops/s    0.0 mb/s   13.7 ms/op    235 us/op-cpu
  closefile1         480 ops/s    0.0 mb/s   11.1 ms/op    190 us/op-cpu
  wrtfile1           480 ops/s   60.3 mb/s    0.2 ms/op    233 us/op-cpu
  createfile1        480 ops/s    0.0 mb/s   35.6 ms/op    683 us/op-cpu

Disabling the ZIL is no option, but I expected a much better performance; especially as the ZEUS RAM only gets us a speed-up of about 1.8x. Is this test realistic for a typical fileserver scenario, or does it require many more clients to push the limits?

Thanks
Thomas
Re: [zfs-discuss] Kernel panic on zpool import. 200G of data inaccessible!
Have you already extracted the core file of the kernel crash? (And by the way, activated a dump device so that such a dump can happen at the next reboot...) Have you also tried applying the latest kernel/zfs patches and importing the pool afterwards?

Thomas

On 08/18/2011 06:40 PM, Stu Whitefish wrote:
> Hi Thomas,
>
> Thanks for that link. That's very similar but not identical. There's a
> different line number in zfs_ioctl.c; mine and Preston's fail on line 1815.
> It could be because of a difference in levels in that module of course, but
> the traceback is not identical either. Ours show brand_sysenter and the one
> you linked to shows brand_sys_syscall. I don't know what all that means but it
> is different. Anyway, at least two of us have identical failures.
>
> I was not using crypto, just a plain jane mirror on 2 drives. Possibly I had
> compression on a few file systems but everything else was allowed to default.
>
> Here are our screenshots in case anybody doesn't want to go through the thread.
>
> http://imageshack.us/photo/my-images/13/zfsimportfail.jpg/
> http://prestonconnors.com/zvol_get_stats.jpg
>
> I hope somebody can help with this. It's not a good feeling having so much
> data gone. Thanks for your help. Oracle, are you listening?
>
> Jim
>
>> ----- Original Message -----
>> From: Thomas Gouverneur <t...@ians.be>
>> To: zfs-discuss@opensolaris.org
>> Cc: Stu Whitefish <swhitef...@yahoo.com>
>> Sent: Thursday, August 18, 2011 1:57:29 PM
>> Subject: Re: [zfs-discuss] Kernel panic on zpool import. 200G of data inaccessible!
>>
>> You're probably hitting bug 7056738 - http://wesunsolve.net/bugid/id/7056738
>> Looks like it's not fixed yet @ oracle anyway...
>> Were you using crypto on your datasets?
>> Regards,
>> Thomas
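For reference, a rough sketch of getting a usable crash dump on Solaris/OpenSolaris so the panic stack can be inspected; the device path is hypothetical and the exact dump file names depend on the release:

  # check / set the dump device and the savecore directory
  dumpadm
  dumpadm -d /dev/zvol/dsk/rpool/dump
  # after the next panic and reboot, extract the dump and open it with mdb
  savecore
  mdb unix.0 vmcore.0
  # inside mdb: ::status and $C show the panic message and stack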
Re: [zfs-discuss] ZFS performance question over NFS
Tim,

The client is identical to the server but with no SAS drives attached. Also, right now only one 1 Gbit Intel NIC is available.

Thomas

On 18.08.2011 at 17:49, Tim Cook <t...@cook.ms> wrote:
> What are the specs on the client?
>
> On Aug 18, 2011 10:28 AM, Thomas Nau <thomas@uni-ulm.de> wrote:
>> Dear all.
>>
>> We finally got all the parts for our new fileserver, following several
>> recommendations we got over this list. We use:
>>
>>   Dell R715, 96GB RAM, dual 8-core Opterons
>>   1 10GE Intel dual-port NIC
>>   2 LSI 9205-8e SAS controllers
>>   2 DataON DNS-1600 JBOD chassis
>>   46 Seagate Constellation SAS drives
>>   2 STEC ZEUS RAM
>>
>> The base zpool config utilizes 42 drives plus the STECs as mirrored log
>> devices. The Seagates are set up as a stripe of 7 times 6-drive RAIDZ2
>> chunks, plus as said a dedicated ZIL made of the mirrored STECs. As a
>> quick'n'dirty check we ran filebench with the fileserver workload.
>>
>> Running locally we get:
>>
>>   statfile1         5476 ops/s    0.0 mb/s    0.6 ms/op    179 us/op-cpu
>>   deletefile1       5476 ops/s    0.0 mb/s    1.0 ms/op    454 us/op-cpu
>>   closefile3        5476 ops/s    0.0 mb/s    0.0 ms/op      5 us/op-cpu
>>   readfile1         5476 ops/s  729.5 mb/s    0.2 ms/op    128 us/op-cpu
>>   openfile2         5477 ops/s    0.0 mb/s    0.8 ms/op    204 us/op-cpu
>>   closefile2        5477 ops/s    0.0 mb/s    0.0 ms/op      5 us/op-cpu
>>   appendfilerand1   5477 ops/s   42.8 mb/s    0.3 ms/op    184 us/op-cpu
>>   openfile1         5477 ops/s    0.0 mb/s    0.9 ms/op    209 us/op-cpu
>>   closefile1        5477 ops/s    0.0 mb/s    0.0 ms/op      6 us/op-cpu
>>   wrtfile1          5477 ops/s  688.4 mb/s    0.4 ms/op    220 us/op-cpu
>>   createfile1       5477 ops/s    0.0 mb/s    2.7 ms/op   1068 us/op-cpu
>>
>> with a single remote client (similar Dell system) using NFS:
>>
>>   statfile1           90 ops/s    0.0 mb/s   27.6 ms/op    145 us/op-cpu
>>   deletefile1         90 ops/s    0.0 mb/s   64.5 ms/op    401 us/op-cpu
>>   closefile3          90 ops/s    0.0 mb/s   25.8 ms/op     40 us/op-cpu
>>   readfile1           90 ops/s   11.4 mb/s    3.1 ms/op    363 us/op-cpu
>>   openfile2           90 ops/s    0.0 mb/s   66.0 ms/op    263 us/op-cpu
>>   closefile2          90 ops/s    0.0 mb/s   22.6 ms/op    124 us/op-cpu
>>   appendfilerand1     90 ops/s    0.7 mb/s    0.5 ms/op    101 us/op-cpu
>>   openfile1           90 ops/s    0.0 mb/s   72.6 ms/op    269 us/op-cpu
>>   closefile1          90 ops/s    0.0 mb/s   43.6 ms/op    189 us/op-cpu
>>   wrtfile1            90 ops/s   11.2 mb/s    0.2 ms/op    211 us/op-cpu
>>   createfile1         90 ops/s    0.0 mb/s  226.5 ms/op    709 us/op-cpu
>>
>> and the same remote client with sync disabled on the server:
>>
>>   statfile1          479 ops/s    0.0 mb/s    6.2 ms/op    130 us/op-cpu
>>   deletefile1        479 ops/s    0.0 mb/s   13.0 ms/op    351 us/op-cpu
>>   closefile3         480 ops/s    0.0 mb/s    3.0 ms/op     37 us/op-cpu
>>   readfile1          480 ops/s   62.7 mb/s    0.8 ms/op    174 us/op-cpu
>>   openfile2          480 ops/s    0.0 mb/s   14.1 ms/op    235 us/op-cpu
>>   closefile2         480 ops/s    0.0 mb/s    6.0 ms/op    123 us/op-cpu
>>   appendfilerand1    480 ops/s    3.7 mb/s    0.2 ms/op     53 us/op-cpu
>>   openfile1          480 ops/s    0.0 mb/s   13.7 ms/op    235 us/op-cpu
>>   closefile1         480 ops/s    0.0 mb/s   11.1 ms/op    190 us/op-cpu
>>   wrtfile1           480 ops/s   60.3 mb/s    0.2 ms/op    233 us/op-cpu
>>   createfile1        480 ops/s    0.0 mb/s   35.6 ms/op    683 us/op-cpu
>>
>> Disabling the ZIL is no option, but I expected a much better performance;
>> especially as the ZEUS RAM only gets us a speed-up of about 1.8x. Is this
>> test realistic for a typical fileserver scenario or does it require many
>> more clients to push the limits?
>>
>> Thanks
>> Thomas
[zfs-discuss] Possible ZFS problem
We are using ZFS on a Sun E450 server (4 x 400 MHz CPU, 1 Gb memory, 18 Gb system disk and 19 x 300 Gb disks running OSOL snv_134) for archive storage where speed is not important. We have 2 RAID-Z1 pools of 8 disks plus one spare disk shared between the two pools, and this has apparently worked well since it was set up several months ago.

However, one of our users recently put a 35 Gb tar.gz file on this server and uncompressed it to a 215 Gb tar file. But when he tried to untar it, after about 43 Gb had been extracted we noticed the disk usage reported by df for that ZFS pool wasn't changing much. Using du -sm on the extracted archive directory showed that the size would increase over a period of 30 seconds or so and then suddenly drop back about 50 Mb and start increasing again. In other words it seems to be going into some sort of a loop, and all we could do was to kill tar and try again, when exactly the same thing happened after 43 Gb had been extracted.

Thinking the tar file could be corrupt, we successfully untarred the file on a Linux system (1 Tb disk with a plain ext3 filesystem). I suspect my problem may be due to limited memory on this system, but are there any other things I should take into consideration? It's not a major problem as the system is intended for storage and users are not supposed to go in and untar huge tarfiles on it as it's not a fast system ;-)

Andy

Andy Thomas, Time Domain Systems
Tel: +44 (0)7866 556626
Fax: +44 (0)20 8372 2582
http://www.time-domain.co.uk
Re: [zfs-discuss] Possible ZFS problem
On Sat, 13 Aug 2011, Bob Friesenhahn wrote:
> On Sat, 13 Aug 2011, andy thomas wrote:
>> However, one of our users recently put a 35 Gb tar.gz file on this server and
>> uncompressed it to a 215 Gb tar file. But when he tried to untar it, after
>> about 43 Gb had been extracted we noticed the disk usage reported by df for
>> that ZFS pool wasn't changing much. Using du -sm on the extracted archive
>> directory showed that the size would increase over a period of 30 seconds or
>> so and then suddenly drop back about 50 Mb and start increasing again. In
>> other words it seems to be going into some sort of a loop and all we could do
>> was to kill tar and try again when exactly the same thing happened after
>> 43 Gb had been extracted.
>
> What 'tar' program were you using? Make sure to also try using the
> Solaris-provided tar rather than something like GNU tar.

I was using GNU tar actually, as the original archive was created on a Linux machine. I will try it again using Solaris tar.

> 1GB of memory is not very much for Solaris to use. A minimum of 2GB is
> recommended for zfs.

We are going to upgrade the system to 4 Gb as soon as possible.

Thanks for the quick response,

Andy
Re: [zfs-discuss] Possible ZFS problem
On Sat, 13 Aug 2011, Joerg Schilling wrote:
> andy thomas <a...@time-domain.co.uk> wrote:
>>> What 'tar' program were you using? Make sure to also try using the
>>> Solaris-provided tar rather than something like GNU tar.
>>
>> I was using GNU tar actually as the original archive was created on a Linux
>> machine. I will try it again using Solaris tar.
>
> GNU tar does not follow the standard when creating archives, so Sun tar may be
> unable to unpack the archive correctly.

So it is GNU tar that is broken and not Solaris tar? I always thought it was the other way round. Thanks for letting me know.

> But GNU tar makes strange things when unpacking symlinks. I recommend to use
> star, it understands GNU tar archives.

I've just installed this (version 1.5a78) from Sunfreeware and am having a play. Danke!

Andy
Re: [zfs-discuss] 512b vs 4K sectors
Richard

On 07/04/2011 03:58 PM, Richard Elling wrote:
> On Jul 4, 2011, at 6:42 AM, Lanky Doodle wrote:
>> Hiya,
>>
>> I've been doing a lot of research surrounding this and ZFS, including some
>> posts on here, though I am still left scratching my head. I am planning on
>> using slow RPM drives for a home media server, and it's these that seem to
>> 'suffer' from a few problems;
>>
>>   Seagate Barracuda LP - looks to be the only true 512b-sector hard disk; serious firmware issues
>>   Western Digital Caviar Green - 4K sectors = crap write performance
>>   Hitachi 5K3000 - variable sector sizing (according to tech. specs)
>>   Samsung SpinPoint F4 - just plain old problems with them
>>
>> What is the best drive of the above 4, and are 4K drives really a no-no with
>> ZFS? Are there any alternatives in the same price bracket?
>
> 4K drives are fine, especially if the workload is read-mostly. Depending on
> the OS, you can tell ZFS to ignore the incorrect physical sector size reported
> by some drives. Today, this is easiest in FreeBSD, a little bit more tricky in
> OpenIndiana (patches and source are available for a few different
> implementations). Or you can just trick them out by starting the pool with a
> 4K sector device that doesn't lie (eg, iscsi target).

Are you referring to the ashift patches, and what do you mean by tricking them by using an iSCSI target?

Thanks,
Thomas
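For context on the ashift question: each top-level vdev records an ashift at creation time (9 for 512-byte alignment, 12 for 4K), and it can be inspected afterwards. A quick check, with a hypothetical pool name:

  # show the ashift recorded for each vdev in the cached pool config
  zdb -C tank | grep ashift
  # ashift: 9  -> 512-byte alignment; ashift: 12 -> 4K alignment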
[zfs-discuss] JBOD recommendation for ZFS usage
Dear all

Sorry if it's kind of off-topic for the list, but after talking to lots of vendors I'm running out of ideas... We are looking for JBOD systems which

  (1) hold 20+ 3.5" SATA drives
  (2) are rack mountable
  (3) have all the nice hot-swap stuff
  (4) allow 2 hosts to connect via SAS (4+ lines per host) and see all available drives as disks, no RAID volume

In a perfect world both hosts would connect, each using two independent SAS connectors. The box will be used in a ZFS Solaris-based fileserver in a fail-over cluster setup. Only one host will access a drive at any given time. It seems that a lot of vendors offer JBODs, but so far I haven't found one in Germany which handles (4). Any hints?
Re: [zfs-discuss] JBOD recommendation for ZFS usage
Thanks Jim and all the others who have replied so far.

On 05/30/2011 11:37 AM, Jim Klimov wrote:
> ... So if your application can live with the unit of failover being a bunch of
> 21 or 24 disks - that might be a way to go. However each head would only have
> one connection to each backplane, and I'm not sure if you can STONITH the
> non-leading head to enforce failovers (and enable the specific PRI/SEC chip of
> the backplane).

That's exactly my point. I don't need any internal failover which restricts which disks a host can see. We want to fail over between hosts, not connections. For the latter we would use another JBOD and let ZFS do the dirty job of mirroring. We have run a similar setup for years, but with FC-connected RAID systems. Over time they are kind of limited when it comes to price/performance.

> Also one point was stressed many times in the docs: these failover backplanes
> require use of SAS drives, no SATA (while the single-path BPs are okay with
> both SAS and SATA). Still, according to the forums, SATA disks on shared
> backplanes often give too much headache and may give too little performance in
> comparison...

I would be fine with SAS as well.

Thomas
Re: [zfs-discuss] raidz DEGRADED state
So there is no current way to specify the creation of a 3-disk raid-z array with a known missing disk?

On 12/5/06, David Bustos <david.bus...@sun.com> wrote:
> Quoth Thomas Garner on Thu, Nov 30, 2006 at 06:41:15PM -0500:
>> I currently have a 400GB disk that is full of data on a linux system. If I
>> buy 2 more disks and put them into a raid-z'ed zfs under solaris, is there a
>> generally accepted way to build a degraded array with the 2 disks, copy the
>> data to the new filesystem, and then move the original disk to complete the
>> array?
>
> No, because we currently can't add disks to a raidz array. You could create a
> mirror instead and then add in the other disk to make a three-way mirror,
> though. Even doing that would be dicey if you only have a single machine,
> though, since Solaris can't natively read the popular Linux filesystems. I
> believe there is freeware to do it, but nothing supported.
>
> David
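There is a frequently mentioned (and entirely unsupported) workaround: stand in a sparse file for the missing disk, build the raidz with it, then take the file offline so the pool runs degraded until the real disk arrives. A rough sketch with hypothetical device names and sizes; use at your own risk, as the pool has no redundancy until the replace completes:

  # create a sparse file the size of the real disks to act as a placeholder
  mkfile -n 400g /var/tmp/fakedisk
  zpool create tank raidz c1t0d0 c1t1d0 /var/tmp/fakedisk
  # run degraded: take the placeholder out of service
  zpool offline tank /var/tmp/fakedisk
  # ...copy the data over, then swap in the real third disk later:
  # zpool replace tank /var/tmp/fakedisk c1t2d0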
[zfs-discuss] nfs issues
I'm having some very strange NFS issues that are driving me somewhat mad. I'm running b134 and have been for months now, without issue. Recently I enabled 2 services to get Bonjour notifications working in OS X:

  /network/dns/multicast:default
  /system/avahi-bridge-dsd:default

and I added a few .service files to /etc/avahi/services/.

Ever since doing this, NFS keeps crashing (I'd say every 30 minutes or so) and falling into the maintenance state. On top of that, I see a TON of core files in /, mixed in with the usual top-level directories (bin boot cdrom dev devices etc export home kernel lib lost+found media mnt net opt platform proc root rpool sbin tank tmp usr):

  core.mountd.1286564233  core.mountd.1286564239  core.mountd.1286574167
  core.mountd.1286574170  core.mountd.1286574173  core.mountd.1286576077
  core.mountd.1286576084  core.mountd.1286579150  core.mountd.1286579153
  core.mountd.1286579221  core.mountd.1286579228  core.mountd.1286583275
  core.mountd.1286583278  core.mountd.1286583281  core.mountd.1286583284
  core.mountd.1286583355  core.mountd.1286583418  core.mountd.1286583420
  core.mountd.1286583423  core.mountd.1286586406  core.mountd.1286586409
  core.mountd.1286586498  core.mountd.1286586501

Running pstack on them shows:

  wonsl...@wonslung-raidz2:/tank/nas/dump/Done# pstack /core.mountd.1286564233
  core '/core.mountd.1286564233' of 22940: /usr/lib/nfs/mountd
  ----------------- lwp# 1 / thread# 1 -----------------
  feeefd1b __lwp_park (fee12a00, 0, fe5f9588, 0) + b
  feee7beb mutex_lock_impl (fe5f9588, 0, 8047d58, 80e6f50, 80e7030, fe5f3000) + 163
  feee7d28 mutex_lock (fe5f9588) + 10
  fe5c3cf5 _svc_run_mt (fe5f4838, fe5f4848, fe5f4858, fe5f4858, 805552d, feeca118) + 69
  fe5c38eb svc_run (1, 2328, 1, 0, 8047dfc, feffb804) + 77
  0805552d main (1, 8047e40, 8047e48, 8047dfc) + 4f9
  080548ed _start (1, 8047ee0, 0, 8047ef4, 8047f11, 8047f22) + 7d
  ----------------- lwp# 2 / thread# 2 -----------------
  feef4367 __pause (8, 200, 8, 5, fe000, fef82000) + 7
  08054de7 nfsauth_svc (0, fef82000, fed4efe8, feeef9fe) + 3b
  feeefa53 _thrp_setup (fedf0a00) + 9b
  feeefce0 _lwp_start (fedf0a00, 0, 0, 0, 0, 0)
  ----------------- lwp# 3 / thread# 3 -----------------
  feef4367 __pause (8, 200, 8, 6, fe000, fef82000) + 7
  08054e43 cmd_svc (0, fef82000, fe46efe8, feeef9fe) + 3b
  feeefa53 _thrp_setup (fedf1200) + 9b
  feeefce0 _lwp_start (fedf1200, 0, 0, 0, 0, 0)
  ----------------- lwp# 4 / thread# 4 -----------------
  08054f49 do_logging_queue (8071110, 806e888, fe36ffc8, 805501a) + 45
  0805502e logging_svc (0, fef82000, fe36ffe8, feeef9fe) + 52
  feeefa53 _thrp_setup (fedf1a00) + 9b
  feeefce0 _lwp_start (fedf1a00, 0, 0, 0, 0, 0)
  ----------------- lwp# 5 / thread# 5 -----------------
  feef4ca1 __door_return (fe270d2c, 8, 0, 0) + 21
  08059280 nfsauth_func (0, fe270dc4, 3c, 0, 0, 8059108) + 178
  feef4cbe __door_return () + 3e
  ----------------- lwp# 6 / thread# 6 -----------------
  feef4ca1 __door_return (0, 0, 0, 0) + 21
  feedb63f door_create_func (0, fef82000, fe171fe8, feeef9fe) + 2f
  feeefa53 _thrp_setup (fedf2a00) + 9b
  feeefce0 _lwp_start (fedf2a00, 0, 0, 0, 0, 0)
  ----------------- lwp# 8 / thread# 8 -----------------
  feef4387 __pollsys (80ca168, 9, 0, 0, fe5f8e38, fef82000) + 7
  fee987f4 poll (80ca168, 9, , fe5c3fc7) + 4c
  fe5c3e49 _svc_run_mt (0, fef82000, fdf73fe8, feeef9fe) + 1bd
  feeefa53 _thrp_setup (fedf3a00) + 9b
  feeefce0 _lwp_start (fedf3a00, 0, 0, 0, 0, 0)

Anyway, I am not an expert and don't really know how to troubleshoot this, so if someone could help, I'd really appreciate it.
[zfs-discuss] Data transfer taking a longer time than expected (Possibly dedup related)
Hi all I'm currently moving a fairly big dataset (~2TB) within the same zpool. Data is being moved from one dataset to another, which has dedup enabled. The transfer started at a fairly slow speed (maybe 12MB/s), but it has now crawled to a near halt. Only 800GB has been moved in 48 hours. I looked for similar problems on the forums and other places, and it seems dedup needs much more RAM than the server currently has (3GB) to perform smoothly for such an operation. My question is, how can I gracefully stop the ongoing operation? What I did was simply mv temp/* new/ in an ssh session (which is still open). Can I disable dedup on the dataset while the transfer is going on? Can I simply Ctrl-C the process to stop it? Should I be careful of anything? Help would be appreciated -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Data transfer taking a longer time than expected (Possibly dedup related)
Thanks, I'm going to do that. I'm just worried about corrupting my data, or other problems. I wanted to make sure there is nothing I really should be careful with. -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
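For reference, the two safest moves discussed in this thread look roughly like this; a sketch, with pool and dataset names as placeholders (zpool status -D is only present on builds new enough to have dedup):

# stop deduplicating new writes; blocks already written stay deduped
zfs set dedup=off tank/new
# show how large the dedup table (DDT) has grown, to judge the RAM pressure
zpool status -D tank

Interrupting the mv with Ctrl-C should only stop the copy: files already moved stay moved, and the file in flight remains intact in the source, since mv only unlinks a source file after its copy completes.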
Re: [zfs-discuss] dedup and handling corruptions - impossible?
You are saying ZFS will detect and rectify this kind of corruption in a deduped pool automatically if enough redundancy is present? Can that fail sometimes? Under what conditions? I would hate to restore a 1.5TB pool from backup just because one 5MB file is gone bust. And I have a known good copy of the file. I raised a technical question and you are going all personal on me. -- This message posted from opensolaris.org zfs checksums every block. When you access a file, it checks that the checksums match. If they do not (corruption) and you have redundancy, it repairs the corruption. It can detect and correct corruption in this way. It didn't seem like anyone got personal with you. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
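A minimal sketch of how that detection and repair is usually exercised by hand (pool name is a placeholder):

# read every block in the pool and repair anything that fails its checksum, where redundancy allows
zpool scrub tank
# afterwards, list any files with permanent (unrepairable) errors
zpool status -v tank

If a file shows up as permanently damaged and a known good copy exists, overwriting it with that copy and clearing the error counters is generally all that is needed, not a restore of the whole pool.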
Re: [zfs-discuss] Upgrade Nevada Kernel
you can upgrade by changing to the dev repository, or if you don't mind re-installing you can download the b134 image at genunix http://www.genunix.org/ On Sat, Aug 21, 2010 at 1:25 AM, Long Tran opensolaris.stor...@gmail.com wrote: Hi, I hit a ZFS bug that should be resolved in snv_134 or later. I'm running snv_111. How do I upgrade to the latest version? Thanks ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
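A sketch of the switch-to-dev-repository path, assuming the stock opensolaris.org publisher name and the dev repository URL that was current for these builds:

# point the publisher at the dev repository
pfexec pkg set-publisher -O http://pkg.opensolaris.org/dev opensolaris.org
# update everything into a new boot environment, then reboot into it
pfexec pkg image-update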
Re: [zfs-discuss] Halcyon ZFS and system monitoring software for OpenSolaris (beta)
On Thu, Aug 19, 2010 at 4:33 PM, Mike Kirk mike.k...@halcyoninc.com wrote: Hi all, Halcyon recently started to add ZFS pool stats to our Solaris Agent, and because many people were interested in the previous OpenSolaris beta* we've rolled it into our OpenSolaris build as well. I've already heard some great feedback about supporting ZIL and ARC stats, which we're hoping to add soon. If you'd like to see what we have now, and maybe try it on your OpenSolaris system, please see the download/screenshot page here: http://forums.halcyoninc.com/showthread.php?p=1018 I know this isn't the best time to be posting about legacy OpenSolaris: we're keeping our eyes on Solaris 11 Express / Illumos and aim to support the more advanced features of Solaris 11 the day it's pushed out the door. Thanks for your time! Regards, Mike dot Kirk at HalcyonInc dot com I just tried this, and i'm getting an error on install. I've also posted in your forums but i thought perhaps someone else on list might know the solutions. anyways, I'm runniong Opensolaris b134, this is the error i receive Seeding the new agent ... ERROR: Failed to run command /opt/Neuron/bin/na usm-seed -s xxx agent. STDOUT/STDERR: /opt/Neuron/bin/na[1009]: eval: line 1: 6470: Memory fault(coredump) Moving log file /tmp/HALNeuronSolaris-install_20100820-29.log to /var/opt/Neuron/install/HALNeuronSolaris-install_20100820-29.log ... any help would be greatly appreciated, i really love the screenshots for this software. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] make df have accurate out upon zfs?
df serves a purpose though. There are other commands which output that information. On Thu, Aug 19, 2010 at 3:01 PM, Fred Liu fred_...@issi.com wrote: Not sure if there were similar threads in this list before. Three scenarios: 1): df cannot count snapshot space in a file system with quota set. 2): df cannot count sub-filesystem space in a file system with quota set. 3): df cannot count space saved by de-dup in a file system with quota set. Are they possible? Btw, what is the difference between /usr/gnu/bin/df and /bin/df? Thanks. Fred ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] make df have accurate out upon zfs?
can't the zfs command provide that information? 2010/8/20 Fred Liu fred_...@issi.com Can you shed more lights on **other commands** which output that information? Appreciations. Fred *From:* Thomas Burgess [mailto:wonsl...@gmail.com] *Sent:* 星期五, 八月 20, 2010 17:34 *To:* Fred Liu *Cc:* ZFS Discuss *Subject:* Re: [zfs-discuss] make df have accurate out upon zfs? df serves a purpose though. There are other commands which output that information.. On Thu, Aug 19, 2010 at 3:01 PM, Fred Liu fred_...@issi.com wrote: Not sure if there was similar threads in this list before. Three scenarios: 1): df cannot count snapshot space in a file system with quota set. 2): df cannot count sub-filesystem space in a file system with quota set. 3): df cannot count space saved by de-dup in a file system with quota set. Are they possible? Btw, what is the difference between /usr/gnu/bin/df and /bin/df? Thanks. Fred ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] make df have accurate out upon zfs?
as for the difference between the two df's, one is the gnu df (liek you'd have on linux) and the other is the solaris df. 2010/8/20 Thomas Burgess wonsl...@gmail.com can't the zfs command provide that information? 2010/8/20 Fred Liu fred_...@issi.com Can you shed more lights on **other commands** which output that information? Appreciations. Fred *From:* Thomas Burgess [mailto:wonsl...@gmail.com] *Sent:* 星期五, 八月 20, 2010 17:34 *To:* Fred Liu *Cc:* ZFS Discuss *Subject:* Re: [zfs-discuss] make df have accurate out upon zfs? df serves a purpose though. There are other commands which output that information.. On Thu, Aug 19, 2010 at 3:01 PM, Fred Liu fred_...@issi.com wrote: Not sure if there was similar threads in this list before. Three scenarios: 1): df cannot count snapshot space in a file system with quota set. 2): df cannot count sub-filesystem space in a file system with quota set. 3): df cannot count space saved by de-dup in a file system with quota set. Are they possible? Btw, what is the difference between /usr/gnu/bin/df and /bin/df? Thanks. Fred ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] make df have accurate out upon zfs?
try something like zfs list -o space zfs -t snapshot stuff like that 2010/8/20 Fred Liu fred_...@issi.com Sure, I know this. What I want to say is following: r...@cn03:~# /usr/gnu/bin/df -h /cn03/3 FilesystemSize Used Avail Use% Mounted on cn03/3298G 154K 298G 1% /cn03/3 r...@cn03:~# /bin/df -h /cn03/3 Filesystem size used avail capacity Mounted on cn03/3 800G 154K 297G 1%/cn03/3 r...@cn03:~# zfs get all cn03/3 NAMEPROPERTY VALUE SOURCE cn03/3 type filesystem - cn03/3 creation Sat Jul 10 9:35 2010 - cn03/3 used 503G - cn03/3 available 297G - cn03/3 referenced 154K - cn03/3 compressratio 1.00x - cn03/3 mountedyes- cn03/3 quota 800G local cn03/3 reservationnone default cn03/3 recordsize 128K default cn03/3 mountpoint /cn03/3default cn03/3 sharenfs rw,root=nfsrootlocal cn03/3 checksum on default cn03/3 compressionoffdefault cn03/3 atime on default cn03/3 deviceson default cn03/3 exec on default cn03/3 setuid on default cn03/3 readonly offdefault cn03/3 zoned offdefault cn03/3 snapdirhidden default cn03/3 aclmodegroupmask default cn03/3 aclinherit restricted default cn03/3 canmount on default cn03/3 shareiscsi offdefault cn03/3 xattr on default cn03/3 copies 1 default cn03/3 version4 - cn03/3 utf8only off- cn03/3 normalization none - cn03/3 casesensitivitysensitive - cn03/3 vscan offdefault cn03/3 nbmand offdefault cn03/3 sharesmb offdefault cn03/3 refquota none default cn03/3 refreservation none default cn03/3 primarycache alldefault cn03/3 secondarycache alldefault cn03/3 usedbysnapshots46.8G - cn03/3 usedbydataset 154K - cn03/3 usedbychildren 456G - cn03/3 usedbyrefreservation 0 - cn03/3 logbiaslatencydefault cn03/3 dedup offdefault cn03/3 mlslabel none default cn03/3 com.sun:auto-snapshot true inherited from cn03 Thanks. Fred *From:* Thomas Burgess [mailto:wonsl...@gmail.com] *Sent:* 星期五, 八月 20, 2010 18:44 *To:* Fred Liu *Cc:* ZFS Discuss *Subject:* Re: [zfs-discuss] make df have accurate out upon zfs? as for the difference between the two df's, one is the gnu df (liek you'd have on linux) and the other is the solaris df. 2010/8/20 Thomas Burgess wonsl...@gmail.com can't the zfs command provide that information? 2010/8/20 Fred Liu fred_...@issi.com Can you shed more lights on **other commands** which output that information? Appreciations. Fred *From:* Thomas Burgess [mailto:wonsl...@gmail.com] *Sent:* 星期五, 八月 20, 2010 17:34 *To:* Fred Liu *Cc:* ZFS Discuss *Subject:* Re: [zfs-discuss] make df have accurate out upon zfs? df serves a purpose though. There are other commands which output that information.. On Thu, Aug 19, 2010 at 3:01 PM, Fred Liu fred_...@issi.com wrote: Not sure if there was similar threads in this list before. Three scenarios: 1): df cannot count snapshot space in a file system with quota set. 2): df cannot count sub-filesystem space in a file system with quota set. 3): df cannot count space saved by de-dup in a file system with quota set. Are they possible? Btw, what is the difference between /usr/gnu/bin/df and /bin/df? Thanks. Fred ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
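Spelled out, the commands being suggested here are roughly the following; a sketch, using the cn03/3 dataset from the output above:

# per-dataset breakdown: space used by the dataset itself, its snapshots, children and reservations
zfs list -o space cn03/3
# list the snapshots under the dataset and the space each one holds
zfs list -r -t snapshot cn03/3

This is where the difference from df shows up: the usedbysnapshots and usedbychildren figures above (46.8G and 456G) account for most of the space under the 800G quota that df cannot attribute to anything.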
[zfs-discuss] lots of errors in logs?
I've been running opensolaris for months, and today while poking around, i noticed a ton of errors in my logs...I'm wondering what they mean and if it's anything to worry about. I've found a few things on google but not a whole lot...anyways, here's a pastie of the log http://pastie.org/1104916 any help would be greatly appreciated ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opensolaris is apparently dead
On Mon, Aug 16, 2010 at 11:17 PM, Frank Cusack frank+lists/z...@linetwo.netwrote: On 8/16/10 9:57 AM -0400 Ross Walker wrote: No, the only real issue is the license and I highly doubt Oracle will re-release ZFS under GPL to dilute it's competitive advantage. You're saying Oracle wants to keep zfs out of Linux? ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss why would Oracle want ZFS in linux when it makes the value of Solaris greater? ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Raidz - what is stored in parity?
On Wed, Aug 11, 2010 at 12:57 AM, Peter Taps ptr...@yahoo.com wrote: Hi Eric, Thank you for your help. At least one part is clear now. I still am confused about how the system is still functional after one disk fails. Consider my earlier example of a 3-disk zpool configured for raidz-1. To keep it simple let's not consider block sizes. Let's say I send a write value abcdef to the zpool. As the data gets striped, we will have 2 characters per disk. disk1 = ab + some parity info disk2 = cd + some parity info disk3 = ef + some parity info Now, if disk2 fails, I lost cd. How will I ever recover this? The parity info may tell me that something is bad but I don't see how my data will get recovered. The only good thing is that any newer data will now be striped over two disks. Perhaps I am missing some fundamental concept about raidz. Regards, Peter I find the best way to understand how parity works is to think back to your algebra class when you'd have something like 1x + 2 = 3 and you could solve for x. It's not EXACTLY like that, but solving the parity stuff is similar to solving for x -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
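To make the analogy concrete, here is a tiny sketch of how single parity (as in raidz1) can recover a lost chunk; the byte values are made up, and real raidz computes parity per block across the actual stripe layout, but the principle is the same XOR relationship:

# parity is the XOR of the data chunks
d1=0xab; d2=0xcd; d3=0xef
p=$(( d1 ^ d2 ^ d3 ))
# if the disk holding d2 dies, XOR of the survivors with the parity gives d2 back
printf 'recovered d2 = %x\n' $(( d1 ^ d3 ^ p ))   # prints cd

So the parity is not just a check value: combined with the surviving data it algebraically "solves for" the missing piece.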
Re: [zfs-discuss] Best usage of SSD-disk in ZFS system
On Fri, Aug 6, 2010 at 6:44 AM, P-O Yliniemi p...@bsd-guide.net wrote: Hello! I have built an OpenSolaris / ZFS based storage system for one of our customers. The configuration is about this: Motherboard/CPU: SuperMicro X7SBE / Xeon (something, sorry - can't remember and do not have my specification nearby) RAM: 8GB ECC (X7SBE won't take more) Drives for storage: 16*1.5TB Seagate ST31500341AS, connected to two AOC-SAT2-MV8 controllers Drives for operating system: 2*80GB Intel X25-M (mirror) ZFS configuration: Two vdevs, raid-z of 7+1 disks per set, striped together (gives a zpool with about 21TB storage space) Disk performance: around 700-800MB/s, tested and timed with 'mkfile' and 'time' (a 40GB file is created in just about a minute) I have a spare X25-M drive of 40GB to use for cache or log (or both), but since the disk array is a lot faster than the SSD-disk, I can not see the advantage in using it as a cache device. Are there any advantages to using a separate log or cache device in this case? Regards, PeO ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss I can tell you for sure that there can be a really nice advantage for sequential writes. to see this yourself, do the following: create a filesystem, share it out over NFS, create a really big tar.gz file and put it in the filesystem, log in from a network client via nfs and extract the tarball using something like: time tar xzfv some.tar.gz do this a few times to get an average, then add the SSD as a log device. I have the exact same motherboard with a very similar setup, and i noticed a 400% nfs performance boost by doing this. try it yourself =) ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
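A sketch of that before/after test, assuming the pool is named tank and the spare X25-M shows up as c2t0d0 (both placeholders):

# baseline: from an NFS client, time a small-file-heavy, sync-heavy workload
time tar xzf some.tar.gz
# on the server, add the SSD as a separate intent log (slog) device
zpool add tank log c2t0d0
# repeat the same extraction from the client and compare the elapsed time
time tar xzf some.tar.gz

The win comes from synchronous NFS writes landing on the SSD instead of the spinning disks, so it shows up even when the array's streaming bandwidth is higher than the SSD's.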
Re: [zfs-discuss] Confused about consumer drives and zfs can someone help?
I've found the Seagate 7200.12 1tb drives and Hitachi 7k2000 2TB drives to be by far the best. I've read lots of horror stories about any WD drive with 4k sectors...it's best to stay away from them. I've also read plenty of people say that the green drives are terrible. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] L2ARC and ZIL on same SSD?
On Wed, Jul 21, 2010 at 12:42 PM, Orvar Korvar knatte_fnatte_tja...@yahoo.com wrote: Are there any drawbacks to partitioning an SSD in two parts and using L2ARC on one partition, and ZIL on the other? Any thoughts? -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss It's not going to be as good as having separate devices, but i can tell you that i did this on my home system and it was WELL worth it. I used one of the sandforce 1500 based SSD's (50 gb). i used 9 gb for ZIL, and the rest for L2ARC adding the zil gave me about a 400-500% nfs write performance improvement. Seeing as you can't ever use more than half your ram for ZIL anyways, the only real downside to doing this is that i/o becomes split between zil and L2arc but realistically it depends on your workload...for mine, i noticed a HUGE benefit from doing this. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
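A sketch of the slice layout being described; the pool name is a placeholder, and s0/s1 are two slices laid out on the SSD beforehand with format (roughly 9 GB for the log, the remainder for cache):

# small slice as the separate intent log, the rest as L2ARC
zpool add tank log c6t5d0s0
zpool add tank cache c6t5d0s1

The ZIL slice only ever needs to hold a few seconds of synchronous writes, so giving most of the device to L2ARC is usually the sensible split.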
Re: [zfs-discuss] NFS performance?
On Fri, Jul 23, 2010 at 3:11 AM, Sigbjorn Lie sigbj...@nixtra.com wrote: Hi, I've been searching around on the Internet to find some help with this, but have been unsuccessful so far. I have some performance issues with my file server. I have an OpenSolaris server with a Pentium D 3GHz CPU, 4GB of memory, and a RAIDZ1 over 4 x Seagate (ST31500341AS) 1,5TB SATA drives. If I compile or even just unpack a tar.gz archive with source code (or any archive with lots of small files), on my Linux client onto an NFS mounted disk from the OpenSolaris server, it's extremely slow compared to unpacking this archive locally on the server. A 22MB .tar.gz file containing 7360 files takes 9 minutes and 12 seconds to unpack over NFS. Unpacking the same file locally on the server is just under 2 seconds. Between the server and client I have a gigabit network, which at the time of testing had no other significant load. My NFS mount options are: rw,hard,intr,nfsvers=3,tcp,sec=sys. Any suggestions as to why this is? Regards, Sigbjorn ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss as someone else said, adding an ssd log device can help hugely. I saw about a 500% nfs write increase by doing this. I've heard of people getting even more. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] NFS performance?
On Fri, Jul 23, 2010 at 5:00 AM, Sigbjorn Lie sigbj...@nixtra.com wrote: I see I have already received several replies, thanks to all! I would not like to risk losing any data, so I believe a ZIL device would be the way for me. I see these exist at different prices. Any reason why I would not buy a cheap one? Like the Intel X25-V SSD 40GB 2,5? What size of ZIL device would be recommended for my pool consisting of 4 x 1,5TB drives? Any brands I should stay away from? Regards, Sigbjorn Like i said, i bought a 50 gb OCZ Vertex Limited Edition...it's like 200 dollars, up to 15,000 random iops (iops is what you want for a fast zil). I've gotten excellent performance out of it. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Maximum zfs send/receive throughput
On 25.06.2010 14:32, Mika Borner wrote: It seems we are hitting a boundary with zfs send/receive over a network link (10Gb/s). We can see peak values of up to 150MB/s, but on average about 40-50MB/s are replicated. This is far from the bandwidth that a 10Gb link can offer. Is it possible that ZFS is giving replication too low a priority/throttling it too much? ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss you can probably improve overall performance by using mbuffer [1] to stream the data over the network. At least some people have reported increased performance. mbuffer will buffer the datastream and disconnect zfs send operations from network latencies. Get it here: original source: http://www.maier-komor.de/mbuffer.html binary package: http://www.opencsw.org/packages/CSWmbuffer/ - Thomas ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
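A sketch of the usual mbuffer pipeline; host name, port, snapshot name and buffer sizes are all illustrative:

# on the receiving host: listen, buffer, and feed zfs receive
mbuffer -s 128k -m 1G -I 9090 | zfs receive -d tank
# on the sending host: stream the snapshot into mbuffer pointed at the receiver
zfs send tank/fs@today | mbuffer -s 128k -m 1G -O recvhost:9090

Because both ends keep a large buffer in memory, zfs send can keep reading from disk while the network catches up, which is where the reported speedups come from.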
Re: [zfs-discuss] OCZ Vertex 2 Pro performance numbers
Conclusion: This device will make an excellent slog device. I'll order them today ;) I have one and i love it...I sliced it though, used 9 gb for ZIL and the rest for L2ARC (my server is on a smallish network with about 10 clients) It made a huge difference in NFS performance and other stuff as well (for instance, doing something like du will run a TON faster than before) For the money, it's a GREAT deal. I am very impressed --Arne ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Erratic behavior on 24T zpool
On Fri, Jun 18, 2010 at 4:42 AM, Pasi Kärkkäinen pa...@iki.fi wrote: On Fri, Jun 18, 2010 at 01:26:11AM -0700, artiepen wrote: Well, I've searched my brains out and I can't seem to find a reason for this. I'm getting bad to medium performance with my new test storage device. I've got 24 1.5T disks with 2 SSDs configured as a zil log device. I'm using the Areca raid controller, the driver being arcmsr. Quad core AMD with 16 gig of RAM, OpenSolaris upgraded to snv_134. The zpool has 2 11-disk raidz2's and I'm getting anywhere between 1MB/sec and 40MB/sec with zpool iostat. On average, though, it's more like 5MB/sec if I watch while I'm actively doing some r/w. I know that I should be getting better performance. How are you measuring the performance? Do you understand that raidz2 with that many disks in it will give you really poor random write performance? -- Pasi i have a media server with 2 raidz2 vdevs 10 drives wide myself without a ZIL (but with a 64 gb l2arc). I can write to it about 400 MB/s over the network, and scrubs show 600 MB/s but it really depends on the type of i/o you have...random i/o across 2 vdevs will be REALLY slow (as slow as the slowest 2 drives in your pool basically) 40 MB/s might be right if it's random...though i'd still expect to see more. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
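One quick way to see whether the workload really is random, and which vdev is the bottleneck, is to watch per-vdev activity while the test runs; a sketch, with the pool name as a placeholder:

# per-vdev operations and bandwidth, refreshed every 5 seconds
zpool iostat -v tank 5
# per-device service times and queue depths as seen by the OS
iostat -xn 5

Lots of small ops with low bandwidth on every disk points at random I/O (where two raidz2 vdevs deliver roughly two disks' worth of IOPS), while one device with much higher service times points at a slow or failing drive.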
Re: [zfs-discuss] Erratic behavior on 24T zpool
On Fri, Jun 18, 2010 at 6:34 AM, Curtis E. Combs Jr. ceco...@uga.eduwrote: Oh! Yes. dedup. not compression, but dedup, yes. dedup may be your problem...it requires some heavy ram and/or decent L2ARC from what i've been reading. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Pool is wrong size in b134
Also, the disks were replaced one at a time last year from 73GB to 300GB to increase the size of the pool. Any idea why the pool is showing up as the wrong size in b134 and have anything else to try? I don't want to upgrade the pool version yet and then not be able to revert back... thanks, Ben ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss sometimes when you upgrade a pool by replacing drives with bigger ones, you have to export the pool, then import it. Or at least that's what i've always done ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
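A sketch of the export/import step described above, plus the pool property that makes the expansion automatic on builds that have it; the pool name is a placeholder:

zpool export tank
zpool import tank
# on builds with the autoexpand property, this avoids the export/import dance for future disk swaps
zpool set autoexpand=on tank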
Re: [zfs-discuss] size of slog device
On Mon, Jun 14, 2010 at 4:41 AM, Arne Jansen sensi...@gmx.net wrote: Hi, I known it's been discussed here more than once, and I read the Evil tuning guide, but I didn't find a definitive statement: There is absolutely no sense in having slog devices larger than then main memory, because it will never be used, right? ZFS will rather flush the txg to disk than reading back from zil? So there is a guideline to have enough slog to hold about 10 seconds of zil, but the absolute maximum value is the size of main memory. Is this correct? I thought it was half the size of memory. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
[zfs-discuss] panic after zfs mount
Dear all We ran into a nasty problem the other day. One of our mirrored zpool hosts several ZFS filesystems. After a reboot (all FS mounted at that time an in use) the machine paniced (console output further down). After detaching one of the mirrors the pool fortunately imported automatically in a faulted state without mounting the filesystems. Offling the unplugged device and clearing the fault allowed us to disable auto-mounting the filesystems. Going through them one by one all but one mounted OK. The one again triggered a panic. We left mounting on that one disabled for now to be back in production after pulling data from the backup tapes. Scrubbing didn't show any error so any idea what's behind the problem? Any chance to fix the FS? Thomas --- panic[cpu3]/thread=ff0503498400: BAD TRAP: type=e (#pf Page fault) rp=ff001e937320 addr=20 occurred in module zfs due to a NULL pointer dereference zfs: #pf Page fault Bad kernel fault at addr=0x20 pid=27708, pc=0xf806b348, sp=0xff001e937418, eflags=0x10287 cr0: 8005003bpg,wp,ne,et,ts,mp,pe cr4: 6f8xmme,fxsr,pge,mce,pae,pse,de cr2: 20cr3: 4194a7000cr8: c rdi: ff0503aaf9f0 rsi:0 rdx:0 rcx: 155cda0b r8: eaa325f0 r9: ff001e937480 rax: 7ff rbx:0 rbp: ff001e937460 r10: 7ff r11:0 r12: ff0503aaf9f0 r13: ff0503aaf9f0 r14: ff001e9375d0 r15: ff001e937610 fsb:0 gsb: ff04e7e5c040 ds: 4b es: 4b fs:0 gs: 1c3 trp:e err:0 rip: f806b348 cs: 30 rfl:10287 rsp: ff001e937418 ss: 38 ff001e937200 unix:die+dd () ff001e937310 unix:trap+177e () ff001e937320 unix:cmntrap+e6 () ff001e937460 zfs:zap_leaf_lookup_closest+40 () ff001e9374f0 zfs:fzap_cursor_retrieve+c9 () ff001e9375b0 zfs:zap_cursor_retrieve+19a () ff001e937780 zfs:zfs_purgedir+4c () ff001e9377d0 zfs:zfs_rmnode+52 () ff001e937810 zfs:zfs_zinactive+b5 () ff001e937860 zfs:zfs_inactive+ee () ff001e9378b0 genunix:fop_inactive+af () ff001e9378d0 genunix:vn_rele+5f () ff001e937ac0 zfs:zfs_unlinked_drain+af () ff001e937af0 zfs:zfsvfs_setup+fb () ff001e937b50 zfs:zfs_domount+16a () ff001e937c70 zfs:zfs_mount+1e4 () ff001e937ca0 genunix:fsop_mount+21 () ff001e937e00 genunix:domount+ae3 () ff001e937e80 genunix:mount+121 () ff001e937ec0 genunix:syscall_ap+8c () ff001e937f10 unix:brand_sys_sysenter+1eb () - GPG fingerprint: B1 EE D2 39 2C 82 26 DA A5 4D E0 50 35 75 9E ED ___ cifs-discuss mailing list cifs-disc...@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/cifs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] panic after zfs mount
Thanks for the link Arne. On 06/13/2010 03:57 PM, Arne Jansen wrote: Thomas Nau wrote: Dear all We ran into a nasty problem the other day. One of our mirrored zpool hosts several ZFS filesystems. After a reboot (all FS mounted at that time an in use) the machine paniced (console output further down). After detaching one of the mirrors the pool fortunately imported automatically in a faulted state without mounting the filesystems. Offling the unplugged device and clearing the fault allowed us to disable auto-mounting the filesystems. Going through them one by one all but one mounted OK. The one again triggered a panic. We left mounting on that one disabled for now to be back in production after pulling data from the backup tapes. Scrubbing didn't show any error so any idea what's behind the problem? Any chance to fix the FS? We had the same problem. Victor pointed my to http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6742788 with a workaround to mount the filesystem read-only to save the data. I still hope to figure out the chain of events that causes this. Did you use any extended attributes on this filesystem? -- Arne To my knowledge we haven't used any extended attributes but I'll double check after mounting the filesystem read-only. As it's one that's exported using Samba it might be indeed the case. For sure a lot of ACLs are used Thomas - GPG fingerprint: B1 EE D2 39 2C 82 26 DA A5 4D E0 50 35 75 9E ED ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] panic after zfs mount
Arne, On 06/13/2010 03:57 PM, Arne Jansen wrote: Thomas Nau wrote: Dear all We ran into a nasty problem the other day. One of our mirrored zpool hosts several ZFS filesystems. After a reboot (all FS mounted at that time an in use) the machine paniced (console output further down). After detaching one of the mirrors the pool fortunately imported automatically in a faulted state without mounting the filesystems. Offling the unplugged device and clearing the fault allowed us to disable auto-mounting the filesystems. Going through them one by one all but one mounted OK. The one again triggered a panic. We left mounting on that one disabled for now to be back in production after pulling data from the backup tapes. Scrubbing didn't show any error so any idea what's behind the problem? Any chance to fix the FS? We had the same problem. Victor pointed my to http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6742788 with a workaround to mount the filesystem read-only to save the data. I still hope to figure out the chain of events that causes this. Did you use any extended attributes on this filesystem? -- Arne Mounting the FS read-only worked, thanks again. I checked the attributes and the set for all files is: {archive,nohidden,noreadonly,nosystem,noappendonly,nonodump,noimmutable,av_modified,noav_quarantined,nonounlink} so just the default ones Thomas - GPG fingerprint: B1 EE D2 39 2C 82 26 DA A5 4D E0 50 35 75 9E ED ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Reconfiguring a RAID-Z dataset
Yeah, this is what I was thinking too... Is there anyway to retain snapshot data this way? I've read about the ZFS replay/mirror features, but my impression was that this was more so for a development mirror for testing rather than a reliable backup? This is the only way I know of that one could do something like this. Is there some other way to create a solid clone, particularly with a machine that won't have the same drive configuration? I recently used zfs send/recv to copy a bunch of datasets from a raidz2 box to a box made on mirrors. It works fine. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Reconfiguring a RAID-Z dataset
On Sun, Jun 13, 2010 at 12:18 AM, Joe Auty j...@netmusician.org wrote: Thomas Burgess wrote: Yeah, this is what I was thinking too... Is there any way to retain snapshot data this way? I've read about the ZFS replay/mirror features, but my impression was that this was more so for a development mirror for testing rather than a reliable backup? This is the only way I know of that one could do something like this. Is there some other way to create a solid clone, particularly with a machine that won't have the same drive configuration? I recently used zfs send/recv to copy a bunch of datasets from a raidz2 box to a box made on mirrors. It works fine. ZFS send/recv looks very cool and very convenient. I wonder what it was that I read that suggested not relying on it for backups? Maybe this was alluding to the notion that like relying on RAID for a backup, if there is corruption your mirror (i.e. machine you are using with zfs recv) will be corrupted too? At any rate, thanks for answering this question! At some point if I go this route I'll test send and recv functionality to give all of this a dry run. well, it's not considered to be an enterprise-ready backup solution. I think this is due to the fact that you can't recover a single file from a zfs send stream, but despite this limitation it's still VERY handy. Another reason, from what i understand by reading this list, is that the zfs send streams aren't resilient. If you do not pipe it directly into a zfs receive, it might get corrupted and be worthless (basically don't save the output of zfs send and expect to receive it later). again, this is not relevant if you are doing a zfs send into a zfs receive at the other end. I think the 2 reasons i just gave are the reasons people have warned against it...but still, it's damn amazing. -- Joe Auty, NetMusician NetMusician helps musicians, bands and artists create beautiful, professional, custom designed, career-essential websites that are easy to maintain and to integrate with popular social networks. www.netmusician.org j...@netmusician.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
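A sketch of the recommended pattern, piping send straight into receive instead of saving the stream; snapshot names, dataset names and the backup host are placeholders:

# initial full replication of a dataset tree, properties included
zfs snapshot -r tank/data@backup1
zfs send -R tank/data@backup1 | ssh backuphost zfs receive -Fd backup
# later runs only send the changes since the last common snapshot
zfs snapshot -r tank/data@backup2
zfs send -R -i tank/data@backup1 tank/data@backup2 | ssh backuphost zfs receive -Fd backup

Because the receive side ends up with real, browsable filesystems and snapshots, single files can still be recovered there, even though they can't be pulled out of a raw stream.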
Re: [zfs-discuss] zfs corruptions in pool
On 06.06.2010 08:06, devsk wrote: I had an unclean shutdown because of a hang and suddenly my pool is degraded (I realized something is wrong when python dumped core a couple of times). This is before I ran scrub: pool: mypool state: DEGRADED status: One or more devices has experienced an error resulting in data corruption. Applications may be affected. action: Restore the file in question if possible. Otherwise restore the entire pool from backup. see: http://www.sun.com/msg/ZFS-8000-8A scan: scrub repaired 0 in 0h7m with 0 errors on Mon May 31 09:00:27 2010 config: NAMESTATE READ WRITE CKSUM mypool DEGRADED 0 0 0 c6t0d0s0 DEGRADED 0 0 0 too many errors errors: Permanent errors have been detected in the following files: mypool/ROOT/May25-2010-Image-Update:0x3041e mypool/ROOT/May25-2010-Image-Update:0x31524 mypool/ROOT/May25-2010-Image-Update:0x26d24 mypool/ROOT/May25-2010-Image-Update:0x37234 //var/pkg/download/d6/d6be0ef348e3c81f18eca38085721f6d6503af7a mypool/ROOT/May25-2010-Image-Update:0x25db3 //var/pkg/download/cb/cbb0ff02bcdc6649da3763900363de7cff78ec72 mypool/ROOT/May25-2010-Image-Update:0x26cf6 I ran scrub and this is what it has to say afterwards. pool: mypool state: DEGRADED status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: http://www.sun.com/msg/ZFS-8000-9P scan: scrub repaired 0 in 0h11m with 0 errors on Sat Jun 5 22:43:54 2010 config: NAMESTATE READ WRITE CKSUM mypool DEGRADED 0 0 0 c6t0d0s0 DEGRADED 0 0 0 too many errors errors: No known data errors Few of questions: 1. Have the errors really gone away? Can I just clear and be content that errors are really gone? 2. Why did the errors occur anyway if ZFS guarantees on-disk consistency? I wasn't writing anything. Those files were definitely not being touched when the hang and unclean shutdown happened. I mean I don't mind if I create or modify a file and it doesn't land on disk because on unclean shutdown happened but a bunch of unrelated files getting corrupted, is sort of painful to digest. 3. The action says Determine if the device needs to be replaced. How the heck do I do that? Is it possible that this system runs on a virtual box? At least I've seen such a thing happen on a Virtual Box but never on a real machine. The reason why the error have gone away might be that meta data has three copies IIRC. So if your disk only had corruptions in the meta data area these errors can be repaired by scrubbing the pool. The smartmontools might help you figuring out if the disk is broken. But if you only had an unexpected shutdown and now everything is clean after a scrub, I wouldn't expect the disk to be broken. You can get the smartmontools from opencsw.org. If your system is really running on a Virtual Box I'd recommend that you turn of disk write caching of Virtual Box. Search the OpenSolaris forum of Virtual Box. There is an article somewhere how to do this. IIRC the subject is somethink like 'zfs pool curruption'. But it is also somewhere in the docs. HTH, Thomas ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Snapshots, txgs and performance
Very interesting. This could be useful for a number of us. Would you be willing to share your work? -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Ideal SATA/SAS Controllers for ZFS
On Wed, May 26, 2010 at 5:47 PM, Brandon High bh...@freaks.com wrote: On Sat, May 15, 2010 at 4:01 AM, Marc Bevand m.bev...@gmail.com wrote: I have done quite some research over the past few years on the best (ie. simple, robust, inexpensive, and performant) SATA/SAS controllers for ZFS. I've spent some time looking at the capabilities of a few controllers based on the questions about the SiI3124 and PMP support. According to the docs, the Marvell 88SX6081 driver doesn't support NCQ or PMP, though the card does. While I'm not really performance bound on my system, I imagine NCQ would help performance a bit, at least for scrubs or resilvers. Even more so because I'm using the slow WD10EADS drives. This raises the question of whether a SAS controller supports NCQ for sata drives. Would an LSI 1068e based controller? What about a LSI 2008 based card? If that is the chip on the AOC-SAT2-MV8 then i'm pretty sure it does support NCQ. I'm also pretty sure the LSI supports NCQ. I'm not 100% sure though ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Ideal SATA/SAS Controllers for ZFS
I thought it did...I couldn't imagine sun using that chip in the original thumper if it didn't support NCQ...also, i've read where people have had to DISABLE ncq on this driver to fix one bug or another (as a workaround) On Wed, May 26, 2010 at 8:40 PM, Marty Faltesek marty.falte...@oracle.com wrote: On Wed, 2010-05-26 at 17:18 -0700, Brandon High wrote: If that is the chip on the AOC-SAT2-MV8 then i'm pretty sure it does support NCQ Not according to the driver documentation: http://docs.sun.com/app/docs/doc/819-2254/marvell88sx-7d In addition, the 88SX6081 device supports the SATA II Phase 1.0 specification features, including SATA II 3.0 Gbps speed, SATA II Port Multiplier functionality and SATA II Port Selector. Currently the driver does not support native command queuing, port multiplier or port selector functionality. The driver source isn't available (or I couldn't find it) so it's not easy to confirm. marvell88sx does support NCQ. This man page error was corrected in nevada build 138. Marty ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
[zfs-discuss] question about zpool iostat output
I was just wondering: I added a SLOG/ZIL to my new system today...i noticed that the L2ARC shows up under its own heading...but the SLOG/ZIL doesn't...is this correct? see:

               capacity     operations    bandwidth
pool        alloc   free   read  write   read  write
----------  -----  -----  -----  -----  -----  -----
rpool       15.3G  44.2G      0      0      0      0
  c6t4d0s0  15.3G  44.2G      0      0      0      0
----------  -----  -----  -----  -----  -----  -----
tank        10.9T  7.22T      0  2.43K      0   300M
  raidz2    10.9T  7.22T      0  2.43K      0   300M
    c4t6d0      -      -      0    349      0  37.6M
    c4t5d0      -      -      0    350      0  37.6M
    c5t7d0      -      -      0    350      0  37.6M
    c5t3d0      -      -      0    350      0  37.6M
    c8t0d0      -      -      0    354      0  37.6M
    c4t7d0      -      -      0    351      0  37.6M
    c4t3d0      -      -      0    350      0  37.6M
    c5t8d0      -      -      0    349      0  37.6M
    c5t0d0      -      -      0    348      0  37.6M
    c8t1d0      -      -      0    353      0  37.6M
  c6t5d0s0       0  8.94G      0      0      0      0
cache           -      -      -      -      -      -
  c6t5d0s1  37.5G      0      0    158      0  19.6M
----------  -----  -----  -----  -----  -----  -----

It seems sort of strange to me that it doesn't look like this instead:

               capacity     operations    bandwidth
pool        alloc   free   read  write   read  write
----------  -----  -----  -----  -----  -----  -----
rpool       15.3G  44.2G      0      0      0      0
  c6t4d0s0  15.3G  44.2G      0      0      0      0
----------  -----  -----  -----  -----  -----  -----
tank        10.9T  7.22T      0  2.43K      0   300M
  raidz2    10.9T  7.22T      0  2.43K      0   300M
    c4t6d0      -      -      0    349      0  37.6M
    c4t5d0      -      -      0    350      0  37.6M
    c5t7d0      -      -      0    350      0  37.6M
    c5t3d0      -      -      0    350      0  37.6M
    c8t0d0      -      -      0    354      0  37.6M
    c4t7d0      -      -      0    351      0  37.6M
    c4t3d0      -      -      0    350      0  37.6M
    c5t8d0      -      -      0    349      0  37.6M
    c5t0d0      -      -      0    348      0  37.6M
    c8t1d0      -      -      0    353      0  37.6M
log             -      -      -      -      -      -
  c6t5d0s0       0  8.94G      0      0      0      0
cache           -      -      -      -      -      -
  c6t5d0s1  37.5G      0      0    158      0  19.6M
----------  -----  -----  -----  -----  -----  -----

___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] USB Flashdrive as SLOG?
The last couple times i've read this question, people normally responded with: It depends...you might not even NEED a slog; there is a script floating around which can help determine that... If you could benefit from one, it's going to be IOPS which help you...so if the usb drive has more iops than your pool configuration does, then it might give some benefit. but then again, usb might not be as safe either, and on an older pool version you may want to mirror it. On Tue, May 25, 2010 at 8:11 AM, Kyle McDonald kmcdon...@egenera.com wrote: Hi, I know the general discussion is about flash SSD's connected through SATA/SAS or possibly PCI-E these days. So excuse me if I'm asking something that makes no sense... I have a server that can hold 6 U320 SCSI disks. Right now I put in 5 300GB for a data pool, and 1 18GB for the root pool. I've been thinking lately that I'm not sure I like the root pool being unprotected, but I can't afford to give up another drive bay. So recently the idea occurred to me to go the other way. If I were to get 2 USB Flash Thumb drives say 16 or 32 GB each, not only would i be able to mirror the root pool, but I'd also be able to put a 6th 300GB drive into the data pool. That led me to wonder whether partitioning out 8 or 12 GB on a 32GB thumb drive would be beneficial as a slog?? I bet the USB bus won't be as good as SATA or SAS, but will it be better than the internal ZIL on the U320 drives? This seems like at least a win-win, and possibly a win-win-win. Is there some other reason I'm insane to consider this? -Kyle ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
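If the thumb drives do go in, mirroring the log guards against exactly the failure mode being worried about here; a minimal sketch, with placeholder device names:

# add the two USB sticks as a mirrored log device
zpool add tank log mirror c7t0d0 c8t0d0

On pool versions 19 and later a log device can also be removed again with zpool remove if the experiment doesn't pay off.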
Re: [zfs-discuss] can you recover a pool if you lose the zil (b134+)
Is there a best practice on keeping a backup of the zpool.cache file? Is it possible? Does it change with changes to vdevs? -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] question about zpool iostat output
i am running the last release from the genunix page uname -a output: SunOS wonslung-raidz2 5.11 snv_134 i86pc i386 i86pc Solaris On Tue, May 25, 2010 at 10:33 AM, Cindy Swearingen cindy.swearin...@oracle.com wrote: Hi Thomas, This looks like a display bug. I'm seeing it too. Let me know which Solaris release you are running and I will file a bug. Thanks, Cindy On 05/25/10 01:42, Thomas Burgess wrote: I was just wondering: I added a SLOG/ZIL to my new system today...i noticed that the L2ARC shows up under it's own headingbut the SLOG/ZIL doesn'tis this correct? see: capacity operationsbandwidth poolalloc free read write read write -- - - - - - - rpool 15.3G 44.2G 0 0 0 0 c6t4d0s0 15.3G 44.2G 0 0 0 0 -- - - - - - - tank10.9T 7.22T 0 2.43K 0 300M raidz210.9T 7.22T 0 2.43K 0 300M c4t6d0 - - 0349 0 37.6M c4t5d0 - - 0350 0 37.6M c5t7d0 - - 0350 0 37.6M c5t3d0 - - 0350 0 37.6M c8t0d0 - - 0354 0 37.6M c4t7d0 - - 0351 0 37.6M c4t3d0 - - 0350 0 37.6M c5t8d0 - - 0349 0 37.6M c5t0d0 - - 0348 0 37.6M c8t1d0 - - 0353 0 37.6M c6t5d0s0 0 8.94G 0 0 0 0 cache - - - - - - c6t5d0s1 37.5G 0 0158 0 19.6M It seems sort of strange to me that it doesn't look like this instead: capacity operationsbandwidth poolalloc free read write read write -- - - - - - - rpool 15.3G 44.2G 0 0 0 0 c6t4d0s0 15.3G 44.2G 0 0 0 0 -- - - - - - - tank10.9T 7.22T 0 2.43K 0 300M raidz210.9T 7.22T 0 2.43K 0 300M c4t6d0 - - 0349 0 37.6M c4t5d0 - - 0350 0 37.6M c5t7d0 - - 0350 0 37.6M c5t3d0 - - 0350 0 37.6M c8t0d0 - - 0354 0 37.6M c4t7d0 - - 0351 0 37.6M c4t3d0 - - 0350 0 37.6M c5t8d0 - - 0349 0 37.6M c5t0d0 - - 0348 0 37.6M c8t1d0 - - 0353 0 37.6M log - - - - - - c6t5d0s0 0 8.94G 0 0 0 0 cache - - - - - - c6t5d0s1 37.5G 0 0158 0 19.6M ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] questions about zil
On Tue, May 25, 2010 at 11:27 AM, Edward Ned Harvey solar...@nedharvey.com wrote: From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss- boun...@opensolaris.org] On Behalf Of Nicolas Williams I recently got a new SSD (ocz vertex LE 50gb) It seems to work really well as a ZIL performance wise. I know it doesn't have a supercap so let's say dataloss occurs...is it just dataloss or is it pool loss? Just dataloss. WRONG! The correct answer depends on your version of solaris/opensolaris. More specifically, it depends on the zpool version. The latest fully updated sol10 and the latest opensolaris release (2009.06) only go up to zpool 14 or 15. But zpool 19 is when a ZIL loss doesn't permanently offline the whole pool. I know this is available in the developer builds. The best answer to this, I think, is in the ZFS Best Practices Guide: (uggh, it's down right now, so I can't paste the link) If you have zpool < 19, and you lose an unmirrored ZIL, then you lose your pool. Also, as a configurable option apparently, I know on my systems, it also meant I needed to power cycle. If you have zpool >= 19, and you lose an unmirrored ZIL, then performance will be degraded, but everything continues to work as normal. Apparently the most common mode of failure for SSD's is also failure to read. To make it worse, a ZIL is only read after system crash, which means the possibility of having a failed SSD undetected must be taken into consideration. If you do discover a failed ZIL after crash, with zpool < 19 your pool is lost. But with zpool >= 19 only the unplayed writes are lost. With zpool >= 19, your pool will be intact, but you would lose up to 30sec of writes that occurred just before the crash. I didn't ask about losing my zil. I asked about power loss taking out my pool. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
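To check which of those cases applies to a given system, the pool and software versions can be queried directly; a sketch, with the pool name as a placeholder:

# show this pool's on-disk version
zpool get version tank
# report any pools formatted with an older version than the software supports
zpool upgrade
# list every pool version the installed bits support, with a one-line description of each
zpool upgrade -v

With no pool argument, zpool upgrade only reports; it changes nothing until a pool name (or -a) is given.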
Re: [zfs-discuss] questions about zil
At least to me, this was not clearly not asking about losing zil and was not clearly asking about power loss. Sorry for answering the question you thought you didn't ask. I was only responding to your response of WRONG!!! The guy wasn't wrong in regards to my questions. I'm sorry for not making THAT more clear in my post. I would suggest clarifying your question, by saying instead: so lets' say *power*loss occurs Then it would have been clear what you were asking. I'm pretty sure i did ask about power lossor at least it was implied by my point about the UPS. You're right, i probably should have been a little more clear. Since this is a SSD you're talking about, unless you have enabled nonvolatile write cache on that disk (which you should never do), and the disk incorrectly handles cache flush commands (which it should never do), then the supercap is irrelevant. All ZIL writes are to be done synchronously. This SSD doesn't use nonvolatile write cache (at least i don't think it does, it's a SF-1500 based ssd) I might be wrong about this, but i thought one of the biggest things about the sandforce was that it doesn't use DRAM If you have a power loss, you don't lose your pool, and you also don't lose any writes in the ZIL. You do, however, lose any async writes that were not yet flushed to disk. There is no way to prevent that, regardless of ZIL configuration. Yes, I know that i lose async writesi just wasn't sure if that resulted in an issue...I might be somewhat confused to how the ZIL works but i thought the point of the ZIL was to pretend a write actually happened when it may not have actually been flushed to disk yet...in this case, a write to the zil might not make it to diski just didn't know if this could result in a loss of a pool due to some sort of corruption of the uberblock or something.I'm not entirely up to speed on the voodoo that is ZFS. I wasn't trying to be rude, sorry if it came off like that. I am aware of the issue regarding removing the ZIL on non-dev versions of opensolarisi am on b134 so that doesnt' apply to me. Thanks ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] questions about zil
On Tue, May 25, 2010 at 12:38 PM, Bob Friesenhahn bfrie...@simple.dallas.tx.us wrote: On Mon, 24 May 2010, Thomas Burgess wrote: It's a sandforce sf-1500 model but without a supercap...here's some info on it: Maximum Performance * Max Read: up to 270MB/s * Max Write: up to 250MB/s * Sustained Write: up to 235MB/s * Random Write 4k: 15,000 IOPS * Max 4k IOPS: 50,000 Isn't there a serious problem with these specifications? It seems that the minimum assured performance values (and the median) are much more interesting than some maximum performance value which might only be reached during a brief instant of the device lifetime under extremely ideal circumstances. It seems that toilet paper may be of much more practical use than these specifications. In fact, I reject them as being specifications at all. The Apollo reentry vehicle was able to reach amazing speeds, but only for a single use. Bob What exactly do you mean? Every review i've read about this device has been great. Every review i've read about the sandforce controllers has been good too...are you saying they have shorter lifetimes? Everything i've read has made them sound like they should last longer than typical ssds because they write less actual data -- Bob Friesenhahn bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/ GraphicsMagick Maintainer, http://www.GraphicsMagick.org/ ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] questions about zil
Also, let me note, it came with a 3 year warranty so I expect it to last at least 3 years...but if it doesn't, i'll just return it under the warranty. On Tue, May 25, 2010 at 1:26 PM, Thomas Burgess wonsl...@gmail.com wrote: On Tue, May 25, 2010 at 12:38 PM, Bob Friesenhahn bfrie...@simple.dallas.tx.us wrote: On Mon, 24 May 2010, Thomas Burgess wrote: It's a sandforce sf-1500 model but without a supercapheres some info on it: Maximum Performance * Max Read: up to 270MB/s * Max Write: up to 250MB/s * Sustained Write: up to 235MB/s * Random Write 4k: 15,000 IOPS * Max 4k IOPS: 50,000 Isn't there a serious problem with these specifications? It seems that the minimum assured performance values (and the median) are much more interesting than some maximum performance value which might only be reached during a brief instant of the device lifetime under extremely ideal circumstances. It seems that toilet paper may of much more practical use than these specifications. In fact, I reject them as being specifications at all. The Apollo reentry vehicle was able to reach amazing speeds, but only for a single use. Bob What exactly do you mean? Every review i've read about this device has been great. Every review i've read about the sandforce controllers has been good toare you saying they have shorter lifetimes? Everything i've read has made them sound like they should last longer than typical ssds because they write less actual data -- Bob Friesenhahn bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/ GraphicsMagick Maintainer,http://www.GraphicsMagick.org/ ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
[zfs-discuss] questions about zil
I recently got a new SSD (ocz vertex LE 50gb). It seems to work really well as a ZIL performance wise. My question is, how safe is it? I know it doesn't have a supercap so let's say dataloss occurs...is it just dataloss or is it pool loss? also, does the fact that i have a UPS matter? the numbers i'm seeing are really nice...these are some nfs tar times before zil:

real 2m21.498s
user 0m5.756s
sys 0m8.690s

real 2m23.870s
user 0m5.756s
sys 0m8.739s

and these are the same ones after.

real 0m32.739s
user 0m5.708s
sys 0m8.515s

real 0m35.580s
user 0m5.707s
sys 0m8.526s

I also sliced it...i have 16 gb ram so i used a 9 gb slice for zil and the rest for L2ARC. this is for a single 10 drive raidz2 vdev so far...i'm really impressed with the performance gains ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] questions about zil
ZFS is always consistent on-disk, by design. Loss of the ZIL will result in loss of the data in the ZIL which hasn't been flushed out to the hard drives, but otherwise, the data on the hard drives is consistent and uncorrupted. This is what i thought. I have read this list on and off for awhile now but i'm not a guru...I see a lot of stuff about the intel ssd and disabling the write cache...so i just wasn't sure. This is good news. It avoids the scenario of losing data in your ZIL due to power loss (and, of course, the rest of your system). So, yes, if you actually care about your system, I'd recommend at least a minimal UPS to allow for quick shutdown after a power loss. yes, i have a nice little UPS. I've tested it a few times and it seems to work well. It gives me about 20 minutes of power and can even send commands via a script to shut down the system before the battery goes dry. That's going to pretty much be the best-case use for the ZIL - NFS writes being synchronous. Of course, using the rest of the SSD for L2ARC is likely to be almost (if not more) helpful for performance for a wider variety of actions. yes, i have another machine without a zil (i bought a kingston 64 gb ssd on sale and intended to try it as a zil but ultimately decided to just use it as l2arc because of the performance numbers...) but the l2arc helps a ton for my uses. I did slice this ssd...i used 9 gb for zil and the rest for l2arc (about 36 gb). I'm really impressed with this ssd...for only 160 dollars (180 - 20 mail in rebate) it's a killer deal. it can do 235 MB/s sustained writes and has something like 15,000 iops -- Erik Trimble Java System Support Mailstop: usca22-123 Phone: x17195 Santa Clara, CA ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] questions about zil
Not familiar with that model. It's a sandforce sf-1500 model but without a supercap...here's some info on it: Maximum Performance - Max Read: up to 270MB/s - Max Write: up to 250MB/s - Sustained Write: up to 235MB/s - Random Write 4k: 15,000 IOPS - Max 4k IOPS: 50,000 per http://www.ocztechnology.com/products/solid-state-drives/2-5--sata-ii/performance-enterprise-solid-state-drives/ocz-vertex-limited-edition-sata-ii-2-5--ssd.html Wow. That's a pretty huge improvement. :-) - Garrett (newly of Nexenta) yes, i love it. I'm really impressed with this ssd for the money...160 usd (180 - 20 rebate) ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] New SSD options
From earlier in the thread, it sounds like none of the SF-1500 based drives even have a supercap, so it doesn't seem that they'd necessarily be a better choice than the SLC-based X-25E at this point unless you need more write IOPS... Ray I think the upcoming OCZ Vertex 2 Pro will have a supercap. I just bought an OCZ Vertex LE, it doesn't have a supercap but it DOES have some awesome specs otherwise. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
[zfs-discuss] confused
did this come out? http://cr.opensolaris.org/~gman/opensolaris-whats-new-2010-05/ i was googling trying to find info about the next release and ran across this. Does this mean it's actually about to come out before the end of the month or is this something else? ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] confused
never mind...just found more info on this...should have held back from asking On Mon, May 24, 2010 at 1:26 AM, Thomas Burgess wonsl...@gmail.com wrote: did this come out? http://cr.opensolaris.org/~gman/opensolaris-whats-new-2010-05/ i was googling trying to find info about the next release and ran across this. Does this mean it's actually about to come out before the end of the month or is this something else? ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
yah, unfortunately this is the first send. i'm trying to send 9 TB of data. It really sucks because i was at 6 TB when it lost power. On Sat, May 22, 2010 at 2:34 AM, Brandon High bh...@freaks.com wrote: You can resume a send if the destination has a snapshot in common with the source. If you don't, there's nothing you can do. It's probably taking a while to restart because the sends that were interrupted need to be rolled back. Sent from my Nexus One. On May 21, 2010 9:44 PM, Thomas Burgess wonsl...@gmail.com wrote: I can't tell you for sure. For some reason the server lost power and it's taking forever to come back up. (i'm really not sure what happened) anyways, this leads me to my next couple questions: Is there any way to resume a zfs send/recv? Why is it taking so long for the server to come up? it's stuck on Reading ZFS config and there is a FLURRY of hard drive lights blinking (all 10 in sync) On Sat, May 22, 2010 at 12:26 AM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 201... ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
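(For what it's worth, the resume Brandon describes isn't a true resume of the interrupted stream; it means restarting an incremental from the newest snapshot both machines already share. A rough sketch, with tank/media, @base and the ssh target as placeholder names:)

# find the newest snapshot that exists on both sides
zfs list -t snapshot -o name -s creation | grep tank/media
# snapshot the current state and send only the delta since the shared one
pfexec zfs snapshot tank/media@resume
pfexec zfs send -i tank/media@base tank/media@resume | ssh user@newserver pfexec /usr/sbin/zfs recv -F tank/media

With no snapshot on the destination at all, as in the 9 TB case above, the whole stream has to be sent again from the beginning.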
Re: [zfs-discuss] HDD Serial numbers for ZFS
install smartmontools. There is no package for it but it's EASY to install. once you do, you can get output like this:

pfexec /usr/local/sbin/smartctl -d sat,12 -a /dev/rdsk/c5t0d0
smartctl 5.39.1 2010-01-28 r3054 [i386-pc-solaris2.11] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.12 family
Device Model:     ST31000528AS
Serial Number:    6VP06FF5
Firmware Version: CC34
User Capacity:    1,000,204,886,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Sat May 22 11:15:50 2010 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed without error or no self-test has ever been run.
Total time to complete Offline data collection: ( 609) seconds.
Offline data collection capabilities:  (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine recommended polling time:       (   1) minutes.
Extended self-test routine recommended polling time:    ( 192) minutes.
Conveyance self-test routine recommended polling time:  (   2) minutes.
SCT capabilities:              (0x103f) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE     UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   113   099   006    Pre-fail Always       -       55212722
  3 Spin_Up_Time            0x0003   095   095   000    Pre-fail Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age  Always       -       132
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail Always       -       1
  7 Seek_Error_Rate         0x000f   081   060   030    Pre-fail Always       -       136183285
  9 Power_On_Hours          0x0032   091   091   000    Old_age  Always       -       7886
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age  Always       -       132
183 Runtime_Bad_Block       0x       100   100   000    Old_age  Offline      -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age  Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age  Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age  Always       -       0
189 High_Fly_Writes         0x003a   085   085   000    Old_age  Always       -       15
190 Airflow_Temperature_Cel 0x0022   063   054   045    Old_age  Always       -       37 (Lifetime Min/Max 32/40)
194 Temperature_Celsius     0x0022   037   046   000    Old_age  Always       -       37 (0 16 0 0)
195 Hardware_ECC_Recovered  0x001a   048   025   000    Old_age  Always       -       55212722
197 Current_Pending_Sector  0x0012   100   100   000    Old_age  Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age  Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age  Always       -       0
240 Head_Flying_Hours       0x       100   253   000    Old_age  Offline      -       23691039612915
241 Total_LBAs_Written      0x       100   253   000    Old_age  Offline      -       263672243
242 Total_LBAs_Read         0x       100   253   000    Old_age  Offline      -       960644151

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

On Sat, May 22, 2010 at 3:09 AM, Andreas Iannou andreas_wants_the_w...@hotmail.com wrote: I
Re: [zfs-discuss] HDD Serial numbers for ZFS
i don't think there is but it's dirt simple to install. I followed the instructions here: http://cafenate.wordpress.com/2009/02/22/setting-up-smartmontools-on-opensolaris/ On Sat, May 22, 2010 at 3:19 AM, Andreas Iannou andreas_wants_the_w...@hotmail.com wrote: Thanks Thomas, I thought there'd already be a package in the repo for it. Cheers, Andre -- Date: Sat, 22 May 2010 03:17:38 -0400 Subject: Re: [zfs-discuss] HDD Serial numbers for ZFS From: wonsl...@gmail.com To: andreas_wants_the_w...@hotmail.com CC: zfs-discuss@opensolaris.org [ quoted smartctl output removed - it duplicates the listing earlier in the thread ]
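(The linked write-up is essentially a build-from-source recipe. A rough sketch of the steps, assuming a working gcc toolchain is already installed and that 5.39.1 is the tarball on smartmontools.sourceforge.net; version and paths are illustrative only:)

# after downloading and unpacking the smartmontools-5.39.1 source tarball
cd smartmontools-5.39.1
./configure
make
pfexec make install    # installs smartctl under /usr/local/sbin by default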
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
i only care about the most recent snapshot, as this is a growing video collection. i do have snapshots, but i only keep them for when/if i accidentally delete something, or rename something wrong. On Sat, May 22, 2010 at 3:43 AM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 2010 at 10:22 PM, Thomas Burgess wonsl...@gmail.com wrote: yah, it seems that rsync is faster for what i need anyways, at least right now... If you don't have snapshots you want to keep in the new copy, then probably... -B -- Brandon High : bh...@freaks.com ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Understanding ZFS performance.
If you install Opensolaris with the AHCI settings off, then switch them on, it will fail to boot. I had to reinstall with the settings correct. the best way to tell if ahci is working is to use cfgadm. if you see your drives there, ahci is on. if not, then you may need to reinstall with it on (for the rpool at least). On Sat, May 22, 2010 at 4:43 PM, Brian broco...@vt.edu wrote: Is there a way within opensolaris to detect if AHCI is being used by various controllers? I suspect you may be accurate and AHCI is not turned on. The bios for this particular motherboard is fairly confusing on the AHCI settings. The only setting I have is actually in the raid section, and it seems to let me select between IDE/AHCI/RAID as an option. However, I can't tell if it applies only if one is using software RAID. If I set it to AHCI, another screen appears prior to boot that is titled AMD AHCI BIOS. However, opensolaris hangs during booting with this enabled. Is there a way from the grub menu to request opensolaris boot without the splashscreen, but instead boot with debug information printed to the console? -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
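(To spell out the cfgadm check: with the ahci driver bound, the SATA ports show up as attachment points. The sample output line below is illustrative only; controller and device names will differ per system.)

cfgadm -al | grep sata
# sata0/0::dsk/c5t0d0    disk    connected    configured   ok
prtconf -D | grep -i ahci    # also confirms the ahci driver is attached to the controller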
Re: [zfs-discuss] Understanding ZFS performance.
just to make sure i understand what is going on here, you have an rpool which is having performance issues, and you discovered ahci was disabled? you enabled it, and now it won't boot. correct? This happened to me and the solution was to export my storage pool and reinstall my rpool with the ahci settings on. Then i imported my storage pool and all was golden. On Sat, May 22, 2010 at 5:25 PM, Brian broco...@vt.edu wrote: Thanks - I can give reinstalling a shot. Is there anything else I should do first? Should I export my tank pool? -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
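(The same recovery sequence as commands; tank is the data pool named in this thread, and the middle step is the normal installer run after flipping the BIOS to AHCI.)

pfexec zpool export tank     # detach the data pool before the BIOS change / reinstall
# ...enable AHCI in the BIOS and reinstall the OS (rpool) from the live CD...
pfexec zpool import tank     # bring the data pool back into the fresh install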
Re: [zfs-discuss] Understanding ZFS performance.
This didn't work for me. I had the exact same issue a few days ago. My motherboard had the following: Native IDE, AHCI, RAID, Legacy IDE. so naturally i chose AHCI, but it ALSO had a mode called IDE/SATA combined mode. I thought i needed this to use both the ide and any sata ports, turns out it was basically an ide emulation mode for sata. long story short, i ended up with opensolaris installed in IDE mode. I had to reinstall. I tried the livecd/import method and it still failed to boot. On Sat, May 22, 2010 at 5:30 PM, Ian Collins i...@ianshome.com wrote: On 05/23/10 08:52 AM, Thomas Burgess wrote: If you install Opensolaris with the AHCI settings off, then switch them on, it will fail to boot. I had to reinstall with the settings correct. Well you probably didn't have to. Booting from the live CD and importing the pool would have put things right. -- Ian. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Understanding ZFS performance.
this old thread has info on how to switch from ide to sata mode: http://opensolaris.org/jive/thread.jspa?messageID=448758#448758 On Sat, May 22, 2010 at 5:32 PM, Ian Collins i...@ianshome.com wrote: On 05/23/10 08:43 AM, Brian wrote: Is there a way within opensolaris to detect if AHCI is being used by various controllers? I suspect you may be accurate and AHCI is not turned on. The bios for this particular motherboard is fairly confusing on the AHCI settings. The only setting I have is actually in the raid section, and it seems to let me select between IDE/AHCI/RAID as an option. However, I can't tell if it applies only if one is using software RAID. [answered in other post] If I set it to AHCI, another screen appears prior to boot that is titled AMD AHCI BIOS. However, opensolaris hangs during booting with this enabled. Is there a way from the grub menu to request opensolaris boot without the splashscreen, but instead boot with debug information printed to the console? Just hit a key once the bar is moving. -- Ian. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
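(On the verbose-boot question: from memory of the stock OpenSolaris menu.lst layout, so treat the exact line contents as an assumption. At the GRUB menu press e, pick the kernel$ line, press e again, drop the ,console=graphics part of the -B argument and append -v, then Enter and b to boot that one time with kernel messages on the console:)

kernel$ /platform/i86pc/kernel/$ISADIR/unix -B $ZFS-BOOTFS -v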
Re: [zfs-discuss] Understanding ZFS performance.
GREAT, glad it worked for you! On Sat, May 22, 2010 at 7:39 PM, Brian broco...@vt.edu wrote: Ok. What worked for me was booting with the live CD and doing:

pfexec zpool import -f rpool
reboot

After that I was able to boot with AHCI enabled. The performance issues I was seeing are now also gone. I am getting around 100 to 110 MB/s during a scrub. Scrubs are completing in 20 minutes for 1TB of data rather than 1.2 hours. -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
[zfs-discuss] snapshots send/recv
I'm confused. I have a filesystem on server 1 called tank/nas/dump. I made a snapshot called first: zfs snapshot tank/nas/d...@first then i did a zfs send/recv like: zfs send tank/nas/d...@first | ssh wonsl...@192.168.1.xx /bin/pfexec /usr/sbin/zfs recv tank/nas/dump this worked fine. next, today, i wanted to send what has changed. i did zfs snapshot tank/nas/d...@second now, here's where i'm confused. from reading the man page i thought this command would work: pfexec zfs send -i tank/nas/d...@first tank/nas/d...@second | ssh wonsl...@192.168.1.15 /bin/pfexec /usr/sbin/zfs recv -vd tank/nas/dump but i get an error: cannot receive incremental stream: destination tank/nas/dump has been modified since most recent snapshot why is this? ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] snapshots send/recv
On Sat, May 22, 2010 at 9:26 PM, Ian Collins i...@ianshome.com wrote: On 05/23/10 01:18 PM, Thomas Burgess wrote: this worked fine. next, today, i wanted to send what has changed. i did zfs snapshot tank/nas/d...@second now, here's where i'm confused. from reading the man page i thought this command would work: pfexec zfs send -i tank/nas/d...@first tank/nas/d...@second | ssh wonsl...@192.168.1.15 /bin/pfexec /usr/sbin/zfs recv -vd tank/nas/dump It should (you can shorten the first snap to first). but i get an error: cannot receive incremental stream: destination tank/nas/dump has been modified since most recent snapshot Well has it? Even wandering around the filesystem with atime enabled will cause this error. Add -F to the receive to force a roll-back to the state after the original snap. Ahh, this i didn't know. Yes, i DID cd to the dir and check some stuff and atime IS enabled. this is NOT very intuitive. adding -F worked...thanks -- Ian. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
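(Two usual ways around the "modified since most recent snapshot" error, sketched with the same tank/nas/dump names used above; the property changes are suggestions rather than something from the thread, and user@backuphost stands in for the real target:)

# 1) force the receive to roll the destination back to the last snapshot first
pfexec zfs send -i tank/nas/dump@first tank/nas/dump@second | ssh user@backuphost /usr/sbin/zfs recv -vdF tank/nas/dump
# 2) keep casual reads from dirtying the received copy in the first place (run on the receiving side)
pfexec zfs set atime=off tank/nas/dump
pfexec zfs set readonly=on tank/nas/dump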
Re: [zfs-discuss] snapshots send/recv
ok, so forcing just basically makes it drop whatever changes were made. That's what i was wondering...this is what i expected. On Sun, May 23, 2010 at 12:05 AM, Ian Collins i...@ianshome.com wrote: On 05/23/10 03:56 PM, Thomas Burgess wrote: let me ask a question though. Let's say i have a filesystem tank/something. i make the snapshot tank/someth...@one. i send/recv it. then i do something (add a file...remove something, whatever) on the send side, then i do a send/recv and force it of the next filesystem What do you mean force it of the next filesystem? will the new recv'd filesystem be identical to the original forced snapshot or will it be a combination of the 2? The received filesystem will be identical to the sending one. -- Ian. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
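(A small worked example of that answer, with made-up dataset and host names, showing why the forced receive ends up identical to the sender rather than a blend:)

# initial snapshot and full send
pfexec zfs snapshot tank/something@one
pfexec zfs send tank/something@one | ssh user@host pfexec /usr/sbin/zfs recv tank/something
# change something on the sending side only, snapshot again
touch /tank/something/newfile
pfexec zfs snapshot tank/something@two
# -F first rolls the destination back to @one, then applies the @one -> @two delta,
# so the receiver ends up byte-for-byte as the sender's @two - any local edits on
# the receiving side are simply discarded
pfexec zfs send -i @one tank/something@two | ssh user@host pfexec /usr/sbin/zfs recv -F tank/something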
Re: [zfs-discuss] New SSD options
On the PCIe side, I noticed there's a new card coming from LSI that claims 150,000 4k random writes. Unfortunately this might end up being an OEM-only card. I also noticed on the ddrdrive site that they now have an opensolaris driver and are offering it in a beta program. -- This message posted from opensolaris.org ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] send/recv over ssh
I seem to be getting decent speed with arcfour (this was what i was using to begin with). Thanks for all the help. this honestly was just me being stupid...looking back on yesterday, i can't even remember what i was doing wrong now. i was REALLY tired when i asked this question. On Fri, May 21, 2010 at 2:43 PM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 2010 at 11:28 AM, David Dyer-Bennet d...@dd-b.net wrote: I thought I remembered a none cipher, but couldn't find it the other year and decided I must have been wrong. I did use ssh-1, so maybe I really WAS remembering after all. It may have been in ssh2 as well, or at least the commercial version... I thought it used to be a compile time option for openssh too. Seems a high price to pay to try to protect idiots from being idiots. Anybody who doesn't understand that encryption = none means it's not encrypted and hence not safe isn't safe as an admin anyway. Well, it won't expose your passwords since the key exchange is still encrypted ... That's good, right? Circling back to the original topic, you can use ssh to start up mbuffer on the remote side, then start the send. Something like:

#!/bin/bash
ssh -f r...@${recv_host} mbuffer -q -I ${SEND_HOST}:1234 | zfs recv puddle/tank
sleep 1
zfs send -R tank/foo/bar | mbuffer -O ${RECV_HOST}:1234

When I was moving datasets between servers, I was on the console of both, so manually starting the send/recv was not a problem. I've tried doing it with netcat rather than mbuffer but it was painfully slow, probably due to network buffers. ncat (from the nmap devs) may be a suitable alternative, and can support ssl and certificate based auth. -B -- Brandon High : bh...@freaks.com ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
8:10:12:15:20
        supported_max_cstates           0
        vendor_id                       AuthenticAMD

module: cpu_info                        instance: 7
name:   cpu_info7                       class:    misc
        brand                           AMD Opteron(tm) Processor 6128
        cache_id                        7
        chip_id                         0
        clock_MHz                       2000
        clog_id                         7
        core_id                         7
        cpu_type                        i386
        crtime                          9171.560266487
        current_clock_Hz                20
        current_cstate                  0
        family                          16
        fpu_type                        i387 compatible
        implementation                  x86 (chipid 0x0 AuthenticAMD 100F91 family 16 model 9 step 1 clock 2000 MHz)
        model                           9
        ncore_per_chip                  8
        ncpu_per_chip                   8
        pg_id                           11
        pkg_core_id                     7
        snaptime                        113230.737322698
        socket_type                     G34
        state                           on-line
        state_begin                     1274377645
        stepping                        1
        supported_frequencies_Hz        8:10:12:15:20
        supported_max_cstates           0
        vendor_id                       AuthenticAMD

On Mon, May 17, 2010 at 5:55 PM, Dennis Clarke dcla...@blastwave.org wrote: On 05-17-10, Thomas Burgess wonsl...@gmail.com wrote: psrinfo -pv shows: The physical processor has 8 virtual processors (0-7) x86 (AuthenticAMD 100F91 family 16 model 9 step 1 clock 200 MHz) AMD Opteron(tm) Processor 6128 [ Socket: G34 ] That's odd. Please try this :

# kstat -m cpu_info -c misc
module: cpu_info                        instance: 0
name:   cpu_info0                       class:    misc
        brand                           VIA Esther processor 1200MHz
        cache_id                        0
        chip_id                         0
        clock_MHz                       1200
        clog_id                         0
        core_id                         0
        cpu_type                        i386
        crtime                          3288.24125364
        current_clock_Hz                1199974847
        current_cstate                  0
        family                          6
        fpu_type                        i387 compatible
        implementation                  x86 (CentaurHauls 6A9 family 6 model 10 step 9 clock 1200 MHz)
        model                           10
        ncore_per_chip                  1
        ncpu_per_chip                   1
        pg_id                           -1
        pkg_core_id                     0
        snaptime                        1526742.97169617
        socket_type                     Unknown
        state                           on-line
        state_begin                     1272610247
        stepping                        9
        supported_frequencies_Hz        1199974847
        supported_max_cstates           0
        vendor_id                       CentaurHauls

You should get a LOT more data. Dennis ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
Something i've been meaning to ask. I'm transferring some data from my older server to my newer one. the older server has a socket 775 intel Q9550, 8 gb ddr2 800, and 20 1TB drives in raidz2 (3 vdevs, 2 with 7 drives, one with 6) connected to 3 AOC-SAT2-MV8 cards, spread as evenly across them as i could. The new server is socket g34 based with the opteron 6128 8 core cpu, 16 gb ddr3 1333 ECC ram, and 10 2TB drives (so far) in a single raidz2 vdev connected to 3 LSI SAS3081E-R cards (flashed with IT firmware). I'm sure this is due to something i don't understand, but during zfs send/recv from the old server to the new server (3 send/recv streams) I'm noticing the loadavg on the old server is much less than the new one. this is from top on the old server: load averages: 1.58, 1.57, 1.37; up 5+05:13:17 04:52:42 and this is the newer server: load averages: 6.20, 5.98, 5.30; up 1+05:03:02 18:49:57 shouldn't the newer server have LESS load? Please forgive my ubernoobness. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
is 3 zfs recv's random? On Fri, May 21, 2010 at 10:03 PM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 2010 at 5:54 PM, Thomas Burgess wonsl...@gmail.com wrote: shouldn't the newer server have LESS load? Please forgive my ubernoobness. Depends on what it's doing! Load average is really how many processes are waiting to run, so it's not always a useful metric. If there are processes waiting on disk, you can have high load with almost no cpu use. Check the iowait with iostat or top. You've got a pretty wide stripe, which isn't going to give the best performance, especially for random write workloads. Your old 3 vdev config will have better random write performance. Check to see what's using the CPU with top or prstat. prstat gives better info for threads, imo. -B -- Brandon High : bh...@freaks.com ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
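(The checks Brandon mentions, as concrete commands; the intervals are arbitrary:)

iostat -xcn 30    # per-device %b and wait columns show whether the load is threads stuck on disk
prstat -mL 5      # per-thread microstates (USR/SYS/SLP/LAT) show what the load average is made of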
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
yeah, i'm aware of the performance aspects. I use these servers mostly as hd video servers for my house...they don't need to perform amazingly. I originally went with the setup on the old server because of everything i had read about performance with wide stripes...in all honesty it performed amazingly well, much more than i truly need...i plan to have 2 raidz2 stripes of 10 drives in this server (the new one). At most it will be serving 4-5 HD streams (mostly 720p mkv files, with some 1080p as well). The older server can EASILY max out 2 Gb/s links...i imagine the new server will be able to do this as well...i think a scrub of the old server takes 4-5 hours. i'm not sure what this equates to in MB/s but it's WAY more than i ever really need. This is what led me to use wider stripes in the new server, and i'm honestly considering redoing the old server as well. if i switched to 2 wider stripes instead of 3 i'd gain another TB or two. for my use i don't think that would be a horrible thing. On Fri, May 21, 2010 at 10:03 PM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 2010 at 5:54 PM, Thomas Burgess wonsl...@gmail.com wrote: shouldn't the newer server have LESS load? Please forgive my ubernoobness. Depends on what it's doing! Load average is really how many processes are waiting to run, so it's not always a useful metric. If there are processes waiting on disk, you can have high load with almost no cpu use. Check the iowait with iostat or top. You've got a pretty wide stripe, which isn't going to give the best performance, especially for random write workloads. Your old 3 vdev config will have better random write performance. Check to see what's using the CPU with top or prstat. prstat gives better info for threads, imo. -B -- Brandon High : bh...@freaks.com ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
I can't tell you for sure. For some reason the server lost power and it's taking forever to come back up. (i'm really not sure what happened) anyways, this leads me to my next couple questions: Is there any way to resume a zfs send/recv? Why is it taking so long for the server to come up? it's stuck on Reading ZFS config and there is a FLURRY of hard drive lights blinking (all 10 in sync) On Sat, May 22, 2010 at 12:26 AM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 2010 at 7:57 PM, Thomas Burgess wonsl...@gmail.com wrote: is 3 zfs recv's random? It might be. What do a few reports of 'iostat -xcn 30' look like? -B -- Brandon High : bh...@freaks.com ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
yah, it seems that rsync is faster for what i need anyways, at least right now... On Sat, May 22, 2010 at 1:07 AM, Ian Collins i...@ianshome.com wrote: On 05/22/10 04:44 PM, Thomas Burgess wrote: I can't tell you for sure. For some reason the server lost power and it's taking forever to come back up. (i'm really not sure what happened) anyways, this leads me to my next couple questions: Is there any way to resume a zfs send/recv? Nope. Why is it taking so long for the server to come up? it's stuck on Reading ZFS config and there is a FLURRY of hard drive lights blinking (all 10 in sync) It's cleaning up the mess. If you had a lot of data copied over, it'll take a while deleting it! -- Ian. ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
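(For this "mirror the current files only" use case, the rsync equivalent looks something like the following; the host name and paths are placeholders. Unlike an interrupted send/recv on these builds, a killed rsync can simply be re-run and it picks up roughly where it left off.)

rsync -av --partial --progress /tank/media/ newserver:/tank/media/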
Re: [zfs-discuss] Opteron 6100? Does it work with opensolaris?
0.24.21.5 13 17 c6t4d0 55.72.0 3821.3 91.1 0.3 0.24.73.0 6 10 c6t5d0 81.22.0 5866.7 91.2 0.2 0.41.95.2 5 14 c6t6d0 0.9 227.2 23.4 28545.1 4.7 0.6 20.42.8 63 64 c8t5d0 0.00.00.00.0 0.0 0.00.00.0 0 0 c4t7d0 cpu us sy wt id 39 32 0 29 extended device statistics r/sw/s kr/s kw/s wait actv wsvc_t asvc_t %w %b device 0.00.00.00.0 0.0 0.00.00.0 0 0 fd0 1.52.4 35.4 33.6 0.0 0.03.61.0 0 0 c8t1d0 105.81.9 5560.1 95.5 0.3 0.32.72.9 8 16 c5t0d0 109.62.5 5546.4 95.6 0.0 0.50.04.3 0 13 c4t0d0 110.82.6 5504.7 95.4 0.3 0.32.22.6 7 15 c4t1d0 104.62.4 5596.9 95.5 0.0 0.60.05.4 0 15 c5t1d0 109.92.2 5522.1 86.1 0.2 0.32.02.5 7 14 c4t2d0 104.61.9 5533.6 86.2 0.3 0.32.53.1 7 16 c5t2d0 109.22.7 5498.4 86.1 0.2 0.32.12.4 7 14 c4t3d0 105.32.9 5593.8 95.5 0.0 0.60.05.1 0 15 c5t3d0 57.81.9 3938.4 90.7 0.2 0.13.51.5 6 9 c4t5d0 50.82.3 3298.6 90.8 0.0 0.30.05.2 0 8 c5t4d0 105.02.6 5541.2 86.1 0.4 0.23.71.4 11 15 c5t5d0 90.82.3 6376.7 90.7 0.2 0.32.43.1 6 13 c5t6d0 87.41.8 6085.2 90.6 0.0 0.50.05.4 0 13 c5t7d0 104.22.4 5550.8 86.1 0.0 0.50.05.0 0 14 c6t0d0 106.82.4 5543.6 95.5 0.0 0.60.05.5 0 16 c6t1d0 105.42.5 5517.5 86.1 0.4 0.23.81.4 12 16 c6t2d0 106.62.4 5569.1 95.6 0.0 0.50.05.0 0 15 c6t3d0 107.22.2 5536.4 86.1 0.2 0.32.12.8 7 15 c6t4d0 61.22.4 4085.2 90.7 0.0 0.30.05.4 0 10 c6t5d0 70.31.8 5018.2 90.7 0.3 0.14.71.7 9 12 c6t6d0 0.8 203.3 12.3 25514.5 3.9 0.6 19.22.7 54 55 c8t5d0 0.00.00.00.0 0.0 0.00.00.0 0 0 c4t7d0 cpu us sy wt id 38 30 0 32 extended device statistics r/sw/s kr/s kw/s wait actv wsvc_t asvc_t %w %b device 0.00.00.00.0 0.0 0.00.00.0 0 0 fd0 2.22.5 64.2 35.2 0.0 0.03.30.9 0 0 c8t1d0 98.63.1 5441.3 110.3 0.0 0.60.05.9 0 16 c5t0d0 102.13.7 5392.7 110.2 0.0 0.50.04.3 0 13 c4t0d0 104.13.3 5390.7 110.4 0.0 0.50.05.0 0 15 c4t1d0 98.23.0 5437.3 110.2 0.0 0.50.05.1 0 14 c5t1d0 104.73.8 5437.3 104.5 0.0 0.50.04.8 0 15 c4t2d0 97.73.4 5481.1 104.6 0.0 0.60.06.0 0 16 c5t2d0 103.13.4 5468.4 104.6 0.0 0.60.05.2 0 15 c4t3d0 98.73.0 5415.2 110.3 0.0 0.50.05.1 0 14 c5t3d0 55.73.1 3883.4 93.7 0.1 0.12.02.5 4 8 c4t5d0 44.52.9 3141.2 93.6 0.0 0.30.05.5 0 7 c5t4d0 99.23.3 5464.0 104.5 0.4 0.24.21.5 12 15 c5t5d0 82.32.8 6119.3 93.4 0.0 0.50.06.4 0 14 c5t6d0 75.22.7 5601.1 93.4 0.1 0.41.74.8 3 13 c5t7d0 97.83.1 5458.8 104.5 0.0 0.50.05.2 0 14 c6t0d0 99.23.2 5441.5 110.2 0.0 0.60.05.8 0 16 c6t1d0 98.43.0 5475.7 104.6 0.3 0.43.03.5 8 17 c6t2d0 99.83.0 5434.4 110.1 0.0 0.50.05.1 0 14 c6t3d0 100.63.2 5453.9 104.6 0.0 0.60.05.5 0 15 c6t4d0 54.93.0 3878.1 93.5 0.1 0.21.54.2 3 9 c6t5d0 68.42.9 5128.3 93.5 0.2 0.33.14.2 6 13 c6t6d0 0.9 201.9 34.2 25338.0 3.8 0.5 18.92.6 51 52 c8t5d0 0.00.00.00.0 0.0 0.00.00.0 0 0 c4t7d0 On Sat, May 22, 2010 at 12:26 AM, Brandon High bh...@freaks.com wrote: On Fri, May 21, 2010 at 7:57 PM, Thomas Burgess wonsl...@gmail.com wrote: is 3 zfs recv's random? It might be. What do a few reports of 'iostat -xcn 30' look like? -B -- Brandon High : bh...@freaks.com ___ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss