Re: [OpenIndiana-discuss] ZFS; what the manuals don't say ...

Doug Hughes Tue, 23 Oct 2012 08:24:00 -0700

On 10/23/2012 11:08 AM, Robin Axelsson wrote:

On 2012-10-23 15:41, Doug Hughes wrote:
On 10/23/2012 8:29 AM, Robin Axelsson wrote:
Hi,
I've been using zfs for a while but still there are some questionsthat have remained unanswered even after reading the documentationso I thought I would ask them here.
I have learned that zfs datasets can be expanded by adding vdevs.Say that you have created say a raidz3 pool named "mypool" with thecommand
# zpool create mypool raidz3 disk1 disk2 disk3 ... disk8

you can expand the capacity by adding vdevs to it through the command

# zpool add mypool raidz3 disk9 disk10 ... disk16
The vdev that is added doesn't need to have the same raid/mirrorconfiguration or disk geometry, if I understand correctly. It willmerely be dynamically concatenated with the old storage pool. Thedocumentations says that it will be "striped" but it is not so clearwhat that means if data is already stored in the old vdevs of the pool.
Unanswered questions:
* What determines _where_ the data will be stored on a such a pool?Will it fill up the old vdev(s) before moving on to the new one orwill the data be distributed evenly?* If the old pool is almost full, an even distribution will beimpossible, unless zpool rearranges/relocates data upon adding thevdev. Is that what will happen upon adding a vdev?* Can the individual vdevs be read independently/separately? If saythe newly added vdev faults, will the entire pool be unreadable orwill I still be able to access the old data? What if I took asnapshot before adding the new vdev?
* Can several datasets be mounted to the same mount point, i.e. canmultiple "file system"-datasets be mounted so that they (the root ofthem) are all accessed from exactly the same (POSIX) path andsubdirectories with coinciding names will be merged? The purpose ofthis would be to seamlessly expand storage capacity this way justlike when adding vdevs to a pool.* If that's the case how will the data be distributed/allocated overthe datasets if I copy a data file to that path?
Kind regards
Robin.
*) yes, you can dynamically add more disks and zfs will just startusing them.
*) zfs stripes across all vdevs evenly, as it can.
*) as your old vdev gets full, zfs will only allocate blocks to thenewer, less full vdev*) since it's a stripe across vdevs (and they should all be raidz2 orbetter!) if one vdev fails, your filesystem will be unavailable. Theyare not independent unless you put them in a separate pool.*) you cannot have overlapping /mixed filesystems at exactly the sameplace, however it is perfectly possible to have e.g. /export be onrootpool, /export/mystuff on zpool1 and /export/mystuff/morestuff beon zpool2.
The unasked question is "If I wanted the vdevs to be equallybalanced, could I?". The answers is a qualified yes. What you wouldneed to do is reopen every single file, buffer it to memory, thenwrite every block out again. We did this operation once. It meansthat all vdevs will roughly have the same block allocation when youare done.
Do you happen to know how that's done in OI? Otherwise I would have tomove each file one by one to a disk location outside the dataset andthen move it back or zfs send the dataset to another pool of at leastequal size and then zfs receive it back to the expanded pool.

you don't have to move it, you just have to open, read it into memory,seek back to the beginning, and write it out again. Rewriting thoseblocks will take care of it since ZFS is copy-on-write. You will need tobe wary of your snapshots during this process since all files will berewritten and you'll double your space consumption.


(basically a perl, python, or other similar script could do this)


_______________________________________________
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss

Re: [OpenIndiana-discuss] ZFS; what the manuals don't say ...

Reply via email to