Re: [Lustre-discuss] MDT overloaded when writting small files in large number

2008-12-08 Thread Jeff Darcy
Daire Byrne wrote: > In the past when we've had workloads with lots of small files where each > client/job has a unique dataset we have used disk image files on top of a > Lustre filesystem to store them. This file is then stored on a single OST and > so it reduces the overhead of going to the M

[Lustre-discuss] Misaligned buffers being handed to LND?

2008-10-22 Thread Jeff Darcy
Have any other LND authors (either at Sun or elsewhere) noticed recent Lustre versions passing iovs with non-page-aligned middle elements down to the LND layer? We've always seen plenty of misaligned start/end elements, but prior to 1.6.3 we never saw it with middle elements. We were advised

Re: [Lustre-discuss] MDT and MGS

2008-09-29 Thread Jeff Darcy
Brian J. Murrell wrote: > But you will notice that in the specific section 4.2.2.6 "Running > Multiple Lustre Filesystems" they do demonstrate setting up a separate > MGS: > Actually they demonstrate setting up a separate MDT to use an existing MGS. There's even a note that says "specify --mdt

Re: [Lustre-discuss] MDT and MGS

2008-09-29 Thread Jeff Darcy
Brian J. Murrell wrote: > This is _exactly_ the reason that we recommend a separate MGS for > installations that have even a notion of wanting more than one > filesystem. > I just checked, and the current Lustre operations manual (v1.14, updated September 19) does not seem to reflect that belie

Re: [Lustre-discuss] diskless booting & Debian

2008-08-12 Thread Jeff Darcy
Troy Benjegerdes wrote: > I see a relatively blank diskless booting wiki page at > > http://wiki.lustre.org/index.php?title=Diskless_Booting > > Are there any details on this, other than "University of Colorado at > Boulder has performed interesting work?" > > I have a couple of interests here.. >

[Lustre-discuss] Problems with NAT and/or version upgrade

2008-02-11 Thread Jeff Darcy
In our lab, we have several 108-node and 972-node systems, and we're using a small (four-node Opteron 2.6.16-27-0.9_lustre-1.6.0.1smp) Lustre server setup to provide some shared space for applications and such. Because there are so many client subnets involved and we don't want the server rout

[Lustre-discuss] Inconsistent data with Lustre 1.6.3

2008-01-29 Thread Jeff Darcy
We're running Lustre 1.6.3 and Linux 2.6.18 on our 972-node (5832-processor) machines, and we're seeing some interesting problems when we run executables from a Lustre filesystem. When we run 5000-processor jobs, we often see some - maybe only a few, maybe a couple of dozen - fail with illegal