Re: [Lustre-discuss] Mag Gam

2013-07-20 Thread Mag Gam
http://gonularimetal.com/vmxs/mej.pbrkvf Mag Gam 7/21/2013 7:23:10 AM ___ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss

[Lustre-discuss] integrating client into the kernel

2011-05-28 Thread Mag Gam
Are there any plans to integrate the Lustre client into the kernel?

Re: [Lustre-discuss] Lustre HA Experiences

2011-05-24 Thread Mag Gam
What was your conclusion? What is a good HA solution with Lustre? I am hoping SNS will be a big push for the next year On Wed, May 4, 2011 at 5:16 PM, Jason Rappleye wrote: > > On May 4, 2011, at 10:05 AM, Charles Taylor wrote: > >> >> We are dipping our toes into the waters of Lustre HA using >

Re: [Lustre-discuss] Client Kernel panic - not syncing. Lustre 1.8.5

2011-05-24 Thread Mag Gam
Stick with 1.6.6, it's a great release! BTW, why did you decide to upgrade to 1.8.x? Is there a feature you are looking for? On Fri, May 20, 2011 at 2:48 PM, Aaron Everett wrote: > Thanks for the tip. I've already updated with the LU-286 patch, but I'll > build new rpms with both patches and rol

Re: [Lustre-discuss] windows native client

2011-05-04 Thread Mag Gam
Hooray for Oracle! Not. On Wed, Apr 20, 2011 at 10:57 AM, Colin Faber wrote: > This port is no longer available from Oracle. > > -cf > > > On 04/20/2011 07:30 AM, hua zhou wrote: >> >> Hi, >> >> From the website: >> (http://wiki.lustre.org/index.php/Windows_Native_Client#Download_and_S...) >> >>

Re: [Lustre-discuss] Update of PDSI filesystem stats data

2011-03-04 Thread Mag Gam
I hope one of the new features you are going to implement is SNS :-) On Wed, Feb 23, 2011 at 8:37 PM, Andreas Dilger wrote: > When looking at how to implement features for Lustre (which I'm doing a lot > of recently :-) I sometimes consult the PDSI filesystem statistics data at > http://www.pd

Re: [Lustre-discuss] Lustre 2.1 Release

2011-03-04 Thread Mag Gam
This is really great, and thanks for keeping this open! For aspiring software engineers at my school it would be valuable to dial in to the calls and hear professionals speak. On Fri, Feb 25, 2011 at 8:18 AM, Diego Moreno wrote: > Hi Peter, > > That's great news! It's really interesting to know about

Re: [Lustre-discuss] [Lustre-devel] List of Lustre Projects

2010-11-05 Thread Mag Gam
Thanks for putting this together. Are there any plans for SNS? On Mon, Nov 1, 2010 at 3:43 PM, Andreas Dilger wrote: > On 2010-11-01, at 10:21, James Simmons wrote: >>> due to a number of people asking me for Lustre projects to work on, I >>> created a list of projects that do not currently ha

Re: [Lustre-discuss] OpenSFS Information Meeting In Parallel With SC10

2010-11-05 Thread Mag Gam
Stakeholders: Please consider making SNS a top priority. After waiting several years, we (and I am sure many others) are migrating to other, more fault-tolerant filesystems. On Thu, Oct 28, 2010 at 4:25 PM, Norman Morse wrote: > lustre-discuss@lists.lustre.org > > Inaugural OpenSFS Meeting At SC10 To P

Re: [Lustre-discuss] Announce: Lustre 2.0.0 is available!

2010-09-09 Thread Mag Gam
Thanks. Either way, congrats, and keep up the good work! On Thu, Sep 9, 2010 at 11:37 AM, Andreas Dilger wrote: > On 2010-09-09, at 4:53, Mag Gam wrote: >> For the future releases, will the client ever be part of the stock >> kernel? > > There are no current plans to do

Re: [Lustre-discuss] Announce: Lustre 2.0.0 is available!

2010-09-09 Thread Mag Gam
This is great news! For future releases, will the client ever be part of the stock kernel? What is the status of SNS? This is an important feature for many people, and it seems people are shying away from Lustre and going to other solutions solely based on this feature. On Thu, Aug 26,

Re: [Lustre-discuss] Fwd: Re: Announce: Lustre 1.8.4 is available!

2010-09-01 Thread Mag Gam
I am curious, is there an Oracle roadmap similar to what Sun was providing? On Fri, Aug 27, 2010 at 11:29 AM, yangsheng wrote: > >> >> >> >> Original Message >> Subject:      Re: [Lustre-discuss] Announce: Lustre 1.8.4 is available! >> Date:         Thu, 26 Aug 2010 15:00:46 -0

[Lustre-discuss] Lustre RAID1 SNS

2010-05-06 Thread Mag Gam
I was looking through this: http://wiki.lustre.org/images/f/ff/OST_Migration_RAID1_SNS.pdf Is this actually a work in progress, or a proposal? This is perhaps the feature of the decade for Lustre :-)

Re: [Lustre-discuss] Announce: Lustre 1.8.3 is available!

2010-05-04 Thread Mag Gam
What is the status of merging the code into the Linux mainline kernel? Or even of getting the client into the kernel tree? On Mon, May 3, 2010 at 10:19 PM, Norberto Meijome wrote: > On 1 May 2010 09:49, Terry Rutledge wrote: >> >> >> * RHEL5.4 kernel update >> Kernel update to OEL5.4 2.6.16-164.11.1

Re: [Lustre-discuss] Future of LusterFS?

2010-04-26 Thread Mag Gam
Speaking of the future, is there any more news about SNS? I think that's the only thing Lustre is missing to make it "production" ready and not just for research labs. On Fri, Apr 23, 2010 at 12:07 PM, Stuart Midgley wrote: > Yes, we suffer hardware failures.  All the time.  That is sort of the

Re: [Lustre-discuss] disappeared data from OST

2010-02-16 Thread Mag Gam
Peter: I am glad you mentioned this. What is an appropriate backup tool for Lustre? I know 2.x will introduce ChangeLogs, but for people using 1.6.x, what is a good tool? I suppose 'rsync', or DRBD for realtime? What do you recommend? On Mon, Feb 15, 2010 at 6:30 PM, Peter Grandi wrote

Re: [Lustre-discuss] High difference in I/O network traffic in lustre client

2010-02-01 Thread Mag Gam
How many OSSs and OSTs do you have? What type of hardware are they running on? What type of network connection? Which OSS is the file you are trying to access on? Are the files striped? What On Mon, Feb 1, 2010 at 4:44 AM, Lex wrote: > Hi guys > > In effort to improve our storage system perf

Re: [Lustre-discuss] No space left on device for just one file

2010-01-11 Thread Mag Gam
Can you paste us the file name? I want to see if we can touch something like this. On Fri, Jan 8, 2010 at 1:36 PM, Michael Robbert wrote: > I have a user that reported a problem creating a file on our Lustre > filesystem. When I investigated I found that the problem appears to be unique > to ju

Re: [Lustre-discuss] Lustre reading file and RAM

2010-01-02 Thread Mag Gam
That's what I am doing now. There must be cache on the MDS side on 1.6, no? On Sat, Jan 2, 2010 at 1:06 AM, Andreas Dilger wrote: > On 2010-01-01, at 15:38, Mag Gam wrote: >> >> We are running 1.6.7.2 OSS and Clients. If there are 50 clients >> accessing the same 10gig file

Re: [Lustre-discuss] MD1000 woes and OSS migration suggestions

2010-01-01 Thread Mag Gam
Does HP offer something similar to what you are saying Wojciech? It sounds very impressive. On Wed, Dec 30, 2009 at 8:55 PM, Wojciech Turek wrote: > Hi Nick, > > I don't think you should invest into new MD1000 brick just to make it > working in split mode. > Split mode doesn't give you much, exc

[Lustre-discuss] Lustre reading file and RAM

2010-01-01 Thread Mag Gam
We are running 1.6.7.2 OSS and clients. If there are 50 clients accessing the same 10 GB file, would the file go into the memory of the OSS, or does each access start over? My OSS has 32 GB of memory. TIA

Re: [Lustre-discuss] Lustre 1.6.7.3

2009-12-11 Thread Mag Gam
nded 1.6.7.2+" means that the patch is actually > *in* the 1.6.7.2 release, not that it was landed *after* the 1.6.7.2 > release. > > Mag Gam wrote: >> >> Will there ever be a 1.6.7.3 release? It seems some bugs from 1.6.7.2 >> have been addressed. >> >>

Re: [Lustre-discuss] Adaptive Timeouts in Lustre 1.6.x vs. 1.8.x

2009-12-10 Thread Mag Gam
Bump. I too am curious about this. On Fri, Nov 20, 2009 at 11:04 PM, Andreas Dilger wrote: > On 2009-11-20, at 04:00, Alvaro Aguilera wrote: >> I was wondering if some could explain to me the differences -if any- >> in the implementation of adaptive timeouts in Lustre 1.8.x compared >> to Lustr

[Lustre-discuss] Lustre 1.6.7.3

2009-12-10 Thread Mag Gam
Will there ever be a 1.6.7.3 release? It seems some bugs from 1.6.7.2 have been addressed. 17336 18289 19453 19514 19539 19584 19586 19601 19697 19728 19754 19759 19788 20491 Is it best just to patch 1.6.7.2 or wait for 1.6.7.3?

Re: [Lustre-discuss] WARNING: short read while accessing file >4GB on 32-bit client

2009-12-09 Thread Mag Gam
Thanks for letting us know. Is there a website or newsgroup other than bugzilla to show us these types of warnings and bugs and related patch/bugzilla entry? On Wed, Dec 9, 2009 at 6:16 PM, Johann Lombardi wrote: > Hi all, > > A bug impacting 32-bit lustre clients has been identified in the mo

[Lustre-discuss] client I/O

2009-12-04 Thread Mag Gam
Is it possible to figure out which client is using the most I/O? We have 8 OSSs and 200 clients, and it seems 5 to 6 clients are taking up all the bandwidth. I am trying to figure out which ones they are... TIA

Re: [Lustre-discuss] Lustre slow file open close on RHEL5

2009-11-15 Thread Mag Gam
Can you check if you have readahead enabled? http://manual.lustre.org/manual/LustreManual16_HTML/LustreProc.html#50557055_78950 This could probably be your cause. On Thu, Nov 12, 2009 at 3:03 PM, Wojciech Turek wrote: > Hi, > > Cluster running Lustre 1.6.6 > Opening and closing files takes lon

[Lustre-discuss] Lustre and NFS integration

2009-11-11 Thread Mag Gam
At our lab we deal with a lot of images. Each image is about 4-6 GB on average. We want to use Lustre for processing the images and place them on more reliable storage such as 3140/NetApp for backups. I was wondering if anyone has a scheme they are using to easily integrate the 2 technologies together so

Re: [Lustre-discuss] Removing OSTs

2009-11-03 Thread Mag Gam
there is. > > -Original Message- > From: lustre-discuss-boun...@lists.lustre.org > [mailto:lustre-discuss-boun...@lists.lustre.org] On Behalf Of Mag Gam > Sent: Monday, November 02, 2009 7:15 PM > To: lustre-discuss@lists.lustre.org > Subject: [L

[Lustre-discuss] Removing OSTs

2009-11-02 Thread Mag Gam
Is it possible to remove an OST permanently on 1.8.x?

Re: [Lustre-discuss] Support for vanilla kernels in lustre servers

2009-10-31 Thread Mag Gam
If I were to deploy a system now and wanted to compile the kernel myself, what kernel would you recommend? I prefer using 1.6.7.2 because of its stability... On Thu, Oct 29, 2009 at 1:32 PM, Andreas Dilger wrote: > On 2009-10-29, at 06:49, Ramiro Alba Queipo wrote: >> I am using (now testing) lust

Re: [Lustre-discuss] [Lustre-Performance] Lustre performance on SLES 11 x86-64

2009-10-30 Thread Mag Gam
Check the network connections. What kind of network are you on? 10/100 or 100/1000? Do you see any dropped packets? On Fri, Oct 30, 2009 at 7:49 AM, udara weerapperuma wrote: > > Hi All, > > > I have SLES 11 x86-64  and lustre v 1.8.1.1 installed in it. This use as MDS > and OSS. > > /dev/sda

Re: [Lustre-discuss] filesystem corruption

2009-09-06 Thread Mag Gam
So, I have to ask: why disable write-back cache on the controller? 2009/9/4 恩强周 : > It's really dangerous! e2fsck  bring it back. > > 2009/9/3 Peter Kjellstrom >> >> On Thursday 03 September 2009, 恩强周 wrote: >> > hi all, >> > >> > I have lustre  corrupted when a OSS powered off by ipmi accident

[Lustre-discuss] mds and ost question on older hardware

2009-08-31 Thread Mag Gam
Would running an MDS and OST on 32-bit hardware hamper performance or scalability? These boxes all have 4GB of memory.

Re: [Lustre-discuss] Upgrading 1.6.7.1 -> 1.8.1 Documentation?

2009-08-30 Thread Mag Gam
On Fri, Aug 28, 2009 at 2:54 PM, Brian J. Murrell wrote: > On Fri, 2009-08-28 at 20:41 +0200, Nick Jennings wrote: >> Hi everyone, > > Hi, > >>  I was wondering if there was any documentation specific to upgrading >> lustre 1.6.x branch to 1.8.x branch. > > rpm -Uvh ... > >> All the docs I could fi

Re: [Lustre-discuss] NFS vs Lustre

2009-08-30 Thread Mag Gam
Well said. This should be on the Wiki :-) On Sat, Aug 29, 2009 at 2:15 PM, John K. Dawson wrote: > Lee, > > Thanks for posting this. I found the background and perspective very > interesting. > > John > > John K. Dawson > jkdaw...@gmail.com > 612-860-2388 > > On Aug 29, 2009, at 12:56 PM, Lee Wa

Re: [Lustre-discuss] NFS vs Lustre

2009-08-29 Thread Mag Gam
Lustre is a parallel filesystem, whereas NFS is not. The advantage of NFS is that it is native to many Unix systems and is widely available. The advantage of Lustre is its performance. GPFS is a parallel filesystem very similar to Lustre, but it is backed by IBM. It runs on AIX and Linux. It's good but costl

Re: [Lustre-discuss] MDS refuses connections (no visible reason)

2009-08-18 Thread Mag Gam
Just curious: if you didn't compile your own kernel, how do you apply this patch? Is our only option to upgrade via RPMs, or is there another way to apply the patch? On Tue, Aug 18, 2009 at 4:27 AM, Patricia Santos Marco wrote: > Our MDT have lustre 1.6.7, I see in this message > http://lists.lus

[Lustre-discuss] btree

2009-08-16 Thread Mag Gam
I know the underlying filesystem for Lustre is ext3 with extents, but I was wondering: does Lustre use B-trees in its logical layout? That is, when a client accesses its data, does it actually use a B-tree for access/retrieval? As a matter of fact, do most clustered parallel file systems use B-trees

Re: [Lustre-discuss] lustre performance degrade on SuSE 11

2009-08-12 Thread Mag Gam
R   TX-OK TX-ERR TX-DRP TX-OVR Flg > eth0   1500   0 1741587      0      0      0 2002121      0      0      0 BMRU > > > cheers, > __ > tharindu > > -Original Message- > From: Mag Gam [mailto:magaw...@gmail.com] > Sent: Wednesday, August 12, 2009 4:40 PM > To: T

[Lustre-discuss] applications question

2009-08-12 Thread Mag Gam
Since we have been using Lustre for a while now, we would like to port some of our applications to use the I/O features of Lustre. Currently, all of the applications are very memory intensive, meaning they take in the entire dataset at one time, load it into memory (64GB), and process it. I was wondering if there are some

Re: [Lustre-discuss] lustre performance degrade on SuSE 11

2009-08-12 Thread Mag Gam
Is your network set up properly? Can you scp/ftp a file to your OSS as a test? Check that your network interfaces are properly connected (autoneg, 1000Full). Make sure you aren't getting any packet loss (netstat -i). Check that first and see how it goes. On Wed, Aug 12, 2009 at 3:37 AM, Tharindu R
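
The packet-loss check suggested above (netstat -i) can be scripted. This is a minimal sketch in Python, not something posted in the thread. It assumes the Linux net-tools column layout, where a header row starting with "Iface" names the RX-ERR/RX-DRP/RX-OVR and TX-ERR/TX-DRP/TX-OVR counters. The function name is hypothetical.

```python
def interfaces_with_errors(netstat_output):
    """Return {iface: {counter: value}} for every non-zero error, drop,
    or overrun counter found in the text printed by `netstat -i`."""
    counters = ("RX-ERR", "RX-DRP", "RX-OVR", "TX-ERR", "TX-DRP", "TX-OVR")
    header = None
    problems = {}
    for line in netstat_output.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "Iface":
            # Column names come from the header, so layouts with or
            # without the "Met" column are both handled.
            header = fields
            continue
        if header is None or len(fields) != len(header):
            # Skips the "Kernel Interface table" banner and odd rows.
            continue
        row = dict(zip(header, fields))
        bad = {}
        for name in counters:
            value = row.get(name, "0")
            if value.isdigit() and int(value) > 0:
                bad[name] = int(value)
        if bad:
            problems[row["Iface"]] = bad
    return problems
```

Feed it the captured text of `netstat -i` from each OSS and client; an empty result means no interface reported errors or drops since its counters were last reset.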

Re: [Lustre-discuss] Geographic cluster

2009-08-11 Thread Mag Gam
How does this replication feature compare to rsync in performance and ease of use? On Sat, Aug 8, 2009 at 2:42 AM, Andreas Dilger wrote: > On Aug 07, 2009  20:24 -0400, Brian J. Murrell wrote: >> The other thing is that I don't know that our replication feature is >> going to be bi-directional.

[Lustre-discuss] Multiple lustre filesystems

2009-08-11 Thread Mag Gam
At our lab we generate fluid flow simulation data overnight and then use the data for other analysis. It's about 30TB and we would like to back up the data to another Lustre filesystem. In addition to backup, I would also like to get high availability. So, I was thinking of this

Re: [Lustre-discuss] Problems upgrading from 1.6 to 1.8

2009-08-10 Thread Mag Gam
Let's wait a couple of days and check back :-) I appreciate you giving us feedback. On Mon, Aug 10, 2009 at 5:02 AM, Christopher J.Walker wrote: > Christopher J.Walker wrote: >> >> Mag Gam wrote: >>> >>> Thanks for the response Chris. >>> >>

Re: [Lustre-discuss] Problems upgrading from 1.6 to 1.8

2009-08-05 Thread Mag Gam
Thanks for the response Chris. On Wed, Aug 5, 2009 at 5:20 PM, Andreas Dilger wrote: > On Aug 05, 2009  18:45 +0100, Christopher J.Walker wrote: >> Aug  5 13:53:01 se02 kernel: LustreError: >> 2668:0:(lib-move.c:95:lnet_try_match_md()) Matching packet from >> 12345-10.1.4@tcp, match 1449 len

[Lustre-discuss] size of OST

2009-08-05 Thread Mag Gam
I know the largest possible OST is 8TB, but is that a recommended size? I want to avoid maintaining many objects, so I was thinking of creating 10x8TB OSTs on 10 OSSs. I was wondering what kind of problems can arise. TIA

Re: [Lustre-discuss] Problems upgrading from 1.6 to 1.8

2009-08-05 Thread Mag Gam
Were you able to fix this? On Fri, Jul 17, 2009 at 11:38 AM, Christopher J.Walker wrote: > In order to avoid occasional crashes on our 1.6.4.3 OSSs, we have just > upgraded our MDS and OSSs from 1.6.4.3 to lustre 1.8.0.1. Unfortunately, > we are having problems writing files  - we've tried from

[Lustre-discuss] Moving away from bugzilla

2009-08-05 Thread Mag Gam
Are there any plans to move away from Bugzilla for issue tracking? I have been lurking around https://bugzilla.lustre.org for several months now and I still find it very hard to use. Do others have the same feeling? Or is there a setting or a preferred filter to see all the new bugs in the 1.8 series?

Re: [Lustre-discuss] Lustre featured on podcast (HT: Andreas Dilger)

2009-08-04 Thread Mag Gam
Either way, Andreas, we appreciate the talk. My master's OS design class talked about this for 45 mins today :-) You are a celebrity. On Tue, Aug 4, 2009 at 2:35 PM, Brian J. Murrell wrote: > On Tue, 2009-08-04 at 11:17 -0600, Andreas Dilger wrote: >> >> To be more clear - TCP isn't used on IB, Elan

Re: [Lustre-discuss] Lustre featured on podcast (HT: Andreas Dilger)

2009-08-04 Thread Mag Gam
Palen [bro...@umich.edu] > Sent: 08/03/2009 08:35 PM AST > To: Mag Gam > Cc: lustre-discuss discuss > Subject: Re: [Lustre-discuss] Lustre featured on podcast (HT: Andreas Dilger) > > > > http://en.wikipedia.org/wiki/Nagle%27s_algorithm > > Looks like you intentio

Re: [Lustre-discuss] Lustre featured on podcast (HT: Andreas Dilger)

2009-08-03 Thread Mag Gam
ot 99% header/crc data.  Sounds like >> a way to make latency bad. >> >> Brock Palen >> www.umich.edu/~brockp >> Center for Advanced Computing >> bro...@umich.edu >> (734)936-1985 >> >> >> >> On Aug 3, 2009, at 8:20 PM, Mag Gam wrot

Re: [Lustre-discuss] Lustre featured on podcast (HT: Andreas Dilger)

2009-08-03 Thread Mag Gam
> packets so they are not 99% header/crc data.  Sounds like a way to make > latency bad. > > Brock Palen > www.umich.edu/~brockp > Center for Advanced Computing > bro...@umich.edu > (734)936-1985 > > > > On Aug 3, 2009, at 8:20 PM, Mag Gam wrote: > >> Ver

Re: [Lustre-discuss] Lustre featured on podcast (HT: Andreas Dilger)

2009-08-03 Thread Mag Gam
Very nice. At 15:54, what is "Nagle"? He didn't say anything about SNS, but ChangeLogs seem very promising! On Mon, Aug 3, 2009 at 8:55 AM, Brock Palen wrote: > Thanks to Andreas for taking an hour out to talk with Jeff Squyres and > myself (Brock Palen) about the Lustre cluster filesystem on o

Re: [Lustre-discuss] Alternative to DRBD

2009-07-21 Thread Mag Gam
Can't wait until 2.0 and SNS. I think that's the only die-hard feature Lustre is really missing. On Tue, Jul 21, 2009 at 10:19 AM, wrote: > Michael, > > - "Michael Di Domenico" wrote: >> > We are currently using e2scan and distributed rsyncs across the >> compute farm to do the same thing wi

[Lustre-discuss] Alternative to DRBD

2009-07-20 Thread Mag Gam
Other than DRBD and hot standby, are there any other alternatives? We want to have a redundant copy of our data and were wondering if rsync is the only way to accomplish this.

Re: [Lustre-discuss] Lustre compared to Gluster (Mag Gam)

2009-07-18 Thread Mag Gam
I was first interested in the simplicity and redundancy, but it seems the product needs to become as mature as Lustre. Can't wait until Lustre's SNS ;-) On Fri, Jul 17, 2009 at 12:41 PM, Jordan Mendler wrote: >> We have been hearing a lot of news recently about "Gluster". Does >> anyone know how

[Lustre-discuss] Lustre compared to Gluster

2009-07-16 Thread Mag Gam
We have been hearing a lot of news recently about "Gluster". Does anyone know how it compares to Lustre? Can it do the same things as Lustre? It seems it has built-in SNS. Anyone know?

[Lustre-discuss] Lustre SNS and Hadoop

2009-07-10 Thread Mag Gam
Does anyone know the status of SNS and Lustre? I know it was postponed until further notice, but has anyone heard any other news? Also, since SNS isn't going to be ready for primetime, is anyone using Hadoop and Lustre? Since Hadoop has redundancy and failover, won't that make more sense with Lustre?

Re: [Lustre-discuss] MDT crash: ll_mdt at 100%

2009-07-07 Thread Mag Gam
So, are you all good now? Thanks for the explanation, BTW! On Tue, Jul 7, 2009 at 7:42 AM, Thomas Roth wrote: > Hi, > > Mag Gam wrote: >> Exactly the symptoms I had. How long were you running this for?  Also, >> how easy is it for you to reproduce this error? >

Re: [Lustre-discuss] MDT crash: ll_mdt at 100%

2009-07-03 Thread Mag Gam
://lists.lustre.org/pipermail/lustre-discuss/2009-April/010167.html On Fri, Jul 3, 2009 at 10:44 AM, Thomas Roth wrote: > > > Mag Gam wrote: >> http://lists.lustre.org/pipermail/lustre-discuss/2009-March/009928.html >> >> Look familiar? >> > Yes, I've read t

Re: [Lustre-discuss] MDT crash: ll_mdt at 100%

2009-07-03 Thread Mag Gam
or the moment the problem seems to have been fixed by shutdown, > fs-check and writeconf of all servers. > However, I don't want to do that every other week ... > > Thanks a lot for your help, > Thomas > > Mag Gam wrote: >> Hi Tom: >> >> There was a known is

Re: [Lustre-discuss] MDT crash: ll_mdt at 100%

2009-07-02 Thread Mag Gam
Hi Tom: There was a known issue with 1.6.7.1. What I did was downgrade to 1.6.6, and everything worked well. Or you can try upgrading, but there is something definitely wrong with that version... If you like, I can help you offline. I should be free this weekend (I have a long weekend) On Thu, Jul 2,

Re: [Lustre-discuss] missing ost's?

2009-06-16 Thread Mag Gam
Do you have many small files? On Tue, Jun 16, 2009 at 8:58 PM, Michael Di Domenico wrote: > On Tue, Jun 16, 2009 at 8:25 PM, Michael Di > Domenico wrote: >> I have a small lustre test cluster with eight OST's running.  The >> servers were shut off over the weekend, upon turning them back on and

[Lustre-discuss] mmap issue with software

2009-05-29 Thread Mag Gam
Hello All: I have been trying to run MonetDB ( http://monetdb.cwi.nl/ ) with my data stored on a Lustre filesystem (close to 1TB). The software works but it keeps crashing after some hours. I spoke to the MonetDB people and they say something is up with Lustre and mmap(). I was wondering if any

[Lustre-discuss] mmap()

2009-05-13 Thread Mag Gam
I have an application for which I would like to use Lustre as the backing storage. However, the application (MonetDB) uses mmap(). Would the application have any problems using Lustre as its backing storage? TIA

Re: [Lustre-discuss] tcp network load balancing understanding lustre 1.8

2009-05-10 Thread Mag Gam
Thanks for the screen shot Arden. What is the maximum # of slaves you can have on a bonded interface? On Sun, May 10, 2009 at 12:15 AM, Arden Wiebe wrote: > > Bond0 knows which interface to utilize because all the other eth0-5 are > designated as slaves in their configuration files.  The manu

Re: [Lustre-discuss] tcp network load balancing understanding lustre 1.8

2009-05-09 Thread Mag Gam
I second the responses. Go with Native OS bonding, Linux in this case. Makes life so much easier... Good luck On Thu, May 7, 2009 at 8:20 PM, Isaac Huang wrote: > On Thu, May 07, 2009 at 03:02:49PM -0700, Klaus Steden wrote: >> .. >> I didn't even touch Lustre bonding, because as you both

Re: [Lustre-discuss] "Building Lustre, Protocol Basics, and Debugging" presentation

2009-05-08 Thread Mag Gam
Thanks for the response, Johann. On Thu, May 7, 2009 at 8:54 AM, Johann Lombardi wrote: > On May 7, 2009, at 1:07 PM, Mag Gam wrote: >> >> What is the purpose of these extra kernel patches? > > Add things (ability to register journal callbacks, export some symbols which

[Lustre-discuss] "Building Lustre, Protocol Basics, and Debugging" presentation

2009-05-07 Thread Mag Gam
Hello All, I was intrigued by this slide show http://wiki.lustre.org/images/1/1f/Lug_johann.pdf I have a question on slide 5 Kernel patches needed > Re-add journal callback support in jbd > Jbd fixes & statistics > scsi disk statistics – could be removed if blktrace enabled > Export some sy

Re: [Lustre-discuss] failure rates

2009-04-26 Thread Mag Gam
Hi John: From our experience (1 year), we simply love Lustre. A couple of lessons I learned regarding stability: 1) Always have the latest e2fsprogs 2) I recommend not running the latest available version 3) Run on good hardware. Test your hardware with iozone and bonnie for 72 hours straig

[Lustre-discuss] Lustre and ZFS

2009-04-17 Thread Mag Gam
By looking at some slides on the wiki I noticed there is a lot of activity for using zfs as the underlying file system for Lustre on Linux which is great. But, I was under the impression ZFS or its ideas would never be ported to Linux because of license compatibility issues (CDDL and GPL). My ques

Re: [Lustre-discuss] Clarification on DDN performance best practices

2009-04-16 Thread Mag Gam
stre kernel. > > - Kit > > ----- Original Message - > From: Mag Gam > To: Kit Westneat > Cc: Peter Grandi ; List Lustre discussion > > Sent: Wed Apr 15 20:11:41 2009 > Subject: Re: [Lustre-discuss] Clarification on DDN performance best practices > > What

Re: [Lustre-discuss] Clarification on DDN performance best practices

2009-04-15 Thread Mag Gam
What kernel does DDN use for its products? Or is that a closed secret? On Mon, Apr 13, 2009 at 10:17 AM, Kit Westneat wrote: > >> As to this, the tier are organized as something similar to 8+2 RAID6. >> For 8-way data, which is the unit? Sectors? Blocks? In other words, what >> is the stripe dat

Re: [Lustre-discuss] 1.6.7

2009-04-11 Thread Mag Gam
Hi John: Yes, I think it was pulled out. There was a serious issue with it. On Fri, Apr 10, 2009 at 8:14 PM, Aaron Porter wrote: > On Fri, Apr 10, 2009 at 3:38 PM, John White wrote: >> Hey Folks, >>        So I'm not sure if this is the proper forum, but.. does anyone know >> what happened to

Re: [Lustre-discuss] WARNING: Potential directory corruptions on the MDS with 1.6.7

2009-04-09 Thread Mag Gam
Is there a Bugzilla entry for 1.6.7 where we can see all the patches available? I would like to test out 1.6.7 but patch it myself as a learning experience. Also, I want to build a hardened version of 1.6.7 and possibly distribute it. On Thu, Apr 9, 2009 at 5:10 AM, Andreas Dilger wrote: > On

Re: [Lustre-discuss] files in lost+found

2009-04-07 Thread Mag Gam
I have tried "chattr -i" and then unlink, but I still get out of bound errors. Any other ideas? TIA On Sun, Apr 5, 2009 at 1:44 PM, Andreas Dilger wrote: > On Apr 04, 2009 22:47 -0400, Mag Gam wrote: >> I already did a e2fsck. The problem is on my clients I see couple of >

Re: [Lustre-discuss] files in lost+found

2009-04-04 Thread Mag Gam
er wrote: > On Apr 04, 2009 18:58 -0400, Mag Gam wrote: >> I see, so there is no easy way to recover on the MGS. I have a good >> idea who the user is for these files but the file names are very hard >> to decipher. They have names like 543434. I am not sure what file that >

Re: [Lustre-discuss] files in lost+found

2009-04-04 Thread Mag Gam
04, 2009 09:36 -0400, Mag Gam wrote: >> I have over 52k files in my lost+found of my MGS (not my OSTs). I am >> not sure what tool I need to use to recover these files. Should i use >> lfsck or ll_recover_lost_found_objs? > > That is only for the OSTs. Unfortunate

[Lustre-discuss] files in lost+found

2009-04-04 Thread Mag Gam
I have over 52k files in the lost+found of my MGS (not my OSTs). I am not sure what tool I need to use to recover these files. Should I use lfsck or ll_recover_lost_found_objs? TIA

Re: [Lustre-discuss] lustre knowledge base

2009-03-31 Thread Mag Gam
Yes, who do we speak to in order to get lots of that stuff removed? Most of the questions on the newsgroups are redundant; it would be very nice to put a FAQ/KB on the Wiki. On Tue, Mar 31, 2009 at 7:28 PM, Aaron Porter wrote: > On Tue, Mar 31, 2009 at 5:09 AM, Mag Gam wrote: >> >> Any

[Lustre-discuss] lustre knowledge base

2009-03-30 Thread Mag Gam
Does anyone know if the KB is still being maintained? If so, what is the URL for it? TIA

Re: [Lustre-discuss] Delete file entries from Metadata server

2009-03-28 Thread Mag Gam
This data is really not important. When I try to remove it, I keep getting "Numerical result out of range". Is there something else I can do? On Wed, Mar 25, 2009 at 12:50 PM, Brian J. Murrell wrote: > On Wed, 2009-03-25 at 11:29 -0500, Robert Olson wrote: >> I thought I had read at some poi

Re: [Lustre-discuss] Delete file entries from Metadata server

2009-03-25 Thread Mag Gam
Thank you. I will try to recover them. I will have to wait for the weekend to do that because I can't afford to take down an OST during work hours. TIA On Wed, Mar 25, 2009 at 5:25 AM, Daire Byrne wrote: > Mag, > > - "Mag Gam" wrote: > >> To do the ll_rec

Re: [Lustre-discuss] Delete file entries from Metadata server

2009-03-24 Thread Mag Gam
To run ll_recover_lost_found_objs, can we have everything mounted? Or is it recommended that we unmount everything? Also, what should we do when we see ? for the file attributes? Most likely these files are gone; how can we remove them from the system altogether? TIA On Wed, Mar 11, 2009 at 11:

Re: [Lustre-discuss] Removing a filesystem

2009-03-23 Thread Mag Gam
Thanks Brian. This answers a lot of questions. Thanks for the great insight. On Mon, Mar 23, 2009 at 8:45 AM, Brian J. Murrell wrote: > On Sat, 2009-03-21 at 02:22 -0400, Mag Gam wrote: >> in other words, reboot all of your clients? > > No. Not so much. If clients have been se

Re: [Lustre-discuss] Read-only file system

2009-03-23 Thread Mag Gam
't important files, but I am not sure how to remove them. Any thoughts? TIA On Mon, Mar 23, 2009 at 8:42 AM, Brian J. Murrell wrote: > On Sat, 2009-03-21 at 05:26 -0700, Mag Gam wrote: >> We have been experiencing problems recently, where our Lustre >> filesystem is becoming read

Re: [Lustre-discuss] Read-only file system

2009-03-21 Thread Mag Gam
First time ever, # cat health_check device lfs002-MDT reported unhealthy NOT HEALTHY I have never seen this. On Sat, Mar 21, 2009 at 5:26 AM, Mag Gam wrote: > We have been experiencing problems recently, where our Lustre > filesystem is becoming read-only (we can't even s
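
The health_check output quoted above can be watched from a script. This is a minimal sketch in Python, not something posted in the thread. It assumes the file prints a line of the form "device <name> reported unhealthy" for each failing device and the single word "healthy" otherwise. Both function names are hypothetical, and the path in the usage note is an assumption for 1.6-era servers.

```python
import re


def unhealthy_devices(health_text):
    """Return the device names flagged as unhealthy in health_check text."""
    return re.findall(r"device\s+(\S+)\s+reported unhealthy", health_text)


def is_healthy(health_text):
    """True only when the whole report is the single word 'healthy'."""
    return health_text.strip() == "healthy"
```

On a server one might pass in the contents of /proc/fs/lustre/health_check (assumed location) from a cron job and send an alert whenever `is_healthy` returns False.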

[Lustre-discuss] Read-only file system

2009-03-21 Thread Mag Gam
We have been experiencing problems recently, where our Lustre filesystem is becoming read-only (we can't even see our data). For example when I invoke 'ls' or 'find' ls: .: Read-only file system find: cannot get current directory: Read-only file system The client version is: lustre: 1.6.6 kernel

Re: [Lustre-discuss] Removing a filesystem

2009-03-20 Thread Mag Gam
In other words, reboot all of your clients? On Fri, Mar 20, 2009 at 12:25 PM, Brian J. Murrell wrote: > On Thu, 2009-03-19 at 22:55 -0400, Mag Gam wrote: >> >> "Lustre Error 137-5: UUID 'oldfs001-OST-UUID' is not available for >> connect (no target)

[Lustre-discuss] 2 LustreErrors (possible statahead)

2009-03-20 Thread Mag Gam
Hello All: I have been seeing these messages in our syslogs: Lustre MDS: 1.6.4.3 with kernel 2.6.22 Clients: lustre: 1.6.6 kernel: patchless build: 1.6.6-1969123119-PRISTINE-.usr.src.linux-2.6.18-92.1.10.el5 Mar 20 07:15:46 OSS07 kernel: [32288.746536] LustreError: 7960:0:(client.c:521:p

Re: [Lustre-discuss] Removing a filesystem

2009-03-19 Thread Mag Gam
Using lustre 1.6.4.3 with kernel 2.6.22 TIA On Thu, Mar 19, 2009 at 10:55 PM, Mag Gam wrote: > We previously had a filesystem and we removed its MGS/MDS and its > OSTS. We created a new filesystem and everything is fine now. However, > We still see messages like this: > "L

[Lustre-discuss] Removing a filesystem

2009-03-19 Thread Mag Gam
We previously had a filesystem and we removed its MGS/MDS and its OSTs. We created a new filesystem and everything is fine now. However, we still see messages like this: "Lustre Error 137-5: UUID 'oldfs001-OST-UUID' is not available for connect (no target) ' On the MDS, I even did a 'lctl dl

Re: [Lustre-discuss] Lustre Group on LinkedIn

2009-03-18 Thread Mag Gam
Is there a Facebook page? On Tue, Mar 17, 2009 at 1:48 AM, Jeffrey Bennett wrote: > Hello, > > No intention to spam but I would like to mention that there is a Lustre group > on LinkedIn for those interested. > > The URL is http://www.linkedin.com/groups?home=&gid=1772375 > > jab > > __

Re: [Lustre-discuss] MDT backup

2009-03-18 Thread Mag Gam
Alex: For what it's worth, we gave up on backing up our MDS because of its sheer size and the contents of our filesystem. We have close to 40TB of space with very small files. Backing up the MDS without an LVM snapshot took almost 5 days. It's probably better to have a backup of your most important files and i

Re: [Lustre-discuss] Replacing OSTs

2009-03-18 Thread Mag Gam
I suggest creating a new filesystem and rsyncing the data over the network. I would not overcomplicate the situation. On Mon, Mar 16, 2009 at 8:41 AM, Rayentray Tappa wrote: > Hi List! > > As you may already know :) I'm now trying a lustre installation on a > mass storage prototype. I'll soon be re

Re: [Lustre-discuss] Adding OSTs problem

2009-03-17 Thread Mag Gam
Thanks Cliff. It seems that when I used fsname= (with the equals sign) it worked. On Mon, Mar 16, 2009 at 2:51 PM, Cliff White wrote: > Mag Gam wrote: >> >> I have added 2 volumes onto my existing filesystem. >> >> mkfs.lustre --fsname lfs002 --ost --mgsnode=mg...@tcp /dev/l
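
For reference, a small Python sketch of the command form reported to work above, with the filesystem name passed as `--fsname=` (equals sign included). It only builds the argument list and does not run anything. The helper name is hypothetical, and because the MGS NID is elided in the archive, `mgs-host@tcp` in the usage note is a placeholder rather than the poster's value.

```python
def mkfs_lustre_ost_argv(fsname, mgsnode, device):
    """Build the mkfs.lustre argument list for formatting a new OST,
    using the --fsname=<name> spelling (with the equals sign)."""
    return [
        "mkfs.lustre",
        f"--fsname={fsname}",    # equals-sign form reported to work
        "--ost",
        f"--mgsnode={mgsnode}",  # NID of the MGS; placeholder in examples
        device,
    ]
```

For example, `mkfs_lustre_ost_argv("lfs002", "mgs-host@tcp", "/dev/lustrevg/lv03")` yields the argument list for the first volume; review it before handing it to `subprocess.run`, since formatting destroys whatever is on the device.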

[Lustre-discuss] Adding OSTs problem

2009-03-15 Thread Mag Gam
I have added 2 volumes to my existing filesystem. mkfs.lustre --fsname lfs002 --ost --mgsnode=mg...@tcp /dev/lustrevg/lv03 mkfs.lustre --fsname lfs002 --ost --mgsnode=mg...@tcp /dev/lustrevg/lv04 I even managed to mount the OSTs (each is 2TB). However, on the clients we don't see the extra

Re: [Lustre-discuss] mds server crashing

2009-03-15 Thread Mag Gam
Lustre by NFS using the kernel export nfs daemon, try to disable that. > > > Cheers, > Bernd > > On Sunday 15 March 2009, Mag Gam wrote: >> This happened again :-( >> >> Basically, there is a process called "ll_mdt30" which is taking up >> 100% of the

Re: [Lustre-discuss] mds server crashing

2009-03-15 Thread Mag Gam
in /etc/modules.conf On Sat, Mar 14, 2009 at 8:35 AM, Mag Gam wrote: > Hey Bernd: > > Thanks for the reply. > > Interesting. We are using with NFS too. Is there something in > particular we need to do like "enable port 988 in /etc/modules.conf" > which I think
