Re: [Lustre-discuss] [HPDD-discuss] Unable to move MDS using procedure in the manual

2013-06-04 Thread Ken Hornstein
>Which version of Lustre is this? File based backup / restore does not work >in 2.x. OI scrub which rebuilds the object index is available from Lustre >2.3 onwards. So file based backup / restore will work from 2.3 onwards. Well, crud. I guess that's what Colin was going to tell me, and I see And

Re: [Lustre-discuss] [HPDD-discuss] Unable to move MDS using procedure in the manual

2013-06-04 Thread Ken Hornstein
>Which version? Whoops, can you believe I forgot that? It's 2.1.2. --Ken ___ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss

[Lustre-discuss] Unable to move MDS using procedure in the manual

2013-06-04 Thread Ken Hornstein
I tried to move my MDS from one filesystem on the same machine to another, using the procedure outlined in the Lustre manuals (I didn't use dd, since the underlying disks weren't the same size and also I did not think it was required). Specifically, I used rsync to copy the files, and also used ge

Re: [Lustre-discuss] problem with installing lustre and ofed

2012-12-28 Thread Ken Hornstein
>That's good to know kernel-ib comes with the lustre stock install. > >What about the rest of the OFED tools? I mean things like ibdiagnet, >ibstatus, etc? (I will look at the contents of the other rpms and see >what I can learn) I think Jeff missed a few steps. If you want the _server-side_ pa

Re: [Lustre-discuss] What does ldlm AST mean?

2012-08-28 Thread Ken Hornstein
>When I was reading source codes of ldlm, the term "AST" puzzled me. I think >it means callback, but what is its full name? Asynchronous software trap. If it makes you feel any better, I had to ask as well :-/ I was told the term dates back from VMS. Hm, some quick Googling suggests that it may

Re: [Lustre-discuss] Interoperable issues between 1.8.6 and 2.1

2012-08-23 Thread Ken Hornstein
>When you refer to ia64, are you referring to the itanium systems? I'm referring to systems where "uname -p" returns "ia64". Is that itanium? No idea. --Ken ___ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mail

Re: [Lustre-discuss] Interoperable issues between 1.8.6 and 2.1

2012-08-22 Thread Ken Hornstein
>I am not finding where it says explicitly that a lustre client running >1.8 will successfully be able to read and write to a set of lustre >servers running lustre 2.1. are there any known issues? I forget where that was written down; I can report that it works fine. WITH THE EXCEPTION of ia64 1.8

Re: [Lustre-discuss] [wc-discuss] Re: [Lustre-devel] [Twg] Lustre and cross-platform portability

2012-03-16 Thread Ken Hornstein
>Also fuse client will able to run on any OS have a FUSE porting that is >any BSD, OpenSolaris, MacOS, in additional to the windows. That is >easy way to maintain a single client for many OS. It is, unfortunately, not quite that simple. I can't claim to be a FUSE expert, but I've been paying att

Re: [Lustre-discuss] [Twg] Lustre and cross-platform portability

2012-03-16 Thread Ken Hornstein
>Ken, my apologies for this misstatement. I guess that my faulty memory >is to blame for the fact that I didn't recall the MacOS code was made >publicly available for download. No problem. Back when I gave the talk at LUG the source wasn't available yet due to issues here, but we got that worked

Re: [Lustre-discuss] [Twg] Lustre and cross-platform portability

2012-03-15 Thread Ken Hornstein
>I have no information that the WinNT project will ever be released >by Oracle, and as yet there has not been any code released from the >MacOS port, so the libcfs portability layer is potentially exacting >a high cost in code maintenance and complexity (CLIO being a prime >example) for no apparent

Re: [Lustre-discuss] New wc-discuss Lustre Mailing List

2011-07-12 Thread Ken Hornstein
>According to this FAQ: >http://groups.google.com/support/bin/answer.py?answer=46438&topic=9257 > >There's no need for a Google account to join a public Google Group via >email. But sending an email to wc-discuss+subscr...@googlegroups.com >and wc-discuss-subscr...@googlegroups.com both ended up wi

Re: [Lustre-discuss] speed differences in lustre/infiniband ipoib & "native" ib

2011-06-29 Thread Ken Hornstein
>we're struggling mightily to get ubuntu clients working in native IB >mode against centos lustre/IB servers. > >since we've never had a working native IB client, we have no basis in >our assumption that the speed increase should be tremendous & thus >justify our struggle. It really depends on a t

Re: [Lustre-discuss] lustre ofed compatibility

2011-06-09 Thread Ken Hornstein
>Doing a manual modprobe on lustre also fails: >> # modprobe lustre >> WARNING: Error inserting osc >> (/lib/modules/2.6.18-194.3.1.el5_lustre.1.8.4/updates/kernel/fs/lustre/osc.ko): >> The core problem is Lustre is compiled using the wrong symbol versions for the OFed you've actually got in th

Re: [Lustre-discuss] Poor metadata operation performance

2011-05-20 Thread Ken Hornstein
>Ken, the OSTs need to track the ownership of objects for quota. The more >stripes there are on a file, the more RPCs that need to be sent, which is why >we don't recommend wide striping unless there is a reason for it (bandwidth, >size, etc). Fair enough; I always forget about quota accounting,

[Lustre-discuss] Poor metadata operation performance

2011-05-20 Thread Ken Hornstein
So I guess there are some things I _still_ don't understand about Lustre metadata handling. Specifically, what metadata gets stored on OSTs and why. What brings this all up is that a) we have users who have lots of files and b) we recently are doing through some reorganization that requires chang

Re: [Lustre-discuss] IB storage as an OST target

2011-03-28 Thread Ken Hornstein
>Anybody had any experience using an IB based storage >target as an OST? We do that. >Apart from the obvious issue of separating the IB SAN(SRP/SER) >storage traffic from the Lustre traffic are there any issues? We don't actually separate the IB traffic from the Lustre traffic; in our cases they

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Ken Hornstein
>I have a report from a user that is is getting EINTR when a SIGALRM goes >off on his write(). It isn't unexpected to get SIGALRM because he >called the alarm, but he also has SA_RESTART set. I can't remember >whose responsibility it is to restart the call, syscall or whereever, >but it seems

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Ken Hornstein
>I don't understand why you don't just fix your application to handle a >perfectly valid and expected condition (that it's currently not >handling) instead of wasting time trying to find the cause of the >expected condition. Even if you find it, it's likely not a bug and not >something that can/wi

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Ken Hornstein
>As for your questions : >- I have to mention that I always had had this issue, and this is why >I've upgraded from 1.8.4 to 1.8.5, hoping this would solve it. Ah, okay, I misunderstood that; my apologies. >- I will try to have that SA_RESTART flag set in the app... if I can >find where the signa

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Ken Hornstein
>OK, the app is used to deal with standard disks, that is why it is not >handling the EINTR signal propoerly. I think you're misunderstanding what a "signal" is in the Unix sense. EINTR isn't a signal; it's a return code from the write() system call that says, "Hey, you got a signal in the middle

Re: [Lustre-discuss] Help! Newbie trying to set up Lustre network

2011-02-22 Thread Ken Hornstein
>But my mount command is failing and that's the issue: >mount -t lustre 192.168.0.2@tcp0:/temp /lustre >mount.lustre: mount 192.168.0.2@tcp0:/temp at /lustre failed: No such >device >Are the lustre modules loaded? Right, and every time I've seen the mount command fail like this (with ENODEV as the

Re: [Lustre-discuss] Help! Newbie trying to set up Lustre network

2011-02-22 Thread Ken Hornstein
>Dmesg and syslog are clean and has no entries about lustre client. ... are you _sure_? Even /var/log/messages? I ask because this sure seems like the Lustre modules are not loaded (you can check that with the lsmod command). If they aren't loaded, then the core issue will be buried somewhere i

Re: [Lustre-discuss] Compiling Lustre 2 on SLES10

2011-02-22 Thread Ken Hornstein
>Thanks for putting in this effort. I think that others would be >interested (I am). Perhaps you could share your work at github or post >to lustre-devel ahead of LUG? Thanks for the words of encouragement! My boss has no problem with my work being redistributed, but he's not really a fan of pu

Re: [Lustre-discuss] Compiling Lustre 2 on SLES10

2011-02-22 Thread Ken Hornstein
>FYI, 1.8 and 2.0 used to share the ldiskfs patches (we lost this with the >transition to git), so the ldiskfs module shipped with 1.8.5 (which supports >SLES11 SP1) already has most of the patches required for lustre 2.0.0. I >think it would have been less painful to start from there and to add th

Re: [Lustre-discuss] Compiling Lustre 2 on SLES10

2011-02-21 Thread Ken Hornstein
>Yes, that is what Oracle had announced in the roadmap. > >SLES servers are still supported on Lustre 1.8.x, but Oracle announced >plans to not support them with Lustre 2.x. Given the similarities >between the RHEL6 and SLES11 kernels, I am sure someone could bring SLES >support back when RHEL6

Re: [Lustre-discuss] Lustre install

2010-10-28 Thread Ken Hornstein
>Okay, so I ran a depmod and then tried again. I am now running into these >error messages: >-- >node205:/etc/modprobe.d # modprobe lustre >WARNING: Error inserting osc >(/lib/modules/2.6.32.12-0.7-default/updates/kernel/fs/lustre/osc.ko): >Input/output error You should in /var/log/messages or in

Re: [Lustre-discuss] recovering formatted OST

2010-10-21 Thread Ken Hornstein
>Now I have another problem. After last segfault I can not restart the fsck >due to MMP. >[...] >Also when I try to access filesystem via debugfs it fails: > >debugfs -c -R 'ls' /dev/scratch2_ost16vg/ost16lv >debugfs 1.41.10.sun2 (24-Feb-2010) >/dev/scratch2_ost16vg/ost16lv: MMP: fsck being run whi

Re: [Lustre-discuss] Problem with LNET and openibd on Lustre 1.8.4 while rebooting

2010-09-09 Thread Ken Hornstein
>lustre does get unmounted before NFS filesystem as seen in the log message... >the problem is due to the fact that LNET is still up when openibd gets >removed. Huh, I'm wondering how it ever worked "right" before. Certainly on the systems I have at 1.8.1.1, I always had to have a Lustre start/s

Re: [Lustre-discuss] OST targets not mountable after disabling/enabling MMP

2010-08-09 Thread Ken Hornstein
>Using 'tune2fs -f -E clear-mmp' causes tune2fs to segfault: Ewww well, not sure what to tell you about that. >Did you use a newer version of tune2fs/e2fsprogs? Our current version >is e2fsprogs-1.40.11.sun1-0redhat. Do you know if it's safe to rev up >versions on e2fsprogs while running

Re: [Lustre-discuss] OST targets not mountable after disabling/enabling MMP

2010-08-09 Thread Ken Hornstein
>This gives me an MMP error though: >[r...@oss-0-25 log]# tune2fs -E clear-mmp /dev/sdd >tune2fs 1.40.11.sun1 (17-June-2008) >tune2fs: MMP: appears fsck currently being run on the filesystem while >trying to open /dev/sdd >Couldn't find valid filesystem superblock. Oh, I forgot ... did you try ad

Re: [Lustre-discuss] OST targets not mountable after disabling/enabling MMP

2010-08-09 Thread Ken Hornstein
>We recently experienced a power failure (and subsequent UPS failure) >which caused our Lustre filesystem to shutdown hard. We were able to >bring it back online but started seeing errors where the OSTs were being >remounted as read-only. We observed that all of the read-only OSTs were >repor

Re: [Lustre-discuss] Per directory quota

2010-07-16 Thread Ken Hornstein
>Without size-on-mds, either way would have to query both the mds and >each OST to get the size info. Not being that familiar with size-on- >mds, it does seem likely that "du" would still have to query the OST >for size info, even when "ls -l" does not. As someone who has spent the past week

Re: [Lustre-discuss] Lustre on FreeBSD

2010-06-23 Thread Ken Hornstein
>...the build them against your kernel -- unless you mean licensing >support under FreeBSD? In terms of licensing ... since Lustre is GPL, I can't see any reason why there would be any licensing conflict. All you have to do is download the sources, make the few minor changes to port Lustre to Fr

Re: [Lustre-discuss] Problems with fstab entry on sles11

2010-05-03 Thread Ken Hornstein
>Perhaps other SLES users here can post ideas on how they handle this >situation. I guess for those using heartbeat, this is a non-issue as >heartbeat handles the mounting and heartbeat is started after >networking. So, as a SLES user, let me offer my $0.02. _netdev still works in the sense that

[Lustre-discuss] MacOS X Lustre client source code now available

2010-04-27 Thread Ken Hornstein
Greetings all. Thanks to the good folks at Oracle (who were kind enough to allow me to use their public Git repository infrastructure, and did all the hard work of setting up the repo and educating me on the finer points of pushing to remote Git repos) I am pleased to announce that the source code

Re: [Lustre-discuss] MMP Problems

2010-04-27 Thread Ken Hornstein
>Feature will not be enabled until e2fsprogsis updated and 'tune2fs -O >mmp %{device}' is run. >Normaly MMP should be automaticly enabled with lustre 1.8.x. >I also installed the newerst e2fsprogs but the error message is the same. >The rest works fine. In the specific case of SLES11, not only do

Re: [Lustre-discuss] Future of LusterFS?

2010-04-22 Thread Ken Hornstein
>Make sure you read and understand the Lustre 2.0 release notes before you >buy. There seemed to be some specifics in there about using Oracle hardware. In all fairness ... that only matters if you pay Oracle for support. If you aren't paying Oracle for support (or have no plans to), then it doe

[Lustre-discuss] Early alpha version of MacOS X Lustre client available

2010-04-12 Thread Ken Hornstein
Greetings all. I am pleased to announce that I have made available an "early alpha" version of my port of the Lustre client to MacOS X. By "early alpha", I mean that it works, for me, and it might work for you. But it might crash your system. Actually, it will probably almost certainly crash yo

Re: [Lustre-discuss] RHEL5's OFED with lustre1.8.2 on IB

2010-04-08 Thread Ken Hornstein
>Why not just use the binary kernel we provide instead of rebuilding your >own? It's the *exact* same kernel that we used in our QA testing and >therefore a known quantity. I have to agree with Brian here ... the best success that we've had is to either use _everything_ from Sun/Oracle (I'm just

Re: [Lustre-discuss] How to force client-oss communication over IB when the MDS has only ethernet?

2010-03-23 Thread Ken Hornstein
>Yes! I created OSTs and did some testing with plain eth configuration >first, so this would explain things. How can I tell the MDS that things >have changed? You'll have to run tunefs.lustre --writeconf (the Lustre manual explains this in a bit more detail). I had some vague memory that you o

Re: [Lustre-discuss] How to force client-oss communication over IB when the MDS has only ethernet?

2010-03-23 Thread Ken Hornstein
>At the moment, it seems that all the traffic between clients-OSS goes >also through the slow eth connection. Is it possible to force them to >use faster IB interfaces when communication with each other, and only >use eth to communicate with the MDS? Stupid question time: is it possible that y

Re: [Lustre-discuss] Is OFED 'kernel-ib' required for o2ib on RHEL5?

2010-03-23 Thread Ken Hornstein
>Thank you very much for your post, it worked! So ... what was your problem? Wrong version of OFed loaded? Or Lustre was compiled using the wrong symbol versions? --Ken ___ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustr

Re: [Lustre-discuss] Is OFED 'kernel-ib' required for o2ib on RHEL5?

2010-03-23 Thread Ken Hornstein
>I also noticed something in trying to compile lustre. The >"Module.symvers" file is very much needed but is only created after you do >"make/make modules" in the /usr/src/linux directory. It does NOT exist >before then and the lustre installation guide makes it seem >like this file should exist

Re: [Lustre-discuss] Is OFED 'kernel-ib' required for o2ib on RHEL5?

2010-03-23 Thread Ken Hornstein
>You're right, I had problems with the module symbol versions using >Lustre 1.8.2 packages available at Sun website, kernel >2.6.18-164.11.1.el5 (RHEL 5.4) and OFED 1.5. The same problems happens >when using OFED 1.4.2. So since this comes up now and then, I've cc'd the list. So you can Google ar

Re: [Lustre-discuss] Is OFED 'kernel-ib' required for o2ib on RHEL5?

2010-03-23 Thread Ken Hornstein
>I've tried also to get Lustre 1.8.2 working with RHEL5.4 and OFED 1.5 >but I didn't get this trio working. Even with OFED 1.4.2 I had problems >when modprobing lustre module. I think you had problems with the module symbol versions, right? Those are relatively easy to track down, once you know a

Re: [Lustre-discuss] New to lustre - Help setting up Lustre

2009-12-30 Thread Ken Hornstein
> I think the original problem you were referring to is "undefined >symbol: ext2_attr_index_prefix " ? This is fixed in bug#20830 and >checked in to the cvs. The fix will be available with release of >e2fsprogs 1.41.9-sun1. Actually, I just looked at this bug, and this is not the problem

Re: [Lustre-discuss] New to lustre - Help setting up Lustre

2009-12-28 Thread Ken Hornstein
>As well, you should install the e2fsprogs package for your distro and >architecture. As long as we're talking about this ... in the case of SLES 11, you need to install some extra, hidden RPMs to get e2fsprogs to work (I found those by accident on downloads.lustre.org after spending a bunch of ti

[Lustre-discuss] e2fsck: undefined symbol: ext2_attr_index_prefi

2009-10-09 Thread Ken Hornstein
I have Lustre 1.8.1 running on a bunch of SLES 11/x86_64 systems. I'm using the stock binaries from www.sun.com. Everything is fine ... except that some of the e2fsprogs utilites are unhappy. Specifically, if I try to run e2fsck, I get: # e2fsck /dev/sdb e2fsck: symbol lookup error: e2fsck: und