Re: [Lustre-discuss] lustre patches for e2fsprogs version 1.41.0?

2008-08-28 Thread Patrick Winnertz
Hello, > If you are very interested to start working on this, then you can get the > lustre-e2fsprogs CVS module (put it in a directory called "patches" in > the e2fsprogs tree) and then run "quilt push -a" to try and apply patches, > fixing each one as you go. Where is this module located? I didn

Re: [Lustre-discuss] Softlockup issues. Lustre related?

2008-08-28 Thread Bernd Schubert
Hello Alex, On Thursday 28 August 2008 05:20:47 Alex Lee wrote: > Hello Folks, > > I have few client nodes that are getting soft lockup errors. These are > patchless clients running Lustre 1.6.5.1 with kernel > 2.6.18-53.1.6.el5-PAPI. More or less stock RHEL 5.1 with PAPI patch added > on it. The

Re: [Lustre-discuss] Seeing OST errors on the OSS that doesnt have it mounted

2008-08-28 Thread Bernd Schubert
On Thursday 28 August 2008 05:41:24 Alex Lee wrote: > Is there any documentation on how to decode the error messages? I feel > bad keep posting on the list for every single error message I dont > understand. I don't think so, you have the source ;) Nathaniel Rutman posted a quite useful bash errn

Re: [Lustre-discuss] Softlockup issues. Lustre related?

2008-08-28 Thread Alex Lee
Bernd Schubert wrote: > Hello Alex, > > On Thursday 28 August 2008 05:20:47 Alex Lee wrote: > >> Hello Folks, >> >> I have few client nodes that are getting soft lockup errors. These are >> patchless clients running Lustre 1.6.5.1 with kernel >> 2.6.18-53.1.6.el5-PAPI. More or less stock RHEL 5.

Re: [Lustre-discuss] HAMMER

2008-08-28 Thread Mag Gam
Well, I guess I was intrigued by the replication portion of HAMMER. I suppose SNS will take care of this for us... On Sat, Aug 23, 2008 at 10:28 AM, Troy Benjegerdes <[EMAIL PROTECTED]> wrote: > On Sat, Aug 23, 2008 at 05:51:36PM +0400, Nikita Danilov wrote: >> Mag Gam writes: >> > Looks like t

Re: [Lustre-discuss] csum errors

2008-08-28 Thread Stuart Midgley
for completeness, here are the logs from 172.16.4.93 Aug 27 07:49:55 clus093 kernel: LustreError: 132-0: BAD WRITE CHECKSUM: changed on the client after we checksummed it - likely false positive due to mmap IO (bug 11742): from [EMAIL PROTECTED] inum 24522277/1605841060 object 12021/0 extent

[Lustre-discuss] ksocklnd multiple connections

2008-08-28 Thread Tim Burgess
Hi All, Just wondering if someone can give us some insight into the logic that ksocklnd uses to decide which connections to make. There's not so much in the Lustre operations manual about it, but the impression I get from reading around is that if we have: options lnet networks=tcp0(eth0,eth1)

[Lustre-discuss] Lustre_config fails trying to access mgs - mdt and mgs are configured together.

2008-08-28 Thread Alexander, Jack
In my lustre 1.6 config, I have two MSA2000 and two DL380G5 servers. The servers sfs1 and sfs2 are the internal network Ethernet names for my two servers. The system interconnect names, ic-sfs1 and ic-sfs2, correspond to the servers. I've successfully (I think) run both "lctl ping [EMAIL PROT

Re: [Lustre-discuss] Lustre Patchless Client

2008-08-28 Thread Andreas Dilger
On Aug 27, 2008 20:57 +0300, Ender Güler wrote: > I'm new to lustre community and lustre software as well. I have question > regarding to patchless lustre client installation. The OS is Redhat EL 5.1. > I'm using voltaire gridstack v5.1.3 for infiniband software stack. I > installed the lustre-1.6

Re: [Lustre-discuss] Softlockup issues. Lustre related?

2008-08-28 Thread Bernd Schubert
Hi Alex, On Thursday 28 August 2008 14:52:22 Alex Lee wrote: > Someone found this bug for me that looks very similar. > > https://bugzilla.lustre.org/show_bug.cgi?id=15975 > > Does this look anything close? I'm pretty clueless about debugging > kernel traces. yeah, looks like this is your issue.

Re: [Lustre-discuss] Softlockup issues. Lustre related?

2008-08-28 Thread Alex Lee
Bernd Schubert wrote: > Hi Alex, > > On Thursday 28 August 2008 14:52:22 Alex Lee wrote: > >> Someone found this bug for me that looks very similar. >> >> https://bugzilla.lustre.org/show_bug.cgi?id=15975 >> >> Does this look anything close? I'm pretty clueless about debugging >> kernel traces.

Re: [Lustre-discuss] Seeing OST errors on the OSS that doesnt have it mounted

2008-08-28 Thread Andreas Dilger
On Aug 28, 2008 12:41 +0900, Alex Lee wrote: > Andreas Dilger wrote: >>> Aug 23 12:27:52 lustre-oss-0-0 kernel: LustreError: >>> 2918:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@ >>> @ processing error (-19) [EMAIL PROTECTED] x52/t0 o8->@:0/0 lens >>> 240/0 e 0 to 0 dl 1219462372 >>> ref 1 f
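
The "processing error (-19)" in the log quoted above is a negated kernel errno value, and on Linux errno 19 is ENODEV. The following is only a minimal sketch of looking such a code up, assuming a Linux host with Python available; the helper name decode_lustre_rc is hypothetical and is not part of Lustre or of the script mentioned elsewhere in this thread.

```python
import errno
import os


def decode_lustre_rc(rc):
    """Map a negated errno (as printed in LustreError lines) to a name and message.

    decode_lustre_rc is a hypothetical helper for illustration only.
    """
    code = abs(rc)
    # errno.errorcode maps numeric codes to symbolic names on this platform.
    name = errno.errorcode.get(code, "UNKNOWN")
    # os.strerror returns the platform's human-readable description.
    return name, os.strerror(code)


if __name__ == "__main__":
    name, message = decode_lustre_rc(-19)
    print(name, "-", message)
```

The numeric values are platform-specific, so the lookup should be run on the same kind of system that produced the log.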

Re: [Lustre-discuss] csum errors

2008-08-28 Thread Andreas Dilger
On Aug 28, 2008 21:49 +0800, Stuart Midgley wrote: > for completeness, here are the logs from 172.16.4.93 > > Aug 27 07:49:55 clus093 kernel: LustreError: 132-0: BAD WRITE > CHECKSUM: changed on the client after we checksummed it - likely false > positive due to mmap IO (bug 11742): from [EMA

Re: [Lustre-discuss] lustre patches for e2fsprogs version 1.41.0?

2008-08-28 Thread Andreas Dilger
On Aug 28, 2008 09:12 +0200, Patrick Winnertz wrote: > > If you are very interested to start working on this, then you can get the > > lustre-e2fsprogs CVS module (put it in a directory called "patches" in > > the e2fsprogs tree) and then run "quilt push -a" to try and apply patches, > > fixing ea

Re: [Lustre-discuss] ksocklnd multiple connections

2008-08-28 Thread Andreas Dilger
On Aug 28, 2008 22:11 +0800, Tim Burgess wrote: > - all dual connected hosts are connected to both LeftSwitch and RightSwitch > - clients network interfaces are 172.16.4.x/16 (eth0,leftswitch) and > 172.16.5.x/16 (eth1,rightswitch) > - OSS/MDS network interfaces are 172.16.0.x/16 (eth0,leftswitch)

Re: [Lustre-discuss] csum errors

2008-08-28 Thread Stuart Midgley
Thanks for the information, greatly appreciated. We are keeping an eye on the client causing these errors and doing a few tests. The mmap issue is interesting. The code producing these errors is running across the entire cluster, so I assume if it was mmap-ing we would be seeing these sort