Re: [Lustre-discuss] Linux kernel problem

2009-11-12 Thread Papp Tamás
Papp Tamás wrote, On 2009. 11. 13. 4:26: > liu ning wrote, On 2009. 11. 13. 4:18: > >> Hi all, >> >> I installed Lustre on a small cluster running CentOS 5.3 and it works >> well. But after my installation, the Linux kernel is patched and the >> NVIDIA driver cannot be used. As a result, the v

Re: [Lustre-discuss] Linux kernel problem

2009-11-12 Thread Papp Tamás
liu ning wrote, On 2009. 11. 13. 4:18: > Hi all, > > I installed Lustre on a small cluster running CentOS 5.3 and it works > well. But after my installation, the Linux kernel is patched and the > NVIDIA driver cannot be used. As a result, the visualization software > cannot work properly. So I

Re: [Lustre-discuss] osc lost on MDS server

2009-11-12 Thread Lu Wang
We put the 2 servers back into the cluster. After 15 hours of running, we got these errors in /var/log/message: Nov 13 10:37:04 beshome01 kernel: LustreError: 2359:0:(llog_obd.c:211:llog_add()) No ctxt Nov 13 10:37:04 beshome01 kernel: LustreError: 2359:0:(llog_obd.c:211:llog_add()) Skipped 2

[Lustre-discuss] Linux kernel problem

2009-11-12 Thread liu ning
Hi all, I installed Lustre on a small cluster running CentOS 5.3 and it works well. But after my installation, the Linux kernel is patched and the NVIDIA driver cannot be used. As a result, the visualization software cannot work properly. So I tried to reinstall the NVIDIA driver but the installa

Re: [Lustre-discuss] Dual NICs issue -- How to enforce Lustre to use the second NIC

2009-11-12 Thread Daneil Goodman
> > 3. On private network node, I cannot start LNET using ip2nets option > [r...@private ~]# lsmod |grep lnet > lnet 273084 1 ksocklnd > libcfs 136180 2 ksocklnd,lnet > [r...@private ~]# lctl network configure > LNET configure error 100: Network is down > > /var/lo

[Lustre-discuss] Lustre slow file open close on RHEL5

2009-11-12 Thread Wojciech Turek
Hi, Cluster running Lustre 1.6.6. Opening and closing files takes longer on RHEL5 than on RHEL4. This only happens with files located on a Lustre file system. To reproduce this problem I used a small C program (at the bottom of my email). Is this a known problem? I will be grateful for any sugg
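The C program mentioned in this message is not part of the snippet. As a rough illustration of the kind of measurement described, here is a minimal sketch in Python (not the poster's code) that times repeated open/close pairs on an existing file. The function names `time_open_close` and `compare` and the iteration count are assumptions made for this example.

```python
import os
import time


def time_open_close(path, iterations=1000):
    """Return the mean wall-clock seconds per open()+close() pair on an existing file."""
    start = time.perf_counter()
    for _ in range(iterations):
        fd = os.open(path, os.O_RDONLY)  # plain open(2), read-only
        os.close(fd)
    return (time.perf_counter() - start) / iterations


def compare(paths, iterations=1000):
    """Time each path the same way so the results can be compared side by side."""
    return {p: time_open_close(p, iterations) for p in paths}
```

Calling `compare` with one file on the Lustre mount and one on a local disk, on both the RHEL4 and RHEL5 clients, would show whether the slowdown is confined to Lustre as the message reports.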

Re: [Lustre-discuss] Mount Failure

2009-11-12 Thread Isaac Huang
On Thu, Nov 12, 2009 at 12:47:33PM -0500, Brian J. Murrell wrote: > On Thu, 2009-11-12 at 10:37 +, Chris Exton wrote: > > I am having a few problems with Lustre and I can’t seem to find the > > answer to my problem on the web so I wondered if you could help? > > You have networking problems

Re: [Lustre-discuss] Mount Failure

2009-11-12 Thread Brian J. Murrell
On Thu, 2009-11-12 at 10:37 +, Chris Exton wrote: > I am having a few problems with Lustre and I can’t seem to find the > answer to my problem on the web so I wondered if you could help? You have networking problems: Nov 11 11:26:00 lss01 kernel: LustreError: 9238:0:(lib-move.c:1371:lnet_sen

Re: [Lustre-discuss] Dual NICs issue -- How to enforce Lustre to use the second NIC

2009-11-12 Thread Daneil Goodman
On Wed, Nov 11, 2009 at 9:20 PM, Isaac Huang wrote: > On Wed, Nov 11, 2009 at 04:07:39PM -0600, Daneil Goodman wrote: > >Hello list, > >By searching the archive, I found a similar message dated back in > >January 2008 -- How do you make an MGS/OSS listen on 2 NICs? Looks > like > >

Re: [Lustre-discuss] e2fsck --mdsdb segmentation fault

2009-11-12 Thread Andreas Dilger
On 2009-11-12, at 04:42, Heiko Schröter wrote: > some of our OSTs and our MDS have block and inode errors in the > filesystem. > When cleaning the systems with e2fsck everything looks ok till some > minutes later new block and inode errors are introduced on the OSTs > and the MDS. > So we gath

Re: [Lustre-discuss] osc lost on MDS server

2009-11-12 Thread Lu Wang
Hi list, We tried again to recover the system to a consistent state with the following steps: 1. Pulled out the 10Gbit Ethernet links connecting to the computing cluster, and connected the 2 servers using a direct Ethernet link. This step isolated the 2 servers from computing c