Re: [Lustre-discuss] OSS1 Node issue

2012-02-21 Thread Larry
I have checked your logs. There may be several OSTs on your oss1, and at least one of them must be read-only; this has nothing to do with permissions. Running e2fsck on your OST device is recommended to resolve the rc=-30 problem. On Tue, Feb 21, 2012 at 4:00 PM, VIJESH EK ekvij...@gmail.com

Re: [Lustre-discuss] OSS1 Node issue

2012-02-21 Thread VIJESH EK
We are waiting for your feedback. Thanks and regards, VIJESH E K On Tue, Feb 21, 2012 at 12:22 PM, VIJESH EK ekvij...@gmail.com wrote: Dear All, We have made the following changes on the exec nodes, but we are still getting the same errors in

Re: [Lustre-discuss] EXTERNAL: Re: LNET Performance Issue

2012-02-21 Thread Barberi, Carl E
I am using QDR IB and I am not using any IB extenders. This is a self-contained network, with no access outside of the lab it is currently in. Carl From: Jeremy Filizetti [mailto:jeremy.filize...@gmail.com] Sent: Monday, February 20, 2012 8:37 PM To: Kevin Van Maren Cc: Barberi, Carl E;

[Lustre-discuss] Printing the data in OST code

2012-02-21 Thread Jack David
Hi All, I am trying to analyze the OSC-OST communication by putting some prints in the code. Basically, I want to print the data that is sent by the OSC and received by the OST. But I am not able to print the correct data, maybe because I am not printing the correct buffers. Please correct me in the

Re: [Lustre-discuss] Printing the data in OST code

2012-02-21 Thread Jack David
On Tue, Feb 21, 2012 at 8:13 PM, Jack David jd6...@gmail.com wrote: Hi All, I am trying to analyze the OSC-OST communication by putting some prints in the code. Basically, I want to print the data that is sent by the OSC and received by the OST. But I am not able to print the correct data, maybe because

Re: [Lustre-discuss] OSS1 Node issue

2012-02-21 Thread Kevin Van Maren
This is not the correct list for help with SGE. That being said, the real issue (as has been mentioned by several people) is that an OST has gone read-only due to some issue. The file system will not function properly until this is resolved, irrespective of where you put SGE. You will need to

Re: [Lustre-discuss] OSS1 Node issue

2012-02-21 Thread Kevin Van Maren
The logs you attached start sometime after the issue began. To tell what happened, you need to find the error in the logs from before you started getting these errors: Feb 5 04:03:13 oss1 kernel: LustreError: 9222:0:(filter_io_26.c:693:filter_commitrw_write()) error starting transaction: rc = -30 It
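For readers unsure why rc = -30 points to a read-only OST: the kernel reports errors as negated errno values, and errno 30 on Linux is EROFS ("Read-only file system"). A minimal sketch of decoding the code from the log line quoted above follows. The helper name `decode_rc` and the regular expression are illustrative assumptions, not part of Lustre or any Lustre tool.

```python
import errno
import os
import re

# Matches the "rc = -N" (or "rc=-N") field seen in LustreError lines.
# The pattern is an assumption based on the single log line quoted in this thread.
RC_PATTERN = re.compile(r"rc\s*=\s*(-?\d+)")


def decode_rc(log_line):
    """Return (errno number, symbolic name, description), or None if no rc field is found.

    decode_rc is a hypothetical helper written for this example.
    """
    match = RC_PATTERN.search(log_line)
    if match is None:
        return None
    # Kernel code returns negated errno values, so drop the sign.
    code = abs(int(match.group(1)))
    name = errno.errorcode.get(code, "UNKNOWN")
    return code, name, os.strerror(code)


line = (
    "Feb 5 04:03:13 oss1 kernel: LustreError: "
    "9222:0:(filter_io_26.c:693:filter_commitrw_write()) "
    "error starting transaction: rc = -30"
)

print(decode_rc(line))
```

On a Linux host this reports EROFS for the line above, which agrees with the diagnosis in this thread that an OST has gone read-only.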