Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Bart Van Assche
Hello,
Has anyone already tried to process the work completions generated by a HCA after the state of a queue pair has been changed to IB_QPS_ERR ? With the hardware/firmware/driver combination I have tested I have observed the following:
* Multiple completions with the same wr_id and nonzero (err

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Ralph Campbell
I haven't seen it. It isn't supposed to happen. What hardware and software are you using and how do you reproduce it?
On Tue, 2010-10-12 at 11:38 -0700, Bart Van Assche wrote:
> Hello,
>
> Has anyone already tried to process the work completions generated by
> a HCA after the state of a queue p

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Or Gerlitz
Bart Van Assche wrote:
> Has anyone been looking into this before ?
nope, never ever, what hca is that?
Or.

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Bart Van Assche
On Tue, Oct 12, 2010 at 8:50 PM, Ralph Campbell wrote:
> On Tue, 2010-10-12 at 11:38 -0700, Bart Van Assche wrote:
>> Hello,
>>
>> Has anyone already tried to process the work completions generated by
>> a HCA after the state of a queue pair has been changed to IB_QPS_ERR ?
>> With the hardware/fi

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Eli Cohen
On Tue, Oct 12, 2010 at 08:58:59PM +0200, Bart Van Assche wrote:
> On Tue, Oct 12, 2010 at 8:50 PM, Ralph Campbell
> wrote:
> > On Tue, 2010-10-12 at 11:38 -0700, Bart Van Assche wrote:
> >> Hello,
> >>
> >> Has anyone already tried to process the work completions generated by
> >> a HCA after the

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Eli Cohen
libmlx4 does not report the opcode of the operation if it completed with error. We will fix this and post a patch for review.
On Tue, Oct 12, 2010 at 10:22 PM, Eli Cohen wrote:
> On Tue, Oct 12, 2010 at 08:58:59PM +0200, Bart Van Assche wrote:
>> On Tue, Oct 12, 2010 at 8:50 PM, Ralph Campbell
>>

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Or Gerlitz
Eli Cohen wrote:
> Completions with non-zero (error) status and a wr_id / opcode combination were received that were never queued by the application.
> In case of error the opcode of the completed operation is not provided. I am not sure why.
Eli, there's nothing in the IB spec that mandates the WC

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Eli Cohen
On Wed, Oct 13, 2010 at 03:51:10PM +0200, Or Gerlitz wrote:
> Eli Cohen wrote:
> >Completions with non-zero (error) status and a wr_id / opcode
> >combination were received that were never queued by the
> >application.
> >In case of error the opcode of the completed operation is not provided. I am

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Roland Dreier
> "If the status of the operation that generates the Work Completion is
> anything other than success, the contents of the Work Completion are
> undefined except as noted below. The contents of a Work Completion
> are:"
Yes, that is exactly right. And the "noted below" fields are the status a

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Roland Dreier
I'm not clear on the problem observed here. A few notes:
- If a QP transitions to error state, then *all* work requests, whether or not they were signaled, generate a completion with status "flush".
- If a work request completes with an error status, then the opcode field is not defined.
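The two notes above suggest a defensive pattern for the consumer: remember what was posted under each wr_id, and on an error completion take the operation type from that record rather than from the completion itself. What follows is only a minimal sketch of that idea in Python, not the verbs API. `WorkCompletion`, `CompletionTracker`, the status strings and the opcode strings are all hypothetical names introduced for illustration (in libibverbs the flush status is reported as `IBV_WC_WR_FLUSH_ERR`).

```python
# Hypothetical model of a completion consumer; none of these names come
# from libibverbs or the kernel verbs API.

SUCCESS = "success"
FLUSH_ERR = "flush"  # models the "flush" status mentioned in the thread


class WorkCompletion:
    """Stand-in for a work completion as seen by the application."""

    def __init__(self, wr_id, status, opcode=None):
        self.wr_id = wr_id
        self.status = status
        # Per the thread, opcode is only meaningful when status is SUCCESS.
        self.opcode = opcode


class CompletionTracker:
    """Records the opcode of each posted work request, keyed by wr_id."""

    def __init__(self):
        self._outstanding = {}

    def post(self, wr_id, opcode):
        # Assumption: the application keeps wr_ids unique while outstanding.
        if wr_id in self._outstanding:
            raise ValueError("wr_id %d is already outstanding" % wr_id)
        self._outstanding[wr_id] = opcode

    def complete(self, wc):
        """Classify a completion; returns (verdict, opcode_from_our_record)."""
        opcode = self._outstanding.pop(wc.wr_id, None)
        if opcode is None:
            # Never posted, or already completed once (a duplicate).
            return ("unexpected", None)
        if wc.status == SUCCESS:
            return ("ok", opcode)
        # Error path: deliberately ignore wc.opcode and use our own record.
        verdict = "flushed" if wc.status == FLUSH_ERR else "failed"
        return (verdict, opcode)
```

Used this way, a second completion for the same wr_id, or one for a wr_id that was never queued, shows up as `"unexpected"` instead of being misattributed to another request, which makes reports like the one that started this thread easier to reproduce.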

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Eli Cohen
On Wed, Oct 13, 2010 at 09:05:47AM -0700, Roland Dreier wrote:
> > "If the status of the operation that generates the Work Completion is
> > anything other than success, the contents of the Work Completion are
> > undefined except as noted below. The contents of a Work Completion
> > are:"
> >

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-13 Thread Bart Van Assche
On Wed, Oct 13, 2010 at 6:18 PM, Roland Dreier wrote:
>
> I'm not clear on the problem observed here. A few notes:
>
>  - If a QP transitions to error state, then *all* work requests, whether
>   or not they were signaled, generate a completion with status "flush".
>
>  - If a work request comple