Re: [PATCH 00/20, v4] Make ib_srp better suited for H.A. purposes

2012-08-28 Thread Bart Van Assche
On 08/27/12 18:37, Dongsu Park wrote: while testing ib_srp based on your srp-ha, we sometimes hit kernel crashes with the call trace below. How to reproduce: 0. Kernel 3.2.15 with SCST v4193 on the target, Kernel 3.2.8 with ib_srp-ha on the initiator. 1. Configure 500+ vdisks on

Re: [PATCH 00/20, v4] Make ib_srp better suited for H.A. purposes

2012-08-28 Thread Dongsu Park
Hi Bart, On 28.08.2012 10:04, Bart Van Assche wrote: On 08/27/12 18:37, Dongsu Park wrote: while testing ib_srp based on your srp-ha, we sometimes hit kernel crashes with the call trace below. How to reproduce: 0. Kernel 3.2.15 with SCST v4193 on the target, Kernel 3.2.8 with

Re: [PATCH 00/20, v4] Make ib_srp better suited for H.A. purposes

2012-08-28 Thread Bart Van Assche
On 08/28/12 12:25, Dongsu Park wrote: By the way, as long as I've observed today, the crash occurs only if rport_dev_loss_timedout() is called. It means, without device loss, a simple rport_delete does not make any crash. Is that probably because arguments to pr_err() are accessing to

ibv_modify_qp to IBV_QPS_ERR returns EAGAIN

2012-08-28 Thread Albert Strasheim
Hello all I was wondering if anyone could shed some light about what kind conditions might cause ibv_modify_qp to IBV_QPS_ERR to return EAGAIN? The error occurred on a QP that probably already had some work completions for requests that failed. I've only seen it happen once in about 3 months, so

Re: [PATCH for-next V1 0/4] IB/IPoIB TSS and RSS support for datagram mode

2012-08-28 Thread Or Gerlitz
On Mon, Aug 13, 2012 at 5:27 PM, Tzahi Oved tza...@mellanox.com wrote: Sean – thanks for the feedback. Reg the XRC semantics and object model: Sean, Can you let us know your thoughts here? Or. - XRC domain object allows many to many mappings where multiple XRC TGT QPs and multiple XRC SRQs

RE: [PATCH for-next V1 0/4] IB/IPoIB TSS and RSS support for datagram mode

2012-08-28 Thread Hefty, Sean
Can you let us know your thoughts here? I understand the purpose behind TSS/RSS. I'm not fond of making verbs more complex, but I haven't come up with anything that's really simpler. Tzahi's response addressed my main concerns. Is there a compelling reason for ever exposing this feature to

[PATCH] opensm/osm_sw_info_rcv.c: Fixed locking issue on osm_get_node_by_guid error

2012-08-28 Thread Hal Rosenstock
Signed-off-by: Hal Rosenstock h...@mellanox.com --- diff --git a/opensm/osm_sw_info_rcv.c b/opensm/osm_sw_info_rcv.c index 84e7fe0..122af89 100644 --- a/opensm/osm_sw_info_rcv.c +++ b/opensm/osm_sw_info_rcv.c @@ -378,7 +378,7 @@ void osm_si_rcv_process(IN void *context, IN void *data)

RE: ibv_modify_qp to IBV_QPS_ERR returns EAGAIN

2012-08-28 Thread Hefty, Sean
I was wondering if anyone could shed some light about what kind conditions might cause ibv_modify_qp to IBV_QPS_ERR to return EAGAIN? The error occurred on a QP that probably already had some work completions for requests that failed. I've only seen it happen once in about 3 months, so

RE: Writing RDMA applications on Linux

2012-08-28 Thread Hefty, Sean
$ ./examples/rstream -s 10.30.3.2 -S all name bytes xfers iters total time Gb/secusec/xfer 16k_lat 16k 1 10k 312m0.52s 5.06 25.93 24k_lat 24k 1 10k 468m0.82s 4.79 41.08 32k_lat 32k 1 10k

Re: [PATCH for-next V1 0/4] IB/IPoIB TSS and RSS support for datagram mode

2012-08-28 Thread Or Gerlitz
On Tue, Aug 28, 2012 at 9:07 PM, Hefty, Sean sean.he...@intel.com wrote: Can you let us know your thoughts here? I understand the purpose behind TSS/RSS. I'm not fond of making verbs more complex, but I haven't come up with anything that's really simpler. Tzahi's response addressed my main

Re: Writing RDMA applications on Linux

2012-08-28 Thread Gaetano Mendola
I'm not sure if I have to say sorry for the noise or not but it seems that the issue was just an NUMA issue! My system is a 2 node NUMA system and the IB board is attached on NODE 0. Not performing any cpu/mem affinity it seems the code runs on the worst node, always! Without affinity ( I did run