Arlin, Can you please refer to which bugzilla I should log a bug? Can you please provide me the url?
Thanks, Vipul On 30-11-2012 05:21, Davis, Arlin R wrote: > Vipul, > > Can you submit a bug in bugzilla for tracking? I will try to get to this > next couple of days. > > -arlin > >> -----Original Message----- >> From: Vipul Pandya [mailto:vi...@chelsio.com] >> Sent: Thursday, November 29, 2012 5:34 AM >> To: Davis, Arlin R >> Cc: Steve Wise; linux-rdma@vger.kernel.org; Kumar A S; Abhishek >> Agrawal; Divy Le Ray >> Subject: Re: Dapltest test error DAT_CONN_QUAL_IN_USE >> >> Hi Arlin, >> >> This issue is happening because there is a port collision between >> dapltest server port space and host TCP stack. The port collision >> happens because rdma_bind_addr is getting called from the two different >> places with different port arguments from dapltest. rdma_bind_addr is >> getting called from the following two places: >> >> 1. Once it is getting called from dapls_ib_setup_conn_listener function >> with starting port as 45278. Based on number of threads and eps, in >> subsequent call of dapls_ib_setup_conn_listener this port number will >> keep getting incremented. >> >> 2. 2nd time it is getting called from dapls_ib_qp_alloc function with >> port number as always 0. Now, when rdma_bind_addr gets called with port >> number 0 it will allocate any free random port number. >> >> Then when dapls_ib_setup_conn_listener calls the rdma_bind_addr with >> fix port number which is already allocate via dapls_ib_qp_alloc >> function rdma_bind_addr will return EADDRINUSE error, which in turn >> will result in DAT_CONN_QUAL_IN_USE error. >> >> I think solution here would be to call rdma_bind_addr from both the >> location passing port number from the same port range. >> >> Please let me know your thoughts on this. >> >> Our testing has been blocked because of this issue. We would like to >> get this fixed. Please let us know if we need to log a bug anywhere for >> this. >> >> Thanks, >> Vipul >> >> On 27-11-2012 01:24, Steve Wise wrote: >>> Perhaps the port is in use by the host TCP stack? >>> >>> >>> On 11/26/2012 1:30 PM, Davis, Arlin R wrote: >>>> dapltest server will start with port 45278 and increase by client >> thread count during each new client connection. If you never restart >> the server it will continue to increase the listen port based on new >> clients connecting. If you restart dapltest it will restart back at >> port 45278. I am not familiar with iWarp CM but the error is coming >> from rdma_bind_addr (EADDRINUSE|EBUSY|EADDRNOTAVAIL). I will have to >> defer to Steve for this error. >>>> >>>> -arlin >>>> >>>> >>>>> -----Original Message----- >>>>> From: linux-rdma-ow...@vger.kernel.org [mailto:linux-rdma- >>>>> ow...@vger.kernel.org] On Behalf Of Vipul Pandya >>>>> Sent: Friday, November 23, 2012 5:54 AM >>>>> To: linux-rdma@vger.kernel.org >>>>> Cc: Kumar A S; Steve Wise; Abhishek Agrawal; Davis, Arlin R; Divy >> Le >>>>> Ray >>>>> Subject: Dapltest test error DAT_CONN_QUAL_IN_USE >>>>> >>>>> Hi All, >>>>> >>>>> I was running dapltest between my client and server machines with >>>>> OFED- 3.5. While running the test it dapltest server throws an >> error >>>>> DAT_CONN_QUAL_IN_USE if I increase number of threads and endpoints. >>>>> >>>>> Dapltest server: >>>>> --------------- >>>>> dapltest -T S -D chelsio1 >>>>> >>>>> Dapltest client: >>>>> --------------- >>>>> dapltest -T T -s 102.1.1.2 -D chelsio1 -R BE -i 1 -t 16 -w 8 server >>>>> SR >>>>> 8192 4 client SR 8192 4 >>>>> >>>>> >>>>> Once I run the above test i get the following error on server side >>>>> and client side stalls. >>>>> >>>>> $# dapltest -T S -D chelsio1 >>>>> Dapltest: Service Point Ready - chelsio1 >>>>> Test[b13f]: dat_psp_create #6 error: DAT_CONN_QUAL_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #0 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> Test[b13f]: dat_evd_free (creq) error: DAT_INVALID_STATE >>>>> DAT_INVALID_STATE_EVD_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #1 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> Test[b13f]: dat_evd_free (creq) error: DAT_INVALID_STATE >>>>> DAT_INVALID_STATE_EVD_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #2 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> Test[b13f]: dat_evd_free (creq) error: DAT_INVALID_STATE >>>>> DAT_INVALID_STATE_EVD_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #3 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> Test[b13f]: dat_evd_free (creq) error: DAT_INVALID_STATE >>>>> DAT_INVALID_STATE_EVD_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #4 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> Test[b13f]: dat_evd_free (creq) error: DAT_INVALID_STATE >>>>> DAT_INVALID_STATE_EVD_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #5 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> Test[b13f]: dat_evd_free (creq) error: DAT_INVALID_STATE >>>>> DAT_INVALID_STATE_EVD_IN_USE >>>>> Test[b13f]: Warning: dat_ep_disconnect (abrupt) #6 error >>>>> DAT_INVALID_STATE DAT_INVALID_STATE_EP_UNCONNECTED >>>>> >>>>> Following link says DAT_CONN_QUAL_IN_USE error can come if rdma_cm >>>>> returns an error due to bind failure. >>>>> http://www.mail-archive.com/linux- >> r...@vger.kernel.org/msg01297.html >>>>> >>>>> rdma_cm from OFED-3.5 does not provide module parameter >>>>> 'unify_tcp_port_space'. So, just to narrow down I installed OFED- >>>>> 1.5.4.1 and ran the same test with unify_tcp_port_space=1. However >>>>> with that also I was able to reproduced the same issue. >>>>> >>>>> Please note that if I decrease the numbers of endpoints to 4 then >>>>> test works fine. i.e. If I give '-w 4' instead of '-w 8' in command >>>>> line then test runs fine. >>>>> >>>>> I am using dapltest version 2.0.36 which comes from OFED-3.5. >>>>> >>>>> Can anyone give any pointers on this? >>>>> >>>>> >>>>> Thanks, >>>>> Vipul >>>>> -- >>>>> To unsubscribe from this list: send the line "unsubscribe linux- >> rdma" >>>>> in the body of a message to majord...@vger.kernel.org More >> majordomo >>>>> info at http://vger.kernel.org/majordomo-info.html >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux- >> rdma" >>>> in the body of a message to majord...@vger.kernel.org More majordomo >>>> info at http://vger.kernel.org/majordomo-info.html >>> -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html