[openib-general] [PATCH] [TRIVIAL] cma.c: Remove compiler warning
This removes a compile warning : `ret' might be used uninitialized in this function This patch is against the IWARP branch of the code Signed-off-by: Pradipta Kumar Banerjee [EMAIL PROTECTED] --- Index: core/cma.c == --- cma.org 2006-06-22 12:45:33.0 +0530 +++ cma.c 2006-06-22 12:45:51.0 +0530 @@ -2066,6 +2066,7 @@ int rdma_disconnect(struct rdma_cm_id *i ret = iw_cm_disconnect(id_priv-cm_id.iw, 0); break; default: + ret = -ENOSYS; break; } out: ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [PATCH] [TRIVIAL] librdmacm/examples/mckey.c: Fix example name in messages
On Wed, 2006-06-21 at 18:38, Sean Hefty wrote: librdmacm/examples/mckey.c: Fix example name in messages Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] Thanks, Hal. Do you mind committing this change? Sure; committed in r8170. -- Hal - Sean ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [PATCH] opensm: osm_pkey_tbl_make_block_pair() removal
On Wed, 2006-06-21 at 09:52, Sasha Khapyorsky wrote: Since 'blocks' pkey vector is updated only by receiver, remove it from osm_pkey_tbl_set_new_entry(), as well as osm_pkey_tbl_make_block_pair(). Signed-off-by: Sasha Khapyorsky [EMAIL PROTECTED] Thanks. Applied. -- Hal ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [ibutils PATCH] osm.m4 fix for x86_64 machines
Hi The following patch osm.m4 fix changes the way ibutils packages auto-detect stack type and location of OpenSM libraries by scanning the lib and lib64 directories. Instead of hard-coding the dir name based on the uname -m we scan both giving the lib priority on the lib64. I applied it to the ibutils trunk. Please let me know see any issues with it. Eitan ---BeginMessage--- ---End Message--- ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [ibutils PATCH] automake required = 1.9.2
Hi The following patch remove the requirement for automake version to be 1.9.3 and up and instead allows it to be 1.9.2. I have applied the patch. Please let me know if you find any issue with this change. Thanks EZ ---BeginMessage--- Hi Eitan, I didn't find any problem with using automake 1.9.2. There is the patch. Sasha Index: autogen.sh === --- autogen.sh (revision 8160) +++ autogen.sh (working copy) @@ -20,15 +20,15 @@ am_min=`echo $am_ver|sed 's/.*\.\([^\.]*\)\..*/\1/'` am_sub=`echo $am_ver|sed 's/.*\.//'` if [[ $am_maj 1 ]]; then -echo Min automake version is 1.9.3 +echo Min automake version is 1.9.2 exit fi if [[ $am_maj = 1 $am_min 9 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi -if [[ $am_maj = 1 $am_min = 9 $am_sub 3 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +if [[ $am_maj = 1 $am_min = 9 $am_sub 2 ]]; then +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi Index: ibdm/autogen.sh === --- ibdm/autogen.sh (revision 8160) +++ ibdm/autogen.sh (working copy) @@ -23,15 +23,15 @@ am_min=`echo $am_ver|sed 's/.*\.\([^\.]*\)\..*/\1/'` am_sub=`echo $am_ver|sed 's/.*\.//'` if [[ $am_maj 1 ]]; then -echo Min automake version is 1.9.3 +echo Min automake version is 1.9.2 exit fi if [[ $am_maj = 1 $am_min 9 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi -if [[ $am_maj = 1 $am_min = 9 $am_sub 3 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +if [[ $am_maj = 1 $am_min = 9 $am_sub 2 ]]; then +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi # make sure libtool is up-to-date Index: ibis/autogen.sh === --- ibis/autogen.sh (revision 8160) +++ ibis/autogen.sh (working copy) @@ -22,15 +22,15 @@ am_min=`echo $am_ver|sed 's/.*\.\([^\.]*\)\..*/\1/'` am_sub=`echo $am_ver|sed 's/.*\.//'` if [[ $am_maj 1 ]]; then -echo Min automake version is 1.9.3 +echo Min automake version is 1.9.2 exit fi if [[ $am_maj = 1 $am_min 9 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi -if [[ $am_maj = 1 $am_min = 9 $am_sub 3 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +if [[ $am_maj = 1 $am_min = 9 $am_sub 2 ]]; then +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi # make sure libtool is up-to-date Index: ibdiag/autogen.sh === --- ibdiag/autogen.sh (revision 8160) +++ ibdiag/autogen.sh (working copy) @@ -23,15 +23,15 @@ am_min=`echo $am_ver|sed 's/.*\.\([^\.]*\)\..*/\1/'` am_sub=`echo $am_ver|sed 's/.*\.//'` if [[ $am_maj 1 ]]; then -echo Min automake version is 1.9.3 +echo Min automake version is 1.9.2 exit fi if [[ $am_maj = 1 $am_min 9 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi -if [[ $am_maj = 1 $am_min = 9 $am_sub 3 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +if [[ $am_maj = 1 $am_min = 9 $am_sub 2 ]]; then +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi # make sure libtool is up-to-date Index: ibmgtsim/autogen.sh === --- ibmgtsim/autogen.sh (revision 8160) +++ ibmgtsim/autogen.sh (working copy) @@ -22,15 +22,15 @@ am_min=`echo $am_ver|sed 's/.*\.\([^\.]*\)\..*/\1/'` am_sub=`echo $am_ver|sed 's/.*\.//'` if [[ $am_maj 1 ]]; then -echo Min automake version is 1.9.3 +echo Min automake version is 1.9.2 exit fi if [[ $am_maj = 1 $am_min 9 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi -if [[ $am_maj = 1 $am_min = 9 $am_sub 3 ]]; then -echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.3 +if [[ $am_maj = 1 $am_min = 9 $am_sub 2 ]]; then +echo automake version is too old:$am_maj.$am_min.$am_sub required 1.9.2 exit fi # make sure libtool is up-to-date ---End Message--- ___
Re: [openib-general] [librdmacm] rping
I was checking rping with the latest stack. The client exits normally, the server still hangs after printing the cq status. server ping data: rdma-ping-9: JKLMNOPQRSTU server DISCONNECT EVENT... wait for RDMA_READ_ADV state 9 cq completion failed status 5 When I kill the process and restart the server I get the following error: rdma_bind_addr error -1 Thanks, Amith ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [PATCH][MINOR] OpenSM/SA client: In osm_vendor_ibumad_sa.c:osmv_query_sa, eliminate redundant code
OpenSM/SA client: In osm_vendor_ibumad_sa.c:osmv_query_sa, eliminate redundant code Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] Index: libvendor/osm_vendor_ibumad_sa.c === --- libvendor/osm_vendor_ibumad_sa.c(revision 8174) +++ libvendor/osm_vendor_ibumad_sa.c(working copy) @@ -655,7 +655,6 @@ osmv_query_sa( case OSMV_QUERY_ALL_SVC_RECS: osm_log( p_log, OSM_LOG_DEBUG, osmv_query_sa DBG:001 %s, SVC_REC_BY_NAME\n ); -sa_mad_data.method = IB_MAD_METHOD_GETTABLE; sa_mad_data.attr_id = IB_MAD_ATTR_SERVICE_RECORD; sa_mad_data.attr_offset = ib_get_attr_offset( sizeof( ib_service_record_t ) ); @@ -701,7 +700,6 @@ osmv_query_sa( case OSMV_QUERY_NODE_REC_BY_NODE_GUID: osm_log( p_log, OSM_LOG_DEBUG, osmv_query_sa DBG:001 %s,NODE_REC_BY_NODE_GUID\n ); -sa_mad_data.method = IB_MAD_METHOD_GETTABLE; sa_mad_data.attr_id = IB_MAD_ATTR_NODE_RECORD; sa_mad_data.attr_offset = ib_get_attr_offset( sizeof( ib_node_record_t ) ); ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] SLES9 SP3 support was added
Hi All, We have added support for SLES9 SP3 that can be used with OFED 1.0. The kernel modules supported are: * mthca * core * CM CMA * IPoIB * SRP All user level apps and libraries are working too. CPU Architectures supported: * x86 * x86_64 * ia64 The backport patches are available at: https://openib.org/svn/gen2/branches/1.0/ofed/patches/2.6.5-7.244/ There is also a need to take the updated configure and install.sh that add SLES9 specific support. There are no other changes in the package beside these. Is there a need to create a package (1.0.1) with SLES9 support? Tziporet ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] OFED 1.0 - Official Release (Tziporet Koren)
zhu shi song wrote: I'm sorry SDP is not in production state. SDP is very important for our application and we are waiting it mature enough to be used in our product. And do you have any schedule to let SDP work ok(especially can support many large concurrent connections just like TCP)? I very appreciate I can test new SDP before end of June. tks zhu The plan is to have a stable SDP in 1.1 release. The schedule of 1.1 is end of July in the best case (more likely it will be mid-Aug) However we will have RCs before this and we can let you know when many large concurrent connections are supported. Tziporet ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] SLES9 SP3 support was added
Hi All, We have added support for SLES9 SP3 that can be used with OFED 1.0. The kernel modules supported are: * mthca * core * CM CMA * IPoIB * SRP All user level apps and libraries are working too. CPU Architectures supported: * x86 * x86_64 * ia64 The backport patches are available at: https://openib.org/svn/gen2/branches/1.0/ofed/patches/2.6.5-7.244/ There is also a need to take the updated configure and install.sh that add SLES9 specific support. There are no other changes in the package beside these. Is there a need to create a package (1.0.1) with SLES9 support? Tziporet ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [librdmacm] rping
On Thu, 2006-06-22 at 09:08 -0400, amith rajith mamidala wrote: The client exits normally, the server still hangs after printing the cq status. server ping data: rdma-ping-9: JKLMNOPQRSTU server DISCONNECT EVENT... wait for RDMA_READ_ADV state 9 cq completion failed status 5 When I kill the process and restart the server I get the following error: rdma_bind_addr error -1 what svn revision? What transport? ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [GIT PULL] please pull infiniband.git
Linus, please pull from master.kernel.org:/pub/scm/linux/kernel/git/roland/infiniband.git for-linus This tree is also available from kernel.org mirrors at: git://git.kernel.org/pub/scm/linux/kernel/git/roland/infiniband.git for-linus This is mostly merging the new iSER (iSCSI over RDMA transport) initiator: Krishna Kumar: IB/uverbs: Don't free wr list when it's known to be empty Or Gerlitz: IB/iser: iSCSI iSER transport provider header file IB/iser: iSCSI iSER transport provider high level code IB/iser: iSER initiator iSCSI PDU and TX/RX IB/iser: iSER RDMA CM (CMA) and IB verbs interaction IB/iser: iSER handling of memory for RDMA IB/iser: iSER Kconfig and Makefile Roland Dreier: IB/uverbs: Remove unnecessary list_del()s drivers/infiniband/Kconfig |2 drivers/infiniband/Makefile |1 drivers/infiniband/core/uverbs_cmd.c |2 drivers/infiniband/core/uverbs_main.c|6 drivers/infiniband/ulp/iser/Kconfig | 11 drivers/infiniband/ulp/iser/Makefile |4 drivers/infiniband/ulp/iser/iscsi_iser.c | 790 + drivers/infiniband/ulp/iser/iscsi_iser.h | 354 +++ drivers/infiniband/ulp/iser/iser_initiator.c | 738 +++ drivers/infiniband/ulp/iser/iser_memory.c| 401 + drivers/infiniband/ulp/iser/iser_verbs.c | 827 ++ drivers/scsi/Makefile|1 12 files changed, 3130 insertions(+), 7 deletions(-) create mode 100644 drivers/infiniband/ulp/iser/Kconfig create mode 100644 drivers/infiniband/ulp/iser/Makefile create mode 100644 drivers/infiniband/ulp/iser/iscsi_iser.c create mode 100644 drivers/infiniband/ulp/iser/iscsi_iser.h create mode 100644 drivers/infiniband/ulp/iser/iser_initiator.c create mode 100644 drivers/infiniband/ulp/iser/iser_memory.c create mode 100644 drivers/infiniband/ulp/iser/iser_verbs.c ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] IPoIB multicast
Can some one please explain the details of the IPoIB multicast. Or if there is some previous discussion or documentation about that can I get a pointer? In particular I am looking to understand the details of initiation of multicast join throug ipoib_send() and the join completion appears to happen through a MAD callback. How are the corresponding skbs freed? Why is the tx_ring used for send and what is the mcast-pkt_queue used for. Thanks for all the help. Pradeep [EMAIL PROTECTED]___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] Fw: IPoIB multicast
I am not sure if this mail got sent out. Please ignore if it is a duplicate. Pradeep [EMAIL PROTECTED] - Forwarded by Pradeep Satyanarayana/Beaverton/IBM on 06/22/2006 08:50 AM - Pradeep Satyanarayana/Beaverton/IBM 06/21/2006 10:28 PM To openib-general@openib.org cc Subject IPoIB multicast Can some one please explain the details of the IPoIB multicast. Or if there is some previous discussion or documentation about that can I get a pointer? In particular I am looking to understand the details of initiation of multicast join throug ipoib_send() and the join completion appears to happen through a MAD callback. How are the corresponding skbs freed? Why is the tx_ring used for send and what is the mcast-pkt_queue used for. Thanks for all the help. Pradeep [EMAIL PROTECTED]___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] uCMA kernel slab corruption and oops
Sean, I am running a couple of iMPI/uDAPL benchmarks at the same time and ran into this: (2.6.17 kernel and svn8112) Jun 22 10:46:51 localhost kernel: Slab corruption: start=8100202458f8, len=512 Jun 22 10:46:51 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 10:46:51 localhost kernel: Last user: [8807fc41](rdma_destroy_id+0x188/0x193 [rdma_cm]) Jun 22 10:46:51 localhost kernel: 0f0: 6b 6b 6b 6b 6b 6b 6b 6b 18 be 2d 37 00 81 ff ff Jun 22 10:46:51 localhost kernel: Prev obj: start=8100202456e0, len=512 Jun 22 10:46:51 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 10:46:51 localhost kernel: Last user: [88086599](ucma_get_event+0x202/0x21f [rdma_ucm]) Jun 22 10:46:51 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:51 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:51 localhost kernel: Next obj: start=810020245b10, len=512 Jun 22 10:46:51 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 10:46:51 localhost kernel: Last user: [804762c2](skb_release_data+0x92/0x97) Jun 22 10:46:51 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:51 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:53 localhost kernel: Slab corruption: start=8100202458f8, len=512 Jun 22 10:46:53 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 10:46:53 localhost kernel: Last user: [804762c2](skb_release_data+0x92/0x97) Jun 22 10:46:53 localhost kernel: 0f0: 40 5c 3c 18 00 81 ff ff 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:53 localhost kernel: Prev obj: start=8100202456e0, len=512 Jun 22 10:46:53 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 10:46:53 localhost kernel: Last user: [88086599](ucma_get_event+0x202/0x21f [rdma_ucm]) Jun 22 10:46:53 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:53 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:53 localhost kernel: Next obj: start=810020245b10, len=512 Jun 22 10:46:53 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 10:46:53 localhost kernel: Last user: [804762c2](skb_release_data+0x92/0x97) Jun 22 10:46:53 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 10:46:53 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:01:01 localhost kernel: Slab corruption: start=8100202458f8, len=512 Jun 22 11:01:01 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 11:01:01 localhost kernel: Last user: [88069831](ib_destroy_cm_id+0x23b/0x246 [ib_cm]) Jun 22 11:01:01 localhost kernel: 0f0: d0 79 4c 2d 00 81 ff ff 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:01:01 localhost kernel: Prev obj: start=8100202456e0, len=512 Jun 22 11:01:01 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 11:01:01 localhost kernel: Last user: [802a1a8e](load_elf_interp+0x411/0x423) Jun 22 11:01:01 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:01:01 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:01:01 localhost kernel: Next obj: start=810020245b10, len=512 Jun 22 11:01:01 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 11:01:01 localhost kernel: Last user: [804762c2](skb_release_data+0x92/0x97) Jun 22 11:01:01 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:01:01 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:22:33 localhost kernel: Slab corruption: start=8100202458f8, len=512 Jun 22 11:22:33 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 11:22:33 localhost kernel: Last user: [802a1a8e](load_elf_interp+0x411/0x423) Jun 22 11:22:33 localhost kernel: 0f0: a0 83 9e 21 00 81 ff ff 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:22:33 localhost kernel: Prev obj: start=8100202456e0, len=512 Jun 22 11:22:33 localhost kernel: Redzone: 0x170fc2a5/0x170fc2a5. Jun 22 11:22:33 localhost kernel: Last user: [880346bb](mthca_create_qp+0x48/0x275 [ib_mthca]) Jun 22 11:22:33 localhost kernel: 000: 00 40 6a 3d 00 81 ff ff 38 96 d4 3a 00 81 ff ff Jun 22 11:22:33 localhost kernel: 010: 48 15 64 29 00 81 ff ff 48 15 64 29 00 81 ff ff Jun 22 11:22:33 localhost kernel: Next obj: start=810020245b10, len=512 Jun 22 11:22:33 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 11:22:33 localhost kernel: Last user: [802a1a8e](load_elf_interp+0x411/0x423) Jun 22 11:22:33 localhost kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:22:33 localhost kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b Jun 22 11:22:43 localhost kernel: Slab corruption: start=8100202458f8, len=512 Jun 22 11:22:43 localhost kernel: Redzone: 0x5a2cf071/0x5a2cf071. Jun 22 11:22:43 localhost kernel: Last user:
Re: [openib-general] [PATCH] uDAPL dapl_evd_connection_callback does not support TIMED_OUT event
James, Added support for active side TIMED_OUT event from a provider. Signed-off by: Arlin Davis [EMAIL PROTECTED] Index: dapl/common/dapl_evd_connection_callb.c === --- dapl/common/dapl_evd_connection_callb.c (revision 8166) +++ dapl/common/dapl_evd_connection_callb.c (working copy) @@ -162,48 +162,15 @@ dapl_evd_connection_callback ( break; } case DAT_CONNECTION_EVENT_DISCONNECTED: - { - /* -* EP is now fully disconnected; initiate any post processing -* to reset the underlying QP and get the EP ready for -* another connection -*/ - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_PEER_REJECTED: - { - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_UNREACHABLE: - { - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_NON_PEER_REJECTED: - { - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_BROKEN: + case DAT_CONNECTION_EVENT_TIMED_OUT: { ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; dapls_ib_disconnect_clean (ep_ptr, DAT_FALSE, ib_cm_event); dapl_os_unlock ( ep_ptr-header.lock ); - break; } case DAT_CONNECTION_REQUEST_EVENT: ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [librdmacm] rping
amith rajith mamidala wrote: I was checking rping with the latest stack. The client exits normally, the server still hangs after printing the cq status. I have seen this happening in the following two scenarios : (1) server exits before the client - The client prints the following errors and stays there client DISCONNECT EVENT... cq completion failed status 5 client: post send error 22 (2) client exits before the server - The o/p is same as what you get. This behaviour is because of the way cm_thread() and cq_thread() functions are written. I have coded a fix for this. Will send it tomorrow after some more testing. server ping data: rdma-ping-9: JKLMNOPQRSTU server DISCONNECT EVENT... wait for RDMA_READ_ADV state 9 cq completion failed status 5 When I kill the process and restart the server I get the following error: rdma_bind_addr error -1 You will be able to kill only the rping process. If you look at the 'ps ax' output you will see that lt-rping is in the 'D' state. Hence the bind error. Only reboot helps Thanks, Pradipta Kumar. Thanks, Amith ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [PATCH] rping.c: Fix hang if either the server or the client exits
early Reply-To: [EMAIL PROTECTED] This patch fixes the problem as reported by Amith. Signed-off-by: Pradipta Kumar Banerjee [EMAIL PROTECTED] --- Index: rping.c = --- rping.c.org 2006-06-23 00:22:17.0 +0530 +++ rping.c 2006-06-23 00:39:06.0 +0530 @@ -215,6 +215,7 @@ static int rping_cma_event_handler(struc case RDMA_CM_EVENT_DISCONNECTED: fprintf(stderr, %s DISCONNECT EVENT...\n, cb-server ? server : client); sem_post(cb-sem); + ret = -1; break; case RDMA_CM_EVENT_DEVICE_REMOVAL: ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] resend [PATCH] rping.c: Fix hang if either the server or the client exits early
Hi, Please ignore the earlier mail. There were some problems with the mailer. Here is the new one. This patch fixes the problem as reported by Amith. Signed-off-by: Pradipta Kumar Banerjee [EMAIL PROTECTED] --- Index: rping.c = --- rping.c.org 2006-06-23 00:22:17.0 +0530 +++ rping.c 2006-06-23 00:39:06.0 +0530 @@ -215,6 +215,7 @@ static int rping_cma_event_handler(struc case RDMA_CM_EVENT_DISCONNECTED: fprintf(stderr, %s DISCONNECT EVENT...\n, cb-server ? server : client); sem_post(cb-sem); + ret = -1; break; case RDMA_CM_EVENT_DEVICE_REMOVAL: ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [PATCH][TRIVIAL] librdmacm/examples/udaddy.c: Fix example name in messages
librdmacm/examples/udaddy.c: Fix example name in messages Signed-off-by: Hal Rosenstock [EMAIL PROTECTED] Index: ../../librdmacm/examples/udaddy.c === --- ../../librdmacm/examples/udaddy.c (revision 8166) +++ ../../librdmacm/examples/udaddy.c (working copy) @@ -47,8 +47,8 @@ /* * To execute: - * Server: rdma_cmatose - * Client: rdma_cmatose dst_ip=ip + * Server: udaddy + * Client: udaddy [server_addr [src_addr]] */ struct cmatest_node { @@ -116,7 +116,7 @@ static int init_node(struct cmatest_node node-pd = ibv_alloc_pd(node-cma_id-verbs); if (!node-pd) { ret = -ENOMEM; - printf(cmatose: unable to allocate PD\n); + printf(udaddy: unable to allocate PD\n); goto out; } @@ -124,7 +124,7 @@ static int init_node(struct cmatest_node node-cq = ibv_create_cq(node-cma_id-verbs, cqe, node, 0, 0); if (!node-cq) { ret = -ENOMEM; - printf(cmatose: unable to create CQ\n); + printf(udaddy: unable to create CQ\n); goto out; } @@ -140,13 +140,13 @@ static int init_node(struct cmatest_node init_qp_attr.recv_cq = node-cq; ret = rdma_create_qp(node-cma_id, node-pd, init_qp_attr); if (ret) { - printf(cmatose: unable to create QP: %d\n, ret); + printf(udaddy: unable to create QP: %d\n, ret); goto out; } ret = create_message(node); if (ret) { - printf(cmatose: failed to create messages: %d\n, ret); + printf(udaddy: failed to create messages: %d\n, ret); goto out; } out: @@ -225,7 +225,7 @@ static int addr_handler(struct cmatest_n ret = rdma_resolve_route(node-cma_id, 2000); if (ret) { - printf(cmatose: resolve route failed: %d\n, ret); + printf(udaddy: resolve route failed: %d\n, ret); connect_error(); } return ret; @@ -250,7 +250,7 @@ static int route_handler(struct cmatest_ conn_param.retry_count = 5; ret = rdma_connect(node-cma_id, conn_param); if (ret) { - printf(cmatose: failure connecting: %d\n, ret); + printf(udaddy: failure connecting: %d\n, ret); goto err; } return 0; @@ -287,7 +287,7 @@ static int connect_handler(struct rdma_c conn_param.qp_type = node-cma_id-qp-qp_type; ret = rdma_accept(node-cma_id, conn_param); if (ret) { - printf(cmatose: failure accepting: %d\n, ret); + printf(udaddy: failure accepting: %d\n, ret); goto err2; } node-connected = 1; @@ -298,7 +298,7 @@ err2: node-cma_id = NULL; connect_error(); err1: - printf(cmatose: failing connection request\n); + printf(udaddy: failing connection request\n); rdma_reject(cma_id, NULL, 0); return ret; } @@ -351,7 +351,7 @@ static int cma_handler(struct rdma_cm_id case RDMA_CM_EVENT_CONNECT_ERROR: case RDMA_CM_EVENT_UNREACHABLE: case RDMA_CM_EVENT_REJECTED: - printf(cmatose: event: %d, error: %d\n, event-event, + printf(udaddy: event: %d, error: %d\n, event-event, event-status); connect_error(); ret = event-status; @@ -397,7 +397,7 @@ static int alloc_nodes(void) test.nodes = malloc(sizeof *test.nodes * connections); if (!test.nodes) { - printf(cmatose: unable to allocate memory for test nodes\n); + printf(udaddy: unable to allocate memory for test nodes\n); return -ENOMEM; } memset(test.nodes, 0, sizeof *test.nodes * connections); @@ -449,7 +449,7 @@ static int poll_cqs(void) for (done = 0; done message_count; done += ret) { ret = ibv_poll_cq(test.nodes[i].cq, 8, wc); if (ret 0) { - printf(cmatose: failed polling CQ: %d\n, ret); + printf(udaddy: failed polling CQ: %d\n, ret); return ret; } @@ -480,10 +480,10 @@ static int run_server(void) struct rdma_cm_id *listen_id; int i, ret; - printf(cmatose: starting server\n); + printf(udaddy: starting server\n); ret = rdma_create_id(test.channel, listen_id, test, RDMA_PS_UDP); if (ret) { - printf(cmatose: listen request failed\n); + printf(udaddy: listen request failed\n); return ret; } @@ -491,13 +491,13 @@ static int run_server(void) test.src_in.sin_port = 7174; ret = rdma_bind_addr(listen_id, test.src_addr); if (ret) { - printf(cmatose: bind
Re: [openib-general] resend [PATCH] rping.c: Fix hang if either the server or the client exits early
The goal of adding the return codes was so that the rping program could exit with a status indicating success or failure. Every rping run results in a DISCONNECT event, so I don't think we want to treat that case as an error. Also, can you explain why thi fixes Amith's problem, which sounded like a process was hanging? Thanks, Steve. On Fri, 2006-06-23 at 00:53 +0530, Pradipta Kumar Banerjee wrote: Hi, Please ignore the earlier mail. There were some problems with the mailer. Here is the new one. This patch fixes the problem as reported by Amith. Signed-off-by: Pradipta Kumar Banerjee [EMAIL PROTECTED] --- Index: rping.c = --- rping.c.org 2006-06-23 00:22:17.0 +0530 +++ rping.c 2006-06-23 00:39:06.0 +0530 @@ -215,6 +215,7 @@ static int rping_cma_event_handler(struc case RDMA_CM_EVENT_DISCONNECTED: fprintf(stderr, %s DISCONNECT EVENT...\n, cb-server ? server : client); sem_post(cb-sem); + ret = -1; break; case RDMA_CM_EVENT_DEVICE_REMOVAL: ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [PATCH] uDAPL cma - event processing bug
On Wed, 21 Jun 2006, Arlin Davis wrote: James, Fix bug in dapls_ib_get_dat_event() call after adding new unreachable event. Committed in revision 8180. ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [PATCH] uDAPL dapl_evd_connection_callback does not support TIMED_OUT event
On Thu, 22 Jun 2006, Arlin Davis wrote: James, Added support for active side TIMED_OUT event from a provider. Committed revision 8181, but with the different flag values retained: Index: dapl/common/dapl_evd_connection_callb.c === --- dapl/common/dapl_evd_connection_callb.c (revision 8109) +++ dapl/common/dapl_evd_connection_callb.c (working copy) @@ -162,34 +162,8 @@ dapl_evd_connection_callback ( break; } case DAT_CONNECTION_EVENT_DISCONNECTED: - { - /* -* EP is now fully disconnected; initiate any post processing -* to reset the underlying QP and get the EP ready for -* another connection -*/ - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_PEER_REJECTED: - { - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_UNREACHABLE: - { - ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; - dapls_ib_disconnect_clean (ep_ptr, DAT_TRUE, ib_cm_event); - dapl_os_unlock (ep_ptr-header.lock); - - break; - } case DAT_CONNECTION_EVENT_NON_PEER_REJECTED: { ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; @@ -199,6 +173,7 @@ dapl_evd_connection_callback ( break; } case DAT_CONNECTION_EVENT_BROKEN: + case DAT_CONNECTION_EVENT_TIMED_OUT: { ep_ptr-param.ep_state = DAT_EP_STATE_DISCONNECTED; dapls_ib_disconnect_clean (ep_ptr, DAT_FALSE, ib_cm_event); ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] ipath verbs does not compile against the latest SVN trunk verbs
On Tue, 2006-06-20 at 09:55 -0700, Woodruff, Robert J wrote: When I try to build SVN 8112 I get the following errors trying to build the ipath verbs. We're a bit out of date on the trunk. We'll be syncing it up RSN. b ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] [PATCH] uDAPL cma: lower debug level on consumer rejects
James, Lower the reject debug message level so we don't see warnings when consumers reject. Signed-off by: Arlin Davis [EMAIL PROTECTED] Index: dapl/openib_cma/dapl_ib_cm.c === --- dapl/openib_cma/dapl_ib_cm.c(revision 8166) +++ dapl/openib_cma/dapl_ib_cm.c(working copy) @@ -359,7 +359,7 @@ static void dapli_cm_active_cb(struct da cm_event = IB_CME_DESTINATION_REJECT; dapl_dbg_log( - DAPL_DBG_TYPE_WARN, + DAPL_DBG_TYPE_CM, dapli_cm_active_handler: REJECTED reason=%d\n, event-status); ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
[openib-general] OFED-1.0 fails install on AMD64
I just downloaded the OFED-1.0 and the install was going fine until ibutils. At that point, the install fails with : Open MPI RPM will be created during the installation process Building ibutils RPM. Please wait... Running rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm-ERROR: Failed executing "rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm" See log file: /tmp/OFED.28656.log I dug down into the log file it indicates and found : g++ -DHAVE_CONFIG_H -I. -I. -I.. -O2 -DIBDM_IBNL_DIR=\"/usr/local/ofed/lib64\" -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -MT ibnl_scanner.lo -MD -MP -MF .deps/ibnl_scanner.Tpo -c ibnl_scanner.cc -fPIC -DPIC -o .libs/ibnl_scanner.oibnl_scanner.ll: In function 'int ibnl_lex()':ibnl_scanner.ll:197: warning: ignoring return value of 'size_t fwrite(const void*, size_t, size_t, FILE*)', declared with attribute warn_unused_resultg++ -DHAVE_CONFIG_H -I. -I. -I.. -O2 -DIBDM_IBNL_DIR=\"/usr/local/ofed/lib64\" -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -MT ibnl_scanner.lo -MD -MP -MF .deps/ibnl_scanner.Tpo -c ibnl_scanner.cc -o ibnl_scanner.o /dev/null 21/bin/sh ../libtool --tag=CXX --mode=link g++ -O2 -DIBDM_IBNL_DIR='"/usr/local/ofed/lib64"' -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -o libibdmcom.la -rpath /usr/local/ofed/lib64 -version-info "2:1:1" Fabric.lo SubnMgt.lo TraceRoute.lo CredLoops.lo TopoMatch.lo SysDef.lo LinkCover.lo Congestion.lo ibnl_parser.lo ibnl_scanner.lo g++ -shared -nostdlib /usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64/crti.o /usr/lib/gcc/x86_64-redhat-linux/4.0.0/crtbeginS.o .libs/Fabric.o .libs/SubnMgt.o .libs/TraceRoute.o .libs/CredLoops.o .libs/TopoMatch.o .libs/SysDef.o .libs/LinkCover.o .libs/Congestion.o .libs/ibnl_parser.o .libs/ibnl_scanner.o -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0 -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64 -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../.. -L/lib/../lib64 -L/usr/lib/../lib64 -lstdc++ -lm -lc -lgcc_s /usr/lib/gcc/x86_64-redhat-linux/4.0.0/crtendS.o /usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64/crtn.o -m64 -mtune=nocona -Wl,-soname -Wl,libibdmcom.so.1 -o .libs/libibdmcom.so.1.1.1/usr/bin/ld: /usr/lib/gcc/x86_64-redhat-linux/4.0.0/libstdc++.a(mt_allocator.o): relocation R_X86_64_32 against `__gnu_internal::freelist_key' can not be used when making a shared object; recompile with -fPIC/usr/lib/gcc/x86_64-redhat-linux/4.0.0/libstdc++.a: could not read symbols: Bad valuecollect2: ld returned 1 exit statusmake[3]: *** [libibdmcom.la] Error 1make[3]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm/datamodel'make[2]: *** [all-recursive] Error 1make[2]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm'make[1]: *** [all] Error 2make[1]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm'make: *** [all-recursive] Error 1error: Bad exit status from /var/tmp/rpm-tmp.16738 (%install) RPM build errors: Bad exit status from /var/tmp/rpm-tmp.16738 (%install)ERROR: Failed executing "rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm" Can anyone shed any light on this ? Machine is dual Opteron, 2 gig memory, kernel 2.6.16 Don Snedigar Calpont Corp. 214-618-9516 ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [Bug 146] OFED-1.0 DAPL fails to build on SLES10 on IA64 with IA64_FETCHADD error
On Thu, 22 Jun 2006, [EMAIL PROTECTED] wrote: http://openib.org/bugzilla/show_bug.cgi?id=146 [EMAIL PROTECTED] changed: What|Removed |Added Status|NEW |ASSIGNED --- Comment #1 from [EMAIL PROTECTED] 2006-06-22 14:55 --- We have code in dapl/udapl/linux/dapl_osd.h that is supposed to handle this. It looks like this broke when we moved to the autotools. I'll send you a patch to test. Here's the patch. Thank you for offering to test this. Please let me if it fixes the problem (I do not have an IA64 SLES system). Index: Makefile.am === --- Makefile.am (revision 8109) +++ Makefile.am (working copy) @@ -1,10 +1,11 @@ # $Id: $ +OSFLAGS = -DOS_VERSION=$(shell expr `uname -r | cut -f1 -d.` \* 65536 + `uname -r | cut -f2 -d.`) # Check for RedHat, needed for ia64 udapl atomic operations (IA64_FETCHADD syntax) if OS_RHEL -OSFLAGS=-DREDHAT_EL4 +OSFLAGS += -DREDHAT_EL4 else -OSFLAGS= +OSFLAGS += endif if DEBUG ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] [PATCH] uDAPL cma: lower debug level on consumer rejects
On Thu, 22 Jun 2006, Arlin Davis wrote: James, Lower the reject debug message level so we don't see warnings when consumers reject. Committed in revision 8182. ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
Re: [openib-general] OFED-1.0 fails install on AMD64
Well taking a couple of stabs in the dark here. What version of redhat/fedora are you using ? I am using rhel 4 update 3 and it uses gcc version 3.4.5-2 by default. It appears as if your system is using 4.0.0. Also do you have any environment variables set ? Such as CFLAGS, CCFLAGS or the like ? For the record the only reason I mention gcc 4x is because it is the only time I have personally seen that error arise. On 6/22/06, Don Snedigar [EMAIL PROTECTED] wrote: I just downloaded the OFED-1.0 and the install was going fine until ibutils. At that point, the install fails with : Open MPI RPM will be created during the installation process Building ibutils RPM. Please wait... Running rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm-ERROR: Failed executing rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm See log file: /tmp/OFED.28656.log I dug down into the log file it indicates and found : g++ -DHAVE_CONFIG_H -I. -I. -I.. -O2 -DIBDM_IBNL_DIR=\/usr/local/ofed/lib64\ -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -MT ibnl_scanner.lo -MD -MP -MF .deps/ibnl_scanner.Tpo -c ibnl_scanner.cc -fPIC -DPIC -o .libs/ibnl_scanner.oibnl_scanner.ll: In function 'int ibnl_lex()':ibnl_scanner.ll:197: warning: ignoring return value of 'size_t fwrite(const void*, size_t, size_t, FILE*)', declared with attribute warn_unused_resultg++ -DHAVE_CONFIG_H -I. -I. -I.. -O2 -DIBDM_IBNL_DIR=\/usr/local/ofed/lib64\ -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -MT ibnl_scanner.lo -MD -MP -MF .deps/ibnl_scanner.Tpo -c ibnl_scanner.cc -o ibnl_scanner.o /dev/null 21/bin/sh ../libtool --tag=CXX --mode=link g++ -O2 -DIBDM_IBNL_DIR='/usr/local/ofed/lib64' -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -o libibdmcom.la -rpath /usr/local/ofed/lib64 -version-info 2:1:1 Fabric.lo SubnMgt.lo TraceRoute.lo CredLoops.lo TopoMatch.lo SysDef.lo LinkCover.lo Congestion.lo ibnl_parser.lo ibnl_scanner.lo g++ -shared -nostdlib /usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64/crti.o /usr/lib/gcc/x86_64-redhat-linux/4.0.0/crtbeginS.o .libs/Fabric.o .libs/SubnMgt.o .libs/TraceRoute.o .libs/CredLoops.o .libs/TopoMatch.o .libs/SysDef.o .libs/LinkCover.o .libs/Congestion.o .libs/ibnl_parser.o .libs/ibnl_scanner.o -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0 -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64 -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../.. -L/lib/../lib64 -L/usr/lib/../lib64 -lstdc++ -lm -lc -lgcc_s /usr/lib/gcc/x86_64-redhat-linux/4.0.0/crtendS.o /usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64/crtn.o -m64 -mtune=nocona -Wl,-soname -Wl,libibdmcom.so.1 -o .libs/libibdmcom.so.1.1.1/usr/bin/ld: /usr/lib/gcc/x86_64-redhat-linux/4.0.0/libstdc++.a(mt_allocator.o): relocation R_X86_64_32 against `__gnu_internal::freelist_key' can not be used when making a shared object; recompile with -fPIC/usr/lib/gcc/x86_64-redhat-linux/4.0.0/libstdc++.a: could not read symbols: Bad valuecollect2: ld returned 1 exit statusmake[3]: *** [libibdmcom.la] Error 1make[3]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm/datamodel'make[2]: *** [all-recursive] Error 1make[2]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm'make[1]: *** [all] Error 2make[1]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm'make: *** [all-recursive] Error 1error: Bad exit status from /var/tmp/rpm-tmp.16738 (%install) RPM build errors: Bad exit status from /var/tmp/rpm-tmp.16738 (%install)ERROR: Failed executing rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm Can anyone shed any light on this ? Machine is dual Opteron, 2 gig memory, kernel 2.6.16 Don Snedigar Calpont Corp.
Re: [openib-general] OFED-1.0 fails install on AMD64
Actually, its FSM Labs v 2.2.3 with the 2.6.16 kernel. We hadFC4 on the box, but then added RTLinuxPro on the box. Yes, gcc is version 4 (gcc --version gives 4.0.0 20050519 (Red Hat 4.0.0-8) Only environment variables set would be the ones that the install script sets itself.\ don From: Paul [mailto:[EMAIL PROTECTED] Sent: Thursday, June 22, 2006 5:16 PMTo: Don SnedigarCc: openib-general@openib.orgSubject: Re: [openib-general] OFED-1.0 fails install on AMD64 Well taking a couple of stabs in the dark here. What version of redhat/fedora are you using ? I am using rhel 4 update 3 and it uses gcc version 3.4.5-2 by default. It appears as if your system is using 4.0.0. Also do you have any environment variables set ? Such as CFLAGS, CCFLAGS or the like ? For the record the only reason I mention gcc 4x is because it is the only time I have personally seen that error arise. On 6/22/06, Don Snedigar [EMAIL PROTECTED] wrote: I just downloaded the OFED-1.0 and the install was going fine until ibutils. At that point, the install fails with : Open MPI RPM will be created during the installation process Building ibutils RPM. Please wait... Running rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm-ERROR: Failed executing "rpmbuild --rebuild --define '_topdir /var/tmp/OFEDRPM' --define 'configure_options --prefix=/usr/local/ofed --mandir=/usr/local/ofed/share/man --cache-file=/var/tmp/OFED/ibutils.cache --with-osm=/var/tmp/OFED/usr/local/ofed --enable-ibmgtsim' --define '_prefix /usr/local/ofed' --define '_libdir /usr/local/ofed/lib64' --define '_mandir %{_prefix}/share/man' --define 'build_root /var/tmp/OFED' /home/snedigar/OFED-1.0/SRPMS/ibutils-1.0-0.src.rpm" See log file: /tmp/OFED.28656.log I dug down into the log file it indicates and found : g++ -DHAVE_CONFIG_H -I. -I. -I.. -O2 -DIBDM_IBNL_DIR=\"/usr/local/ofed/lib64\" -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -MT ibnl_scanner.lo -MD -MP -MF .deps/ibnl_scanner.Tpo -c ibnl_scanner.cc -fPIC -DPIC -o .libs/ibnl_scanner.oibnl_scanner.ll: In function 'int ibnl_lex()':ibnl_scanner.ll:197: warning: ignoring return value of 'size_t fwrite(const void*, size_t, size_t, FILE*)', declared with attribute warn_unused_resultg++ -DHAVE_CONFIG_H -I. -I. -I.. -O2 -DIBDM_IBNL_DIR=\"/usr/local/ofed/lib64\" -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -MT ibnl_scanner.lo -MD -MP -MF .deps/ibnl_scanner.Tpo -c ibnl_scanner.cc -o ibnl_scanner.o /dev/null 21/bin/sh ../libtool --tag=CXX --mode=link g++ -O2 -DIBDM_IBNL_DIR='"/usr/local/ofed/lib64"' -I/usr/include -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -mtune=nocona -o libibdmcom.la -rpath /usr/local/ofed/lib64 -version-info "2:1:1" Fabric.lo SubnMgt.lo TraceRoute.lo CredLoops.lo TopoMatch.lo SysDef.lo LinkCover.lo Congestion.lo ibnl_parser.lo ibnl_scanner.lo g++ -shared -nostdlib /usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64/crti.o /usr/lib/gcc/x86_64-redhat-linux/4.0.0/crtbeginS.o .libs/Fabric.o .libs/SubnMgt.o .libs/TraceRoute.o .libs/CredLoops.o .libs/TopoMatch.o .libs/SysDef.o .libs/LinkCover.o .libs/Congestion.o .libs/ibnl_parser.o .libs/ibnl_scanner.o -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0 -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64 -L/usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../.. -L/lib/../lib64 -L/usr/lib/../lib64 -lstdc++ -lm -lc -lgcc_s /usr/lib/gcc/x86_64-redhat-linux/4.0.0/crtendS.o /usr/lib/gcc/x86_64-redhat-linux/4.0.0/../../../../lib64/crtn.o -m64 -mtune=nocona -Wl,-soname -Wl,libibdmcom.so.1 -o .libs/libibdmcom.so.1.1.1/usr/bin/ld: /usr/lib/gcc/x86_64-redhat-linux/4.0.0/libstdc++.a(mt_allocator.o): relocation R_X86_64_32 against `__gnu_internal::freelist_key' can not be used when making a shared object; recompile with -fPIC/usr/lib/gcc/x86_64-redhat-linux/4.0.0/libstdc++.a: could not read symbols: Bad valuecollect2: ld returned 1 exit statusmake[3]: *** [libibdmcom.la] Error 1make[3]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm/datamodel'make[2]: *** [all-recursive] Error 1make[2]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm'make[1]: *** [all] Error 2make[1]: Leaving directory `/var/tmp/OFEDRPM/BUILD/ibutils-1.0/ibdm'make: *** [all-recursive] Error 1error: Bad exit status from /var/tmp/rpm-tmp.16738 (%install) RPM build errors:
[openib-general] Disabling end-to-end flow control
Is there a way to disable end-to-end flowcontrol using any of the API's ?Thanks,-Viswa ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general