Re: [openib-general] I created a git tree for the libibverbs man pages

2007-02-21 Thread Tang, Changqing
Hi, Roland: What is the Max # of cards OFED driver/library can support on a single node ? Thanks. --CQ -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Roland Dreier Sent: Tuesday, February 20, 2007 6:12 PM To: Dotan Barak Cc:

Re: [openib-general] How heavy to resize a CQ ?

2007-02-15 Thread Tang, Changqing
Thanks for your good point. --CQ -Original Message- From: Roland Dreier [mailto:[EMAIL PROTECTED] Sent: Thursday, February 15, 2007 11:59 AM To: Tang, Changqing Cc: Michael S. Tsirkin; openib-general@openib.org Subject: Re: How heavy to resize a CQ ? In dynamic process

Re: [openib-general] Immediate data question

2007-02-12 Thread Tang, Changqing
1. process A and process B is connected with QP. A first post a send to B, B does not post receive. Then A and B are doing a long time RDMA_WRITE each other, A and B just check memory for the RDMA_WRITE message. Finally B will post a receive. Does the first pending send in A

Re: [openib-general] Immediate data question

2007-02-07 Thread Tang, Changqing
[mailto:[EMAIL PROTECTED] Sent: Monday, February 05, 2007 5:03 PM To: Tang, Changqing Cc: Michael S. Tsirkin; openib-general@openib.org Subject: Re: Immediate data question Changqing Thank you. Other than using immediate data to send Changqing notification from one end to the other

Re: [openib-general] Immediate data question

2007-02-07 Thread Tang, Changqing
ChangqingDoes this pending SEND_WITH_IMM message affect the Changqing performance of the receiver process ? Is this message Changqing buffered in the receiver's HCA, or the sender retry and Changqing get RNR ack until receiver posts a receive ? If no receive is

[openib-general] Immediate data question

2007-02-05 Thread Tang, Changqing
Roland: If I only want to send/recv 4 bytes with immediate data: On sender side: opcode = IBV_WR_SEND_WITH_IMM; imm_data = my_4_bytes_data; Do I still need to specify sg_list and num_sge ? On receiver side, because the immediate data is inside the completion

Re: [openib-general] Immediate data question

2007-02-05 Thread Tang, Changqing
Message- From: Roland Dreier [mailto:[EMAIL PROTECTED] Sent: Monday, February 05, 2007 4:09 PM To: Tang, Changqing Cc: Michael S. Tsirkin; openib-general@openib.org Subject: Re: Immediate data question If I only want to send/recv 4 bytes with immediate data: I assume you mean

Re: [openib-general] git question

2007-01-25 Thread Tang, Changqing
I have trouble to get the latest OFED 1.2 source because of the firewall or other reason I don't know. Is there a checkout version somewhere I can access, for example. Daily build version ? Thanks. --CQ -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf

Re: [openib-general] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze

2007-01-22 Thread Tang, Changqing
PM To: Tang, Changqing Cc: Tziporet Koren; EWG; OPENIB Subject: Re: [openib-general] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze We need this compatible feature very much so I hope you double confirm this. Can app build with 1.0

Re: [openib-general] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze

2007-01-22 Thread Tang, Changqing
when RDMA is used, a message is transferred from card A (in node A) to card B (in node B), card B delivers the message to to user buffer, and sends ACK to card A, but ACK is lost due to switch fail. So process on node A get fail for this transfer, but process on node B check the

Re: [openib-general] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze

2007-01-16 Thread Tang, Changqing
-Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Roland Dreier Sent: Tuesday, January 16, 2007 1:11 PM To: Tziporet Koren Cc: EWG; OPENIB Subject: Re: [openib-general] Minutes for January 15, 2007 teleconference about OFED 1.2 development

Re: [openib-general] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze

2007-01-16 Thread Tang, Changqing
+ libibverbs 1.0 to be included as well I believe I have a way to make libibverbs 1.1 binary-compatible with applications (but not drivers) built against libibverbs 1.0. So it will probably not be necessary to include libibverbs 1.0. We need this compatible feature

Re: [openib-general] [RFC] userspace IB SA support

2007-01-09 Thread Tang, Changqing
Where do you put these new user functions ? Do you create a new library ? I hope not to create a new library, we already have so many libraries now, it is hard to manage For users using dlopen(). Also when is it available ? Thanks. --CQ -Original Message- From: [EMAIL PROTECTED]

Re: [openib-general] using IB on a port without IPoIB running NIC

2007-01-08 Thread Tang, Changqing
: Or Gerlitz [mailto:[EMAIL PROTECTED] Sent: Monday, January 08, 2007 1:18 AM To: Tang, Changqing Cc: openib-general@openib.org Subject: using IB on a port without IPoIB running NIC Tang, Changqing wrote: We understand that, but we hope to have a connect/accept style IB connection setup, without

Re: [openib-general] using IB on a port without IPoIB running NIC

2007-01-08 Thread Tang, Changqing
: Monday, January 08, 2007 8:18 AM To: Tang, Changqing Cc: Or Gerlitz; openib-general@openib.org Subject: Re: using IB on a port without IPoIB running NIC Thank you for the information, I may change my mind to require IPoIB to run newer version of HP-MPI on OFED 1.2, if I don't find

Re: [openib-general] using IB on a port without IPoIB running NIC

2007-01-08 Thread Tang, Changqing
What I need is that, without IPoIB, how do I wire IB connection ? Currently with Verbs API, it is an alltoall QP number exchange. I want to remove the alltoall QP number exchange in MPI dynamic process. Well, does your MPI implementation currently use librdmacm? No, we don't use both

Re: [openib-general] using IB on a port without IPoIB running NIC

2007-01-08 Thread Tang, Changqing
As I said, the problem is the alltoall QP number exchange. I hope that a process can only provide one piece of information(such as ip/port in TCP/IP) so that all other processes have the same piece of info and can make connection to it. Well, start with a socket, each time a

Re: [openib-general] APM support in openib stack

2006-10-25 Thread Tang, Changqing
How do you know that the old path is back ? Do you have a notification handler called, or you need to query periodically ? Thanks. --CQ another question: If one brings back the old path (after the first failure) and use the old path record to do lap/apr then reenabling migration using

Re: [openib-general] APM support in openib stack

2006-10-25 Thread Tang, Changqing
? If path 1 is down, we can not set it as alternate path, right ? If we can not bring a path back on fly, the usage of APM is limited. --CQ -Original Message- From: Sean Hefty [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 25, 2006 11:41 AM To: Tang, Changqing Cc: [EMAIL PROTECTED

Re: [openib-general] OpenFabrics Developer Summit at SC06, Tampa Nov 16 - 17

2006-10-18 Thread Tang, Changqing
Has the registration site been set up ? --CQ -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Jeff Squyres Sent: Tuesday, October 17, 2006 6:57 AM To: Bill Boas Cc: Open Fabrics; openib-general@openib.org; [EMAIL PROTECTED] Subject: Re:

Re: [openib-general] [openfabrics-ewg] OFED 1.1 RC7 fork() issue.

2006-10-17 Thread Tang, Changqing
What kernel are you testing? system() should work (in non threaded apps at least) with modern kernel. -- Gleb. ___ From the OFED release notes: 3. Fork support from kernel 2.6.12 and above is available provided

Re: [openib-general] [openfabrics-ewg] OFED 1.1 RC7 fork() issue.

2006-10-17 Thread Tang, Changqing
3. Fork support from kernel 2.6.12 and above is available provided that applications do not use threads. The fork() is supported as long as parent process does not run before child exits or calls exec(). After fork(), in child, before exec(), can we call printf(), putenv(), or even re-direct

Re: [openib-general] [openfabrics-ewg] OFED 1.1 RC7 fork() issue.

2006-10-17 Thread Tang, Changqing
Thanks for the clarification. --CQ You need to make a difference between full fork support that will be available only in libibverbs1.1 and the system /fork exec fork support that is depend on the kernel only and available from kernel 2.6.12. See also the explanation from Gleb on this

Re: [openib-general] [openfabrics-ewg] OFED 1.1 RC7 fork() issue.

2006-10-16 Thread Tang, Changqing
We tested RC7, but fork() does not work: 1. system() causes IB to fail. 2. fork(), child calling exit(0) immediately also causes IB to fail. Anyone has tested fork() related issue ? --CQ Tang, HP-MPI -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of

Re: [openib-general] Reuse pd amd mr

2006-09-18 Thread Tang, Changqing
No, a PD belongs to a specific device. However nothing prevents you from creating one PD for each device, and two MRs (one for each device, each using one of those two PDs) that cover the same memory. Roland: I did exactly what you said with two cards on a node, however, if I use the

Re: [openib-general] Reuse pd amd mr

2006-09-18 Thread Tang, Changqing
'retries exceeded' means that the transport retry count was exceeded, so most likely your timeout is set too low. Is there a common recommended value for this timeout ? I use 18, which represents 1 second. Without seeing your code, I couldn't begin to say why you don't see a send

Re: [openib-general] Reuse pd amd mr

2006-09-18 Thread Tang, Changqing
Changqing Is there a common recommended value for this timeout ? Changqing I use 18, which represents 1 second. 18 should be OK I guess, unless you have congestion in your fabric, in which case you have other problems anyway. Changqing It is very hard to reproduce this error with

Re: [openib-general] basic IB doubt

2006-08-23 Thread Tang, Changqing
structure, the other copy all data to provided dest buffer. --CQ -Original Message- From: Ralph Campbell [mailto:[EMAIL PROTECTED] Sent: Wednesday, August 23, 2006 1:16 PM To: Tang, Changqing Cc: Caitlin Bestler; openib-general@openib.org Subject: RE: [openib-general] basic IB doubt