Re: [OMPI devel] openib btl header caching

2007-08-13 Thread Galen Shipman
Ok here is the numbers on my machines: 0 bytes mvapich with header caching: 1.56 mvapich without header caching: 1.79 ompi 1.2: 1.59 So on zero bytes ompi not so bad. Also we can see that header caching decrease the mvapich latency on 0.23 1 bytes mvapich with header caching: 1.58 mvapich

Re: [OMPI devel] openib btl header caching

2007-08-13 Thread Galen Shipman
I think we need to take a step back from micro-optimizations such as header caching. Rich, George, Brian and I are currently looking into latency improvements. We came up with several areas of performance enhancements that can be done with minimal disruption. The progress issue that Chr

[OMPI devel] OMPI_FREE_LIST improvements

2007-07-12 Thread Galen Shipman
In working on my changes in the ib_multifrag branch I modified the ompi_free_list. The change enables a free list to have a bit more personality than what is dictated by the type of the item on the free list. The overall problem was that we often use different free list item types to simply

Re: [OMPI devel] OpenIB BTL and SRQs

2007-07-12 Thread Galen Shipman
On Jul 12, 2007, at 10:29 AM, Don Kerr wrote: Through mca parameters one can select the use of shared receive queues in the openib btl, other than having fewer queues I am wondering what are the benefits of using this option. Can anyone eleborate on using them vs the default? In the trunk the

Re: [OMPI devel] [devel-core] Collective Communications Optimization - MeetingScheduled in Albuquerque!

2007-07-10 Thread Galen Shipman
...@open-mpi.org] On Behalf Of Galen Shipman Sent: ב 09 יולי 2007 15:44 To: Open MPI Developers Subject: Re: [devel-core] Collective Communications Optimization - MeetingScheduled in Albuquerque! All, I have confirmed the meeting to be held at the HPC facility at UNM on Aug 6,7,8. Here is a lin

Re: [OMPI devel] flags in openib btl

2007-06-16 Thread Galen Shipman
These two: MCA_BTL_FLAGS_NEED_ACK MCA_BTL_FLAGS_NEED_CSUM Are used by DR. They aren't used by OB1. - Galen On Jun 15, 2007, at 9:27 AM, Jeff Squyres wrote: I notice that our help message for the btl_openib_flags MCA parameter seems to be a bit out of date: CHECK(reg_int("flags", "BTL fla

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-14 Thread Galen Shipman
The patch applies to ib_multifrag as is without a conflict. But the branch doesn't compile with or without the patch so I was not able to test it. Do you have some uncommitted changes that may generate a conflict? Can you commit them so they can be resolved? If there is no conflict between

Re: [OMPI devel] openib coord teleconf (was: Problem with openib on demand connection bring up)

2007-06-13 Thread Galen Shipman
On Jun 13, 2007, at 12:52 PM, Gleb Natapov wrote: On Wed, Jun 13, 2007 at 02:48:02PM -0400, Jeff Squyres wrote: On Jun 13, 2007, at 2:41 PM, Gleb Natapov wrote: Pasha tells me that the best times for Ishai and him are: - 2000-2030 Israel time - 1300-1300 US Eastern - 1100-1130 US Mountain -

Re: [OMPI devel] openib coord teleconf (was: Problem with openib on demand connection bring up)

2007-06-13 Thread Galen Shipman
On Jun 13, 2007, at 12:23 PM, Jeff Squyres wrote: On Jun 13, 2007, at 1:40 PM, Gleb Natapov wrote: [snip] coordination kind of teleconference. If people think this is a good idea, I can setup the call. sounds good to me. Sounds good to me to. Pasha also works on async event thread. Thi

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-13 Thread Galen Shipman
On Jun 13, 2007, at 12:07 PM, Gleb Natapov wrote: On Wed, Jun 13, 2007 at 02:05:00PM -0400, Jeff Squyres wrote: On Jun 13, 2007, at 1:54 PM, Jeff Squyres wrote: With today's trunk, I still see the problem: Same thing happens on v1.2 branch. I'll re-open #548. I am sure it was never test

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-13 Thread Galen Shipman
On Jun 13, 2007, at 11:33 AM, Jeff Squyres wrote: On Jun 13, 2007, at 1:15 PM, Nysal Jan wrote: There is a ticket (closed) here: https://svn.open-mpi.org/trac/ompi/ ticket/548 It was fixed by Galen for 1.2. Ah -- I forgot to look at closed tickets. I think we broke it again; it certainly f

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-13 Thread Galen Shipman
On Jun 13, 2007, at 11:15 AM, Nysal Jan wrote: I was just bitten yesterday by a problem that I've known about for a while but had never gotten around to looking into (I could have sworn that there was an open trac ticket on this, but I can't find one anywhere). I have 2 hosts: one with 3 act

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-13 Thread Galen Shipman
on the phone this morning). On Jun 13, 2007, at 11:38 AM, Galen Shipman wrote: Hi Gleb, As we have discussed before I am working on adding support for multiple QPs with either per peer resources or shared resources. As a result of this I am trying to clean up a lot of the OpenIB code. It has

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-13 Thread Galen Shipman
On Jun 13, 2007, at 9:49 AM, Torsten Hoefler wrote: Hi Galen,Gleb, there is also something weird going on if I call the basic alltoall during the module_init() of a collective module (I need to wire up my own QPs in my coll component). It takes 7 seconds for 4 nodes and more than 30 minutes for

Re: [OMPI devel] Problem with openib on demand connection bring up.

2007-06-13 Thread Galen Shipman
Hi Gleb, As we have discussed before I am working on adding support for multiple QPs with either per peer resources or shared resources. As a result of this I am trying to clean up a lot of the OpenIB code. It has grown up organically over the years and needs some attention. Perhaps we can co

Re: [OMPI devel] threaded builds

2007-06-11 Thread Galen Shipman
On Jun 11, 2007, at 8:25 AM, Jeff Squyres wrote: I leave it to the thread subgroup to decide... Should we discuss on the call tomorrow? I don't have a strong opinion; I was just testing both because it was easy to do so. If we want to concentrate on the trunk, I can adjust my MTT setup. I

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
early next week. Thanks, Galen On Jun 7, 2007, at 12:49 PM, Don Kerr wrote: It would be difficult for me to attend this afternoon. Tomorrow is much better for me. -DON George Bosilca wrote: I'm available this afternoon. george. On Jun 7, 2007, at 2:35 PM, Galen Shipman

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
Call in details: I have scheduled your requested audio conference "Open MPI" for today beginning at 2:30pm to 3:30pm mountain time with 7 ports. Dial in number: 5-4165 local 866-260-0475 toll free - Galen On Jun 7, 2007, at 1:47 PM, Galen Shipman wrote: On Jun 7, 2007, a

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
Monday/Tuesday. - Galen -DON George Bosilca wrote: I'm available this afternoon. george. On Jun 7, 2007, at 2:35 PM, Galen Shipman wrote: Are people available today to discuss this over the phone? - Galen On Jun 7, 2007, at 11:28 AM, Gleb Natapov wrote: On Thu, Jun 07, 2007

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
Okay, how is 2:30 mountain time for everyone? I will setup a a call in if this works. Thanks, Galen On Jun 7, 2007, at 12:39 PM, George Bosilca wrote: I'm available this afternoon. george. On Jun 7, 2007, at 2:35 PM, Galen Shipman wrote: Are people available today to discuss

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
Are people available today to discuss this over the phone? - Galen On Jun 7, 2007, at 11:28 AM, Gleb Natapov wrote: On Thu, Jun 07, 2007 at 11:11:12AM -0400, George Bosilca wrote: ) I expect you to revise the patch in order to propose a generic solution or I'll trigger a vote against the p

[OMPI devel] BTL Semantics Teleconference: was : Re: [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
semantics of the BTL, whatever they may be. I have created a wiki here to help describe the issue as I currently see it, please feel free to add to this with suggestions/etc.. https://svn.open-mpi.org/trac/ompi/wiki/BTLSemantics - Galen On Jun 7, 2007, at 9:55 AM, Galen Shipman wrote

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-06-07 Thread Galen Shipman
AM, Gleb Natapov wrote: Hi Galen, On Sun, May 27, 2007 at 10:19:09AM -0600, Galen Shipman wrote: With current code this is not the case. Order tag is set during a fragment allocation. It seems wrong according to your description. Attached patch fixes this. If no specific ordering t

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14782

2007-05-27 Thread Galen Shipman
Actually, we still need MCA_BTL_FLAGS_FAKE_RDMA , it can be used as a hint for components such as one-sided. Galen On May 27, 2007, at 5:25 AM, g...@osl.iu.edu wrote: Author: gleb Date: 2007-05-27 07:25:39 EDT (Sun, 27 May 2007) New Revision: 14782 URL: https://svn.open-mpi.org/trac/ompi/c

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14780

2007-05-27 Thread Galen Shipman
Can we get rid of mca_pml_ob1_send_fin_btl and just have mca_pml_ob1_send_fin? It seems we should just always send the fin over the same btl and this would clean up the code a bit. Thanks, Galen On May 27, 2007, at 2:29 AM, g...@osl.iu.edu wrote: Author: gleb Date: 2007-05-27 04:29:38

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-05-27 Thread Galen Shipman
doh,, correction.. On May 27, 2007, at 10:23 AM, Galen Shipman wrote: The problem is that MCA_BTL_DES_FLAGS_PRIORITY was meant to indicate that the fragment was higher priority, but the fragment isn't higher priority. It simply needs to be ordered w.r.t. a previous fragment, an RD

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-05-27 Thread Galen Shipman
The problem is that MCA_BTL_DES_FLAGS_PRIORITY was meant to indicate that the fragment was higher priority, but the fragment isn't higher priority. It simply needs to be ordered w.r.t. a previous fragment, an RDMA in this case. But after the change priority flags is totally ignored. So the

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-05-27 Thread Galen Shipman
With current code this is not the case. Order tag is set during a fragment allocation. It seems wrong according to your description. Attached patch fixes this. If no specific ordering tag is provided to allocation function order of the fragment is set to be MCA_BTL_NO_ORDER. After call to s

Re: [OMPI devel] [OMPI svn] svn:open-mpi r14768

2007-05-25 Thread Galen Shipman
On May 24, 2007, at 2:48 PM, George Bosilca wrote: I see the problem this patch try to solve, but I fail to correctly understand the implementation. The patch affect all PML and BTL in the code base by adding one more argument to some of the most often called functions. And there is only o

Re: [OMPI devel] OMPI over ofed udapl over iwarp

2007-05-11 Thread Galen Shipman
As an aside, my personal feeling is that even when running over IB the preposting of recvs is worth the small overhead of piggybacking a credit system on the messages that already cross the wire. If nothing else, this avoids adding congestion of RNR-NAKS and the resends they trigger. Put

Re: [OMPI devel] OMPI over ofed udapl over iwarp

2007-05-11 Thread Galen Shipman
More like trying to work around the race condition that exists: The server side sends an rdma message first thus violating the iwarp protocol. For those who want the gory details: when the server sends first -and- that rdma message arrives at the client _before_ the client transitions into r

Re: [OMPI devel] Create success (r1.3a1r13481)

2007-02-02 Thread Galen Shipman
Expect this rev to die all over the place.. I had a bug in my r13481 checkin that prevented OB1 from getting selected, I corrected this in r13482. sorry bout that.. - Galen On Feb 2, 2007, at 7:44 PM, MPI Team wrote: Creating nightly snapshot SVN tarball was a success. Snapshot: 1.3a1

[OMPI devel] For Open MPI + BPROC users

2006-11-29 Thread Galen Shipman
We have found a potential issue with BPROC that may effect Open MPI. Open MPI by default uses PTYs for I/O forwarding, if PTYs aren't setup on the compute nodes, Open MPI will revert to using pipes. Recently (today) we found a potential issue with PTYs and BPROC. A simple reader/writer usin

[O-MPI devel] build warnings..

2005-08-12 Thread Galen Shipman
Current build warnings: mca_base_parse_paramfile_lex.c:1664: warning: 'yy_flex_realloc' defined but not used qsort.c:163: warning: cast from pointer to integer of different size show_help_lex.c:1606: warning: 'yy_flex_realloc' defined but not used rmgr_proxy.c:237: warning: ISO C forbids conver

Re: [O-MPI devel] couple of problems in openib mpool.

2005-08-12 Thread Galen Shipman
Hey Gleb, Sorry for the delay.. we have been doing a bit of reworking of the pml/btl so that the btl's can be shared outside of just the pml (collectives, etc). I have added the bug fix (old_reg). Will look at the assumption of non-null registration next. Thanks (and keep them coming ;-) ,

Re: [O-MPI devel] Fwd: Regarding MVAPI Component in Open MPI

2005-08-09 Thread Galen Shipman
Hi Sridhar, I have committed changes that allow you to set the debg verbosity, OMPI_MCA_btl_base_debug 0 - no debug output 1 - standard debug output 2 - very verbose debug output Also we have run the Pallas tests and are not able to reproduce your failures. We do see a warning in the Reduce te

Re: [O-MPI devel] Fwd: Regarding MVAPI Component in Open MPI

2005-08-09 Thread Galen Shipman
Hi On Aug 9, 2005, at 8:15 AM, Sridhar Chirravuri wrote: The same kind of output while running Pallas "pingpong" test. -Sridhar -Original Message- From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] On Behalf Of Sridhar Chirravuri Sent: Tuesday, August 09, 2005 7:44 PM

Re: [O-MPI devel] [PATCH] wrong variable type in OpenIB.

2005-08-08 Thread Galen Shipman
Gleb, Changes are in the trunk thanks, Galen On Aug 7, 2005, at 4:32 AM, Gleb Natapov wrote: Hello Galen, Included patch changes type of returned value from ibv_poll_cq. It should be signed because we check if it is less then zero later in the code. Index: ompi/mca/btl/openib/btl_openib_co

Re: [O-MPI devel] [PATCH] for ompi_free_list.c

2005-08-08 Thread Galen Shipman
Changes are in the trunk Thanks, Gaeln On Aug 8, 2005, at 7:38 AM, Gleb Natapov wrote: Hello, Included patch fixes bugs in ompi_free_list in the case ompi_free_list was created with NULL class and/or mpool parameters. Index: ompi/class/ompi_free_list.c =

[O-MPI devel] Mellanox VAPI SRQ.

2005-07-28 Thread Galen Shipman
Using the mvapi btl you can now set OMPI_MCA_btl_mvapi_use_srq=1 which will cause mvapi to use a shared receive queue. This will allow much better scaling as receives are posted per interface port and not per queue pair. Note: older versions of mellanox firmware may see a substantial performanc