Ok here is the numbers on my machines:
0 bytes
mvapich with header caching: 1.56
mvapich without header caching: 1.79
ompi 1.2: 1.59
So on zero bytes ompi not so bad. Also we can see that header
caching
decrease the mvapich latency on 0.23
1 bytes
mvapich with header caching: 1.58
mvapich
I think we need to take a step back from micro-optimizations such as
header caching.
Rich, George, Brian and I are currently looking into latency
improvements. We came up with several areas of performance
enhancements that can be done with minimal disruption. The progress
issue that Chr
In working on my changes in the ib_multifrag branch I modified the
ompi_free_list.
The change enables a free list to have a bit more personality than
what is dictated by the type of the item on the free list. The
overall problem was that we often use different free list item types
to simply
On Jul 12, 2007, at 10:29 AM, Don Kerr wrote:
Through mca parameters one can select the use of shared receive queues
in the openib btl, other than having fewer queues I am wondering what
are the benefits of using this option. Can anyone eleborate on using
them vs the default?
In the trunk the
...@open-mpi.org] On Behalf Of Galen Shipman
Sent: ב 09 יולי 2007 15:44
To: Open MPI Developers
Subject: Re: [devel-core] Collective Communications Optimization -
MeetingScheduled in Albuquerque!
All,
I have confirmed the meeting to be held at the HPC facility at UNM on
Aug 6,7,8.
Here is a lin
These two:
MCA_BTL_FLAGS_NEED_ACK MCA_BTL_FLAGS_NEED_CSUM
Are used by DR. They aren't used by OB1.
- Galen
On Jun 15, 2007, at 9:27 AM, Jeff Squyres wrote:
I notice that our help message for the btl_openib_flags MCA parameter
seems to be a bit out of date:
CHECK(reg_int("flags", "BTL fla
The patch applies to ib_multifrag as is without a conflict. But the
branch
doesn't compile with or without the patch so I was not able to test
it.
Do you have some uncommitted changes that may generate a conflict? Can
you commit them so they can be resolved? If there is no conflict
between
On Jun 13, 2007, at 12:52 PM, Gleb Natapov wrote:
On Wed, Jun 13, 2007 at 02:48:02PM -0400, Jeff Squyres wrote:
On Jun 13, 2007, at 2:41 PM, Gleb Natapov wrote:
Pasha tells me that the best times for Ishai and him are:
- 2000-2030 Israel time
- 1300-1300 US Eastern
- 1100-1130 US Mountain
-
On Jun 13, 2007, at 12:23 PM, Jeff Squyres wrote:
On Jun 13, 2007, at 1:40 PM, Gleb Natapov wrote:
[snip]
coordination kind of teleconference. If people think this is a
good
idea, I can setup the call.
sounds good to me.
Sounds good to me to. Pasha also works on async event thread. Thi
On Jun 13, 2007, at 12:07 PM, Gleb Natapov wrote:
On Wed, Jun 13, 2007 at 02:05:00PM -0400, Jeff Squyres wrote:
On Jun 13, 2007, at 1:54 PM, Jeff Squyres wrote:
With today's trunk, I still see the problem:
Same thing happens on v1.2 branch. I'll re-open #548.
I am sure it was never test
On Jun 13, 2007, at 11:33 AM, Jeff Squyres wrote:
On Jun 13, 2007, at 1:15 PM, Nysal Jan wrote:
There is a ticket (closed) here: https://svn.open-mpi.org/trac/ompi/
ticket/548
It was fixed by Galen for 1.2.
Ah -- I forgot to look at closed tickets. I think we broke it again;
it certainly f
On Jun 13, 2007, at 11:15 AM, Nysal Jan wrote:
I was just bitten yesterday by a problem that I've known about for a
while but had never gotten around to looking into (I could have sworn
that there was an open trac ticket on this, but I can't find one
anywhere).
I have 2 hosts: one with 3 act
on the phone this morning).
On Jun 13, 2007, at 11:38 AM, Galen Shipman wrote:
Hi Gleb,
As we have discussed before I am working on adding support for
multiple QPs with either per peer resources or shared resources.
As a result of this I am trying to clean up a lot of the OpenIB code.
It has
On Jun 13, 2007, at 9:49 AM, Torsten Hoefler wrote:
Hi Galen,Gleb,
there is also something weird going on if I call the basic alltoall
during the module_init() of a collective module (I need to wire up my
own QPs in my coll component). It takes 7 seconds for 4 nodes and more
than 30 minutes for
Hi Gleb,
As we have discussed before I am working on adding support for
multiple QPs with either per peer resources or shared resources.
As a result of this I am trying to clean up a lot of the OpenIB code.
It has grown up organically over the years and needs some attention.
Perhaps we can co
On Jun 11, 2007, at 8:25 AM, Jeff Squyres wrote:
I leave it to the thread subgroup to decide... Should we discuss on
the call tomorrow?
I don't have a strong opinion; I was just testing both because it was
easy to do so. If we want to concentrate on the trunk, I can adjust
my MTT setup.
I
early next week.
Thanks,
Galen
On Jun 7, 2007, at 12:49 PM, Don Kerr wrote:
It would be difficult for me to attend this afternoon. Tomorrow is
much
better for me.
-DON
George Bosilca wrote:
I'm available this afternoon.
george.
On Jun 7, 2007, at 2:35 PM, Galen Shipman
Call in details:
I have scheduled your requested audio conference "Open MPI" for today
beginning at 2:30pm to 3:30pm mountain time with 7 ports.
Dial in number: 5-4165 local 866-260-0475 toll free
- Galen
On Jun 7, 2007, at 1:47 PM, Galen Shipman wrote:
On Jun 7, 2007, a
Monday/Tuesday.
- Galen
-DON
George Bosilca wrote:
I'm available this afternoon.
george.
On Jun 7, 2007, at 2:35 PM, Galen Shipman wrote:
Are people available today to discuss this over the phone?
- Galen
On Jun 7, 2007, at 11:28 AM, Gleb Natapov wrote:
On Thu, Jun 07, 2007
Okay, how is 2:30 mountain time for everyone?
I will setup a a call in if this works.
Thanks,
Galen
On Jun 7, 2007, at 12:39 PM, George Bosilca wrote:
I'm available this afternoon.
george.
On Jun 7, 2007, at 2:35 PM, Galen Shipman wrote:
Are people available today to discuss
Are people available today to discuss this over the phone?
- Galen
On Jun 7, 2007, at 11:28 AM, Gleb Natapov wrote:
On Thu, Jun 07, 2007 at 11:11:12AM -0400, George Bosilca wrote:
) I expect you to revise the patch in order to propose a generic
solution or I'll trigger a vote against the p
semantics of the BTL, whatever they may be.
I have created a wiki here to help describe the issue as I currently
see it, please feel free to add to this with suggestions/etc..
https://svn.open-mpi.org/trac/ompi/wiki/BTLSemantics
- Galen
On Jun 7, 2007, at 9:55 AM, Galen Shipman wrote
AM, Gleb Natapov wrote:
Hi Galen,
On Sun, May 27, 2007 at 10:19:09AM -0600, Galen Shipman wrote:
With current code this is not the case. Order tag is set during a
fragment
allocation. It seems wrong according to your description. Attached
patch fixes
this. If no specific ordering t
Actually, we still need MCA_BTL_FLAGS_FAKE_RDMA , it can be used as
a hint for components such as one-sided.
Galen
On May 27, 2007, at 5:25 AM, g...@osl.iu.edu wrote:
Author: gleb
Date: 2007-05-27 07:25:39 EDT (Sun, 27 May 2007)
New Revision: 14782
URL: https://svn.open-mpi.org/trac/ompi/c
Can we get rid of mca_pml_ob1_send_fin_btl and just have
mca_pml_ob1_send_fin? It seems we should just always send the fin
over the same btl and this would clean up the code a bit.
Thanks,
Galen
On May 27, 2007, at 2:29 AM, g...@osl.iu.edu wrote:
Author: gleb
Date: 2007-05-27 04:29:38
doh,, correction..
On May 27, 2007, at 10:23 AM, Galen Shipman wrote:
The problem is that MCA_BTL_DES_FLAGS_PRIORITY was meant to
indicate
that the fragment was higher priority, but the fragment isn't higher
priority. It simply needs to be ordered w.r.t. a previous fragment,
an RD
The problem is that MCA_BTL_DES_FLAGS_PRIORITY was meant to indicate
that the fragment was higher priority, but the fragment isn't higher
priority. It simply needs to be ordered w.r.t. a previous fragment,
an RDMA in this case.
But after the change priority flags is totally ignored.
So the
With current code this is not the case. Order tag is set during a
fragment
allocation. It seems wrong according to your description. Attached
patch fixes
this. If no specific ordering tag is provided to allocation
function order of
the fragment is set to be MCA_BTL_NO_ORDER. After call to s
On May 24, 2007, at 2:48 PM, George Bosilca wrote:
I see the problem this patch try to solve, but I fail to correctly
understand the implementation. The patch affect all PML and BTL in
the code base by adding one more argument to some of the most often
called functions. And there is only o
As an aside, my personal feeling is that even when running over IB the
preposting of recvs is worth the small overhead of piggybacking a
credit
system on the messages that already cross the wire. If nothing else,
this avoids adding congestion of RNR-NAKS and the resends they
trigger.
Put
More like trying to work around the race condition that exists: The
server side sends an rdma message first thus violating the iwarp
protocol. For those who want the gory details: when the server sends
first -and- that rdma message arrives at the client _before_ the
client
transitions into r
Expect this rev to die all over the place.. I had a bug in my r13481
checkin that prevented OB1 from getting selected, I corrected this in
r13482.
sorry bout that..
- Galen
On Feb 2, 2007, at 7:44 PM, MPI Team wrote:
Creating nightly snapshot SVN tarball was a success.
Snapshot: 1.3a1
We have found a potential issue with BPROC that may effect Open MPI.
Open MPI by default uses PTYs for I/O forwarding, if PTYs aren't
setup on the compute nodes, Open MPI will revert to using pipes.
Recently (today) we found a potential issue with PTYs and BPROC. A
simple reader/writer usin
Current build warnings:
mca_base_parse_paramfile_lex.c:1664: warning: 'yy_flex_realloc' defined
but not used
qsort.c:163: warning: cast from pointer to integer of different size
show_help_lex.c:1606: warning: 'yy_flex_realloc' defined but not used
rmgr_proxy.c:237: warning: ISO C forbids conver
Hey Gleb,
Sorry for the delay.. we have been doing a bit of reworking of the
pml/btl so that the btl's can be shared outside of just the pml
(collectives, etc).
I have added the bug fix (old_reg). Will look at the assumption of
non-null registration next.
Thanks (and keep them coming ;-) ,
Hi Sridhar,
I have committed changes that allow you to set the debg verbosity,
OMPI_MCA_btl_base_debug
0 - no debug output
1 - standard debug output
2 - very verbose debug output
Also we have run the Pallas tests and are not able to reproduce your
failures. We do see a warning in the Reduce te
Hi
On Aug 9, 2005, at 8:15 AM, Sridhar Chirravuri wrote:
The same kind of output while running Pallas "pingpong" test.
-Sridhar
-Original Message-
From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] On
Behalf Of Sridhar Chirravuri
Sent: Tuesday, August 09, 2005 7:44 PM
Gleb,
Changes are in the trunk thanks,
Galen
On Aug 7, 2005, at 4:32 AM, Gleb Natapov wrote:
Hello Galen,
Included patch changes type of returned value from ibv_poll_cq.
It should be signed because we check if it is less then zero later
in the code.
Index: ompi/mca/btl/openib/btl_openib_co
Changes are in the trunk
Thanks,
Gaeln
On Aug 8, 2005, at 7:38 AM, Gleb Natapov wrote:
Hello,
Included patch fixes bugs in ompi_free_list in the case ompi_free_list
was
created with NULL class and/or mpool parameters.
Index: ompi/class/ompi_free_list.c
=
Using the mvapi btl you can now set OMPI_MCA_btl_mvapi_use_srq=1 which
will cause mvapi to use a shared receive queue. This will allow much
better scaling as receives are posted per interface port and not per
queue pair. Note: older versions of mellanox firmware may see a
substantial performanc
40 matches
Mail list logo