My guess would be that both aspects (sorting + CID allocation) could be
a problem.
There was a loong time back an effort to convert the sequence of
allgather + qsort into a distributed sort (based on a paper by Moody et.
al. where he demonstrated the benefits of this approach). We didn't get
I'll put it on my to do list to write the configure logic for that, shouldn't be too difficult. Thanks for the report.Edgar___
devel mailing list
devel@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/devel
On 08/07/16 03:14 PM, Edgar Gabriel wrote:
I think I found the problem, I filed a pr towards master, and if that
passes I will file a pr for the 2.x branch.
Thanks!
Edgar
On 7/8/2016 1:14 PM, Eric Chamberland wrote:
On 08/07/16 01:44 PM, Edgar Gabriel wrote:
ok, but just to be able
I think I found the problem, I filed a pr towards master, and if that
passes I will file a pr for the 2.x branch.
Thanks!
Edgar
On 7/8/2016 1:14 PM, Eric Chamberland wrote:
On 08/07/16 01:44 PM, Edgar Gabriel wrote:
ok, but just to be able to construct a test case, basically what you
devel mailing list
de...@open-mpi.org
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2016/07/19171.php
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
de...@open-mpi.org
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2016/07/19169.php
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Comput
ompio
Am I doing something wrong, or are these options not supported in
these versions?
kindest regards
Mike
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall
in the 1.7, 1.8 and 1.10 series ROMIO remains the default. In the
upcomgin 2.x series, OMPIO will be the default, except for Lustre file
systems, where we will stick with ROMIO as the primary resource.
Regarding performance comparison, we ran numerous tests late last year
and early this year.
.
--
Paul H. Hargrove phhargr...@lbl.gov <mailto:phhargr...@lbl.gov>
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
--
Edgar Gabriel
Associate Professo
the scheduled work on the power infrastructure is finished, the UH
jenkins node is available again.
Happy new year to everybody
Edgar
On 12/30/2015 4:29 PM, Edgar Gabriel wrote:
I apologize, I completely forgot that t this node is affected. There
is work on the electrical infrastructure
I apologize, I completely forgot that t this node is affected. There is work on the electrical infrastructure of the building that hosts the main router to the cs machines. The machines themselves are still up and running, but cannot be reached. It should be finished tomorrow.
I'll keep you
argh. Forget about it. Sorry for the noise. linked to the wrong version :-(
Edgar
On 11/12/2015 11:13 AM, Edgar Gabriel wrote:
I have an interesting observation on master, for whatever reason the new
non-blocking collective I/O interfaces don't seem to be generated
anymore correctly. Does
to `MPI_File_iwrite_at_all'
Thanks
Edgar
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3857
On Wed, Sep 16, 2015 at 08:49:59AM -0500, Edgar Gabriel wrote:
something is borked right now on master in the management of inter vs. intra
communicators. It looks like intra communicators are wrongly selecting the
inter coll module thinking that it is an inter communicator, and we have
hangs
something is borked right now on master in the management of inter vs.
intra communicators. It looks like intra communicators are wrongly
selecting the inter coll module thinking that it is an inter
communicator, and we have hangs because of that. I attach a small
replicator, where a bcast of
?
And then -- assuming all goes well -- actually branch next Tuesday (16 June
2015) after the weekly webex.
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524
).
i also moved (up) some malloc in order to group them and simplify the
handling
of error cases.
per your comment, one move was incorrect indeed :-(
Cheers,
Gilles
On 5/28/2015 12:14 PM, Edgar Gabriel wrote:
ok, I see you moved the malloc up, the malloc was originally just for
the receiving side
looking at the old code, I understand what you tried to fix, I'll commit
a proper version in a couple of min.
Thanks
Edgar
On 5/28/2015 8:44 AM, Edgar Gabriel wrote:
ok, I see you moved the malloc up, the malloc was originally just for
the receiving side of the communication, you moved it up
ok, I see you moved the malloc up, the malloc was originally just for
the receiving side of the communication, you moved it up to cover both.
That is however unfortunately not correct.
I will fix it in a couple of mins.
Thanks
Edgar
On 5/28/2015 8:25 AM, Edgar Gabriel wrote:
Gilles,
I saw
needs to go.
---
Thanks
Edgar
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3857 Fax: +1
the config file).
any thoughts ?
Cheers,
Gilles
___
devel mailing list
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2015/05/17425.php
--
Edgar Gabri
ok, thanks! I will look into it.
Edgar
On 5/5/2015 8:23 PM, Orion Poplawski wrote:
On 05/05/2015 01:12 PM, Edgar Gabriel wrote:
Orion,
could you provide a couple of more details? I might not be able to fix
the problem for ompio in the 1.8 series, but I would definitely like
make sure
Orion,
could you provide a couple of more details? I might not be able to fix
the problem for ompio in the 1.8 series, but I would definitely like
make sure that it is not an issue in the master/1.9 series.
I compiled netcdf-4.3.3.1 and netcdf-fortran--4.4.2, using hdf-1.8.9,
parallel-tests
st
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2015/03/17116.php
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Sc
ary to what the
error message says).
Thanks
Edgar
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3
,
First, make sure your master includes
https://github.com/open-mpi/ompi/commit/05af80b3025dbb95bdd4280087450791291d7219
If this is not enough, try with --mca coll ^ml
Hope this helps
Gilles.
Edgar Gabriel <gabr...@cs.uh.edu>さんのメール:
I have some problems running jobs with ompi-master
n node crill-003
[crill-004:18161] mca: bml: Using tcp btl to [[3417,1],2] on node crill-003
^C
and than it just hangs.
Does anybody have an idea/suggestion what to try or look for?
Thanks
Edgar
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab
://www.open-mpi.org/community/lists/devel/2014/12/16555.php
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743
, but
is set on the 1.8 series (1.8 series checkout was from Nov. 20, so if
something changed in between the result might be different).
On 12/2/2014 9:27 AM, Artem Polyakov wrote:
2014-12-02 20:59 GMT+06:00 Edgar Gabriel <gabr...@cs.uh.edu
<mailto:gabr...@cs.uh.edu>>:
I checked with the debugger, that it did skip the entire section
On 12/2/2014 9:04 AM, Jeff Squyres (jsquyres) wrote:
Oy -- I thought we fixed that. :-(
Are you saying that configure output says that ltdladvise is not found?
On Dec 2, 2014, at 9:59 AM, Edgar Gabriel <gabr...@cs.uh.
ost:
http://www.open-mpi.org/community/lists/devel/2014/12/16397.php
--
С Уважением, Поляков Артем Юрьевич
Best regards, Artem Y. Polyakov
___
devel mailing list
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link t
, Ralph Castain wrote:
>>
>>> On Nov 25, 2014, at 9:36 AM, Edgar Gabriel <gabr...@cs.uh.edu
<mailto:gabr...@cs.uh.edu>> wrote:
>>>
>>> On 11/25/2014 11:31 AM, Ralph Castain wrote:
>>>>
>>>>>
function pointers will be NULL for ROMIO. Just want to make sure this is
ok with everybody.
Thanks
Edgar
On 11/25/2014 11:43 AM, Ralph Castain wrote:
On Nov 25, 2014, at 9:36 AM, Edgar Gabriel <gabr...@cs.uh.edu> wrote:
On 11/25/2014 11:31 AM, Ralph Castain wrote:
On Nov 25, 2014
On 11/25/2014 11:31 AM, Ralph Castain wrote:
On Nov 25, 2014, at 8:24 AM, Edgar Gabriel <gabr...@cs.uh.edu
<mailto:gabr...@cs.uh.edu>> wrote:
On 11/25/2014 10:18 AM, Ralph Castain wrote:
Hmmm…no, nothing has changed with regard to declspec that I know
about. I’ll ask the ob
, but I am
not aware that it is illegal to call a function of a component from
another component.
Thanks
Edgar
On Nov 25, 2014, at 8:07 AM, Edgar Gabriel <gabr...@cs.uh.edu>
wrote:
Has something changed recently on the trunk/master regarding
OMPI_DECLSPEC? The reason I ask is because
and that problem was not there roughly two weeks back the last time I
tested. I did verify that the the function listed there has an
OMPI_DECLSPEC before its definition.
Thanks
Edgar
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer
http://www.cisco.com/web/about/doing_business/legal/cri/
___
devel mailing list
de...@open-mpi.org <mailto:de...@open-mpi.org>
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/co
_0) 4.9.1
Ralph
___
devel mailing list
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2014/10/16088.php
--
Edgar Gabriel
Associate Professor
Parallel Softwa
hiqing
swise
tkordenbrock
vasily
vvenkates
vvenkatesan
yaeld
yosefe
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, U
of it?
Thanks
Edgar
On 08/28/2014 04:19 PM, Edgar Gabriel wrote:
we are having recently problems running trunk with openib component
enabled on one of our clusters. The problem occurs right in the
initialization part, here is the stack right before the segfault:
---snip---
(gdb) where
#0
we are having recently problems running trunk with openib component
enabled on one of our clusters. The problem occurs right in the
initialization part, here is the stack right before the segfault:
---snip---
(gdb) where
#0 mca_btl_openib_tune_endpoint (openib_btl=0x762a40,
sorry, let me be more precise for Wednesday, I have time before 12pm on
Wednesday.
Thanks
Edgar
On 6/19/2014 2:52 PM, Edgar Gabriel wrote:
> the best time for me would be either Wednesday morning (basically any
> time), or Thursday morning before 11am central.
>
> Thanks
> Edg
see that on the agenda, Edgar - can you please add it to ensure it
>>> gets covered?
>>>
>>>
>>> On Jun 19, 2014, at 10:36 AM, Edgar Gabriel <gabr...@cs.uh.edu> wrote:
>>>
>>>> If possible, I would like to attend remotely the discussion about
gt;
> Are there any other topics that people wanted to remote in to? Fair warning:
> remote attendance is "ok" via webex, but it's no substitute for being there.
>
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Departmen
_
> >> devel mailing list
> >> de...@open-mpi.org <mailto:de...@open-mpi.org>
> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> >> Link to this post:
> http://www.open-mpi.org/com
alph Castain <r...@open-mpi.org
> <mailto:r...@open-mpi.org>> wrote:
>
>>
>> On May 27, 2014, at 12:50 PM, Edgar Gabriel <gabr...@cs.uh.edu
>> <mailto:gabr...@cs.uh.edu>> wrote:
>>
>>>
>>>
>>> On 5/27/2014 2:46 PM,
On 5/27/2014 2:46 PM, Ralph Castain wrote:
>
> On May 27, 2014, at 12:27 PM, Edgar Gabriel <gabr...@cs.uh.edu>
> wrote:
>
>> I'll let ORNL talk about the STCI component itself (which might
>> have additional reasons), but keeping the code in trunk vs. an
>>
rganizations is going to build against it - so what did it
> accomplish to push the code into the repo?
>
> Mind you, I'm not saying I'm staunchly opposed - just trying to
> understand how it benefits anyone.
>
>
> On May 27, 2014, at 11:28 AM, Edgar Gabriel <gabr...@cs.uh.edu>
this
>> post:
>> http://www.open-mpi.org/community/lists/devel/2014/05/14852.php
>
> _______ devel mailing list
> de...@open-mpi.org Subscription:
> http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post:
> http://www.o
gt;
> Tel: ++49(0)711-685-87203
> email: nietham...@hlrs.de
> http://www.hlrs.de/people/niethammer
>
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link
because there's serious discussions afoot to switch
> OMPI's main SVN repo to Git, and I want to get a feel for the current
> landscape out there.
>
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science
co.com>
>
> Indiana
> ==
> lums: Andrew Lumsdaine <l...@cs.indiana.edu> **NO COMMITS IN LAST YEAR**
> adkulkar: Abhishek Kulkarni <adkul...@osl.iu.edu>
> afriedle: Andrew Friedley <afrie...@osl.iu.edu> **NO COMMITS IN LAST YEAR**
> timattox: Tim Ma
as far as I can tell right now, yes, its the final thing...
Thanks
Edgar
On 10/30/2012 2:05 PM, Ralph Castain wrote:
> Grrbloody verb @##$@$.
>
> Okay, I'll make that edit. Is that the last thing required to fix this
> problem?
>
> On Oct 30, 2012, at 11:57 AM, E
; @@ -42,6 +42,7 @@
>
> sources = \
> io_ompio.h \
> +io_ompio_nbc.h \
> io_ompio.c \
> io_ompio_component.c \
> io_ompio_module.c \
> ___
> svn mailing list
> s...@open-mpi.org
> http://www.open
ile_read_all’:
> fcoll_static_file_read_all.c:74: warning: ‘sorted_file_offsets’ may be used
> uninitialized in this function
>
>
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
sorry, should be fixed with the last commit...
Thanks
Edgar
On 2/28/2012 8:37 AM, Edgar Gabriel wrote:
> I'll look into this...
>
> Thanks
> Edgar
>
> On 2/28/2012 8:36 AM, Ralph Castain wrote:
>> I tried to build the trunk this morning on a machine where the fcoll
a1/ompi/mca/fcoll/static/fcoll_static_file_write_all.c:929:
> first defined here
>
>
> Any suggestions?
> Ralph
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabri
___
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Associate Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Hou
the code has been committed in r25079. Let us know if there are any
issues, compilation problems etc. I also work on an FAQ entry as has
been suggested on the teleconf last week.
Thanks
Edgar
On 8/12/2011 3:09 PM, Edgar Gabriel wrote:
> WHAT: add the ompio io module and new parallel
Feedback is highly welcome.
Thanks
Edgar
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3857
, not an int
)
Thanks
Edgar
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3857 Fax: +1 (713
values, and there is no implication in any document
> I know of that it should be.
>
> Regards,
> Nick Maclaren.
>
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
vision, Stop 8911
>> 100 Bureau Drive
>> Gaithersburg, MD 20899-8911
>>
>>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>
> _
be
> inside the communicator code).
>
> https://svn.open-mpi.org/trac/ompi/ticket/2477
>
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Ho
to (but simpler
than) the coll framework.
I've started an hg for this work:
http://bitbucket.org/jsquyres/ompi-topo-fixes/
Comments?
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G
it will be available in 1.3.4...
Thanks
Edgar
Chris Samuel wrote:
Hi Edgar,
- "Edgar Gabriel" <gabr...@cs.uh.edu> wrote:
just wanted to give a heads-up that I *think* I know what the problem
is. I should have a fix (with a description) either later today or
tomorrow mo
0;
}
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University
just wanted to give a heads-up that I *think* I know what the problem
is. I should have a fix (with a description) either later today or
tomorrow morning...
Thanks
Edgar
Edgar Gabriel wrote:
so I can confirm that I can reproduce the hang, and we (George, Rainer
and me) have looked
there is a ticket on that topic already (#2009), and I just added some
comments to that...
Jeff Squyres wrote:
On Sep 10, 2009, at 7:12 PM, Edgar Gabriel wrote:
so I can confirm that I can reproduce the hang, and we (George, Rainer
and me) have looked into that and are continue digging.
I
), which seems to be consistent with two other bug
reports currently being discussed on the mailing list. I could reproduce
the hang with both sm and tcp, so its probably not a btl issue but
somewhere higher.
Thanks
Edgar
Thomas Ropars wrote:
Edgar Gabriel wrote:
Two short questions: do you have
.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Te
am at the newest revision. What am I doing wrong?
Thanks
Edgar
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713)
Geoffrey
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip
ementation. Personally I find this advice rather
strange and (if nobody can give a good reason for it) would encourage
its removal...
Best regards,
Christian
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
solved.
Thanks
Ralph
On Fri, May 1, 2009 at 6:08 AM, Edgar Gabriel <gabr...@cs.uh.edu>
wrote:
David,
is this code representative for what your app is doing?
E.g. you have a base communicator (e.g. MPI_COMM_WORLD)
which is being 'split', freed again, split,
of OMPI. It only hangs on the 1.3 line when the cid
reaches 65536.
-david
--
David Gunter
HPC-3: Parallel Tools Team
Los Alamos National Laboratory
On Apr 30, 2009, at 12:28 PM, Edgar Gabriel wrote:
cid's are in fact not recycled in the block algorithm. The problem is
that comm_free
, Edgar Gabriel wrote:
Brian W. Barrett wrote:
When we added the CM PML, we added a pml_max_contextid field to the
PML structure, which is the max size cid the PML can handle (because
the matching interfaces don't allow 32 bits to be used for the cid.
At the same time, the max cid for OB1
on, that is?
remember that its not the communicator id allocation that is failing at
this point, so the question is do we have to 'validate' a cid with the
pml before we declare it to be ok?
Thanks
Edgar
Brian
On Thu, 30 Apr 2009, Edgar Gabriel wrote:
cid's are in fact not recycled in the block
recycled properly.
Brian
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science
, whether for 1.3.0 or 1.3.1 I leave that
up to others to decide...
Thanks
Edgar
Edgar Gabriel wrote:
I'm already debugging it. the good news is that it only seems to appear
with trunk, with 1.3 (after copying the new tuned module over), all the
tests pass.
Now if somebody can tell me a trick on how
:09 PM, Edgar Gabriel wrote:
I just debugged the Reduce_scatter bug mentioned previously. The
bug is
unfortunately not in hierarch, but in tuned.
Here is the code snipplet causing the problems:
int reduce_scatter (, mca_coll_base_module_t *module)
{
...
err = comm->c_coll.coll_reduce (...
eem to do that mostly
correctly.
Thanks
Edgar
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3857
next steps are to try to run this with the latest trunk and to narrow
down the failing case.
--td
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Techno
s that are out there.
- R.
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Ha
tes "all nodes below are -not- to be used".
Implementation isn't too hard once I have that...
On 3/3/08 9:44 AM, "Edgar Gabriel" <gabr...@cs.uh.edu> wrote:
Ralph,
could this mechanism be used also to exclude a node, indicating to never
run a job there? Here is the pro
) {
err = MPI_ERR_ARG;
} else {
OMPI_CHECK_DATATYPE_FOR_SEND(err, datatype, count);
___
svn mailing list
s...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/svn
--
Edgar Gabriel
Assistant Professor
Parallel S
://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-77204, USA
Tel: +1 (713) 743-3857
it's
bad to free GROUP_EMPTY. I agree that this is somewhat counter to the
rest of the MPI philosophy of not freeing predefined handles, though.
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G
nough away from v1.3 release that
this does not impact any release plans with VT.
--
Edgar Gabriel
Assistant Professor
Parallel Software Technologies Lab http://pstl.cs.uh.edu
Department of Computer Science University of Houston
Philip G. Hoffman Hall, Room 524Houston, TX-
Gleb Natapov wrote:
On Tue, Sep 11, 2007 at 10:00:07AM -0500, Edgar Gabriel wrote:
Gleb,
in the scenario which you describe in the comment to the patch, what
should happen is, that the communicator with the cid which started
already the allreduce will basically 'hang' until the other
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Gleb.
___
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Edgar Gabriel
Assist
Lisandro,
do you have an example for the extended collective operations tests
which fail? It would help track down the problem. I had a quick look at
our implementation but I can not find an obvious problem, so an example
would be extremely helpful.
Thanks
Edgar
- Some extended
to come out.
if ( NULL == array_of_errcodes ) {
return OMPI_ERRHANDLER_INVOKE(comm, MPI_ERR_ARG,
FUNC_NAME);
}
Looks like this has already been fixed for MPI_Comm_spawn().
Rolf
--
Edgar Gabriel
Assistant Professor
Rolf,
thanks for catching that, it is now fixed on the trunk and Jeff is
moving it right now to v1.0 and v1.1
Best regards
Edgar
Rolf Vandevaart wrote:
Repost because I did not see it in the archives after a day.
Rolf Vandevaart wrote On 05/09/06 17:32,:
We believe there is a minor bug
of the site)
Are there others?
--
==
Dr.-Ing. Edgar Gabriel
Clusters and Distributed Units
High Performance Computing Center Stuttgart (HLRS)
University of Stuttgart
Tel: +49 711 685 8039http://www.hlrs.de
96 matches
Mail list logo