Re: [OMPI devel] subcommunicator OpenMPI issues on K

2017-11-07 Thread Edgar Gabriel
My guess would be that both aspects (sorting + CID allocation) could be a problem. A long time back there was an effort to convert the sequence of allgather + qsort into a distributed sort (based on a paper by Moody et al. that demonstrated the benefits of this approach). We didn't get
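
For readers unfamiliar with the allgather + qsort sequence mentioned in this excerpt, here is a minimal sketch in plain C with no MPI calls. It assumes every process already holds the gathered (color, key) pairs of all processes, sorts them locally, and reads off its rank in the new subcommunicator. The names `split_entry_t`, `compare_entries` and `split_new_rank` are hypothetical and are not taken from the Open MPI source; the point is only that each process sorts the full list, which is the cost a distributed sort would avoid.

```c
#include <assert.h>
#include <stdlib.h>

/* One gathered record per process of the parent communicator (hypothetical type). */
typedef struct {
    int color;  /* which subcommunicator the process wants to join */
    int key;    /* ordering hint inside that subcommunicator */
    int rank;   /* rank in the parent communicator, used as tie-breaker */
} split_entry_t;

/* Order by color, then key, then parent rank. */
static int compare_entries(const void *a, const void *b)
{
    const split_entry_t *x = a;
    const split_entry_t *y = b;
    if (x->color != y->color) return (x->color > y->color) - (x->color < y->color);
    if (x->key != y->key) return (x->key > y->key) - (x->key < y->key);
    return (x->rank > y->rank) - (x->rank < y->rank);
}

/* Returns the rank of my_rank inside its color group, or -1 if allocation fails.
 * colors[] and keys[] stand in for the result of the allgather. */
int split_new_rank(const int *colors, const int *keys, int n, int my_rank)
{
    assert(n > 0 && my_rank >= 0 && my_rank < n);

    split_entry_t *all = malloc((size_t) n * sizeof *all);
    if (NULL == all) {
        return -1;
    }
    for (int i = 0; i < n; i++) {
        all[i].color = colors[i];
        all[i].key = keys[i];
        all[i].rank = i;
    }

    /* Every process sorts all n entries: O(n log n) work and O(n) memory each. */
    qsort(all, (size_t) n, sizeof *all, compare_entries);

    int new_rank = -1;
    int position = 0;
    for (int i = 0; i < n; i++) {
        if (all[i].color != colors[my_rank]) {
            continue;
        }
        if (all[i].rank == my_rank) {
            new_rank = position;
            break;
        }
        position++;
    }
    free(all);
    return new_rank;
}
```

With four processes, colors {0, 1, 0, 1} and keys {5, 0, 1, 0}, parent rank 2 becomes rank 0 of color 0 because its key is smaller than that of parent rank 0.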

Re: [OMPI devel] #warning "Including liblustreapi.h is deprecated. Include lustreapi.h directly."

2016-12-15 Thread Edgar Gabriel
I'll put it on my to-do list to write the configure logic for that; it shouldn't be too difficult. Thanks for the report. Edgar

Re: [OMPI devel] 2.0.0rc4 Crash in MPI_File_write_all_end

2016-07-12 Thread Edgar Gabriel
On 08/07/16 03:14 PM, Edgar Gabriel wrote: I think I found the problem, I filed a pr towards master, and if that passes I will file a pr for the 2.x branch. Thanks! Edgar On 7/8/2016 1:14 PM, Eric Chamberland wrote: On 08/07/16 01:44 PM, Edgar Gabriel wrote: ok, but just to be able

Re: [OMPI devel] 2.0.0rc4 Crash in MPI_File_write_all_end

2016-07-08 Thread Edgar Gabriel
I think I found the problem, I filed a pr towards master, and if that passes I will file a pr for the 2.x branch. Thanks! Edgar On 7/8/2016 1:14 PM, Eric Chamberland wrote: On 08/07/16 01:44 PM, Edgar Gabriel wrote: ok, but just to be able to construct a test case, basically what you

Re: [OMPI devel] 2.0.0rc4 Crash in MPI_File_write_all_end

2016-07-08 Thread Edgar Gabriel
devel mailing list de...@open-mpi.org Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2016/07/19171.php -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu

Re: [OMPI devel] 2.0.0rc4 Crash in MPI_File_write_all_end

2016-07-08 Thread Edgar Gabriel
de...@open-mpi.org Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2016/07/19169.php -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Comput

Re: [OMPI devel] parameters for OMPIO

2016-05-11 Thread Edgar Gabriel
ompio Am I doing something wrong, or are these options not supported in these versions? kindest regards Mike -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall

Re: [OMPI devel] OMPIO vs ROMIO

2016-05-10 Thread Edgar Gabriel
in the 1.7, 1.8 and 1.10 series ROMIO remains the default. In the upcoming 2.x series, OMPIO will be the default, except for Lustre file systems, where we will stick with ROMIO as the primary resource. Regarding the performance comparison, we ran numerous tests late last year and early this year.

Re: [OMPI devel] [2.0.0rc2] NetBSD build failure (ompio)

2016-05-03 Thread Edgar Gabriel
. -- Paul H. Hargrove phhargr...@lbl.gov <mailto:phhargr...@lbl.gov> Computer Languages & Systems Software (CLaSS) Group Computer Science Department Tel: +1-510-495-2352 Lawrence Berkeley National Laboratory Fax: +1-510-486-6900 -- Edgar Gabriel Associate Professo

Re: [OMPI devel] UH jenkins node seems out for the holidays

2015-12-31 Thread Edgar Gabriel
the scheduled work on the power infrastructure is finished, the UH jenkins node is available again. Happy new year to everybody Edgar On 12/30/2015 4:29 PM, Edgar Gabriel wrote: I apologize, I completely forgot that this node is affected. There is work on the electrical infrastructure

Re: [OMPI devel] UH jenkins node seems out for the holidays

2015-12-30 Thread Edgar Gabriel
I apologize, I completely forgot that this node is affected. There is work on the electrical infrastructure of the building that hosts the main router to the cs machines. The machines themselves are still up and running, but cannot be reached. It should be finished tomorrow. I'll keep you

Re: [OMPI devel] interfaces gone?

2015-11-12 Thread Edgar Gabriel
argh. Forget about it. Sorry for the noise. I linked to the wrong version :-( Edgar On 11/12/2015 11:13 AM, Edgar Gabriel wrote: I have an interesting observation on master: for whatever reason, the new non-blocking collective I/O interfaces no longer seem to be generated correctly. Does

[OMPI devel] interfaces gone?

2015-11-12 Thread Edgar Gabriel
to `MPI_File_iwrite_at_all' Thanks Edgar -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3857

Re: [OMPI devel] inter vs. intra communicator problem on master

2015-09-16 Thread Edgar Gabriel
On Wed, Sep 16, 2015 at 08:49:59AM -0500, Edgar Gabriel wrote: something is borked right now on master in the management of inter vs. intra communicators. It looks like intra communicators are wrongly selecting the inter coll module thinking that it is an inter communicator, and we have hangs

[OMPI devel] inter vs. intra communicator problem on master

2015-09-16 Thread Edgar Gabriel
something is borked right now on master in the management of inter vs. intra communicators. It looks like intra communicators are wrongly selecting the inter coll module thinking that it is an inter communicator, and we have hangs because of that. I attach a small replicator, where a bcast of

Re: [OMPI devel] Branch for v2.0.0

2015-06-08 Thread Edgar Gabriel
? And then -- assuming all goes well -- actually branch next Tuesday (16 June 2015) after the weekly webex. -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524

Re: [OMPI devel] change in io_ompio.c

2015-05-28 Thread Edgar Gabriel
). i also moved (up) some malloc in order to group them and simplify the handling of error cases. per your comment, one move was incorrect indeed :-( Cheers, Gilles On 5/28/2015 12:14 PM, Edgar Gabriel wrote: ok, I see you moved the malloc up, the malloc was originally just for the receiving side

Re: [OMPI devel] change in io_ompio.c

2015-05-28 Thread Edgar Gabriel
looking at the old code, I understand what you tried to fix, I'll commit a proper version in a couple of min. Thanks Edgar On 5/28/2015 8:44 AM, Edgar Gabriel wrote: ok, I see you moved the malloc up, the malloc was originally just for the receiving side of the communication, you moved it up

Re: [OMPI devel] change in io_ompio.c

2015-05-28 Thread Edgar Gabriel
ok, I see you moved the malloc up, the malloc was originally just for the receiving side of the communication, you moved it up to cover both. That is however unfortunately not correct. I will fix it in a couple of mins. Thanks Edgar On 5/28/2015 8:25 AM, Edgar Gabriel wrote: Gilles, I saw

[OMPI devel] change in io_ompio.c

2015-05-27 Thread Edgar Gabriel
needs to go. --- Thanks Edgar -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3857 Fax: +1

Re: [OMPI devel] Open MPI collectives algorithm selection

2015-05-20 Thread Edgar Gabriel
the config file). any thoughts ? Cheers, Gilles ___ devel mailing list de...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2015/05/17425.php -- Edgar Gabri

Re: [OMPI devel] 1.8.5 release

2015-05-06 Thread Edgar Gabriel
ok, thanks! I will look into it. Edgar On 5/5/2015 8:23 PM, Orion Poplawski wrote: On 05/05/2015 01:12 PM, Edgar Gabriel wrote: Orion, could you provide a few more details? I might not be able to fix the problem for ompio in the 1.8 series, but I would definitely like to make sure

Re: [OMPI devel] 1.8.5 release

2015-05-05 Thread Edgar Gabriel
Orion, could you provide a few more details? I might not be able to fix the problem for ompio in the 1.8 series, but I would definitely like to make sure that it is not an issue in the master/1.9 series. I compiled netcdf-4.3.3.1 and netcdf-fortran--4.4.2, using hdf-1.8.9, parallel-tests

Re: [OMPI devel] mpi_test_suite question

2015-03-06 Thread Edgar Gabriel
st de...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2015/03/17116.php -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Sc

[OMPI devel] openib receive queue settings

2014-12-26 Thread Edgar Gabriel
ary to what the error message says). Thanks Edgar -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3

Re: [OMPI devel] problem running jobs on ompi-master

2014-12-26 Thread Edgar Gabriel
, First, make sure your master includes https://github.com/open-mpi/ompi/commit/05af80b3025dbb95bdd4280087450791291d7219 If this is not enough, try with --mca coll ^ml Hope this helps Gilles. Edgar Gabriel <gabr...@cs.uh.edu> wrote: I have some problems running jobs with ompi-master

[OMPI devel] problem running jobs on ompi-master

2014-12-26 Thread Edgar Gabriel
n node crill-003 [crill-004:18161] mca: bml: Using tcp btl to [[3417,1],2] on node crill-003 ^C and then it just hangs. Does anybody have an idea/suggestion what to try or look for? Thanks Edgar -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab

Re: [OMPI devel] Trunk warnings

2014-12-12 Thread Edgar Gabriel
://www.open-mpi.org/community/lists/devel/2014/12/16555.php -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743

Re: [OMPI devel] RTLD_GLOBAL question

2014-12-02 Thread Edgar Gabriel
, but is set on the 1.8 series (1.8 series checkout was from Nov. 20, so if something changed in between the result might be different). On 12/2/2014 9:27 AM, Artem Polyakov wrote: 2014-12-02 20:59 GMT+06:00 Edgar Gabriel <gabr...@cs.uh.edu <mailto:gabr...@cs.uh.edu>>:

Re: [OMPI devel] RTLD_GLOBAL question

2014-12-02 Thread Edgar Gabriel
I checked with the debugger, that it did skip the entire section On 12/2/2014 9:04 AM, Jeff Squyres (jsquyres) wrote: Oy -- I thought we fixed that. :-( Are you saying that configure output says that ltdladvise is not found? On Dec 2, 2014, at 9:59 AM, Edgar Gabriel <gabr...@cs.uh.

Re: [OMPI devel] RTLD_GLOBAL question

2014-12-02 Thread Edgar Gabriel
ost: http://www.open-mpi.org/community/lists/devel/2014/12/16397.php -- С Уважением, Поляков Артем Юрьевич Best regards, Artem Y. Polyakov ___ devel mailing list de...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link t

Re: [OMPI devel] question to OMPI_DECLSPEC

2014-11-26 Thread Edgar Gabriel
, Ralph Castain wrote: >> >>> On Nov 25, 2014, at 9:36 AM, Edgar Gabriel <gabr...@cs.uh.edu <mailto:gabr...@cs.uh.edu>> wrote: >>> >>> On 11/25/2014 11:31 AM, Ralph Castain wrote: >>>> >>>>>

Re: [OMPI devel] question to OMPI_DECLSPEC

2014-11-26 Thread Edgar Gabriel
function pointers will be NULL for ROMIO. Just want to make sure this is ok with everybody. Thanks Edgar On 11/25/2014 11:43 AM, Ralph Castain wrote: On Nov 25, 2014, at 9:36 AM, Edgar Gabriel <gabr...@cs.uh.edu> wrote: On 11/25/2014 11:31 AM, Ralph Castain wrote: On Nov 25, 2014

Re: [OMPI devel] question to OMPI_DECLSPEC

2014-11-25 Thread Edgar Gabriel
On 11/25/2014 11:31 AM, Ralph Castain wrote: On Nov 25, 2014, at 8:24 AM, Edgar Gabriel <gabr...@cs.uh.edu <mailto:gabr...@cs.uh.edu>> wrote: On 11/25/2014 10:18 AM, Ralph Castain wrote: Hmmm…no, nothing has changed with regard to declspec that I know about. I’ll ask the ob

Re: [OMPI devel] question to OMPI_DECLSPEC

2014-11-25 Thread Edgar Gabriel
, but I am not aware that it is illegal to call a function of a component from another component. Thanks Edgar On Nov 25, 2014, at 8:07 AM, Edgar Gabriel <gabr...@cs.uh.edu> wrote: Has something changed recently on the trunk/master regarding OMPI_DECLSPEC? The reason I ask is because

[OMPI devel] question to OMPI_DECLSPEC

2014-11-25 Thread Edgar Gabriel
and that problem was not there roughly two weeks back the last time I tested. I did verify that the function listed there has an OMPI_DECLSPEC before its definition. Thanks Edgar -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer

Re: [OMPI devel] Open MPI Developers F2F Q1 2015 (poll closes on Friday, 7th of November)

2014-11-05 Thread Edgar Gabriel
http://www.cisco.com/web/about/doing_business/legal/cri/ ___ devel mailing list de...@open-mpi.org <mailto:de...@open-mpi.org> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/co

Re: [OMPI devel] Deprecated call in sharedfp framework

2014-10-24 Thread Edgar Gabriel
_0) 4.9.1 Ralph ___ devel mailing list de...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2014/10/16088.php -- Edgar Gabriel Associate Professor Parallel Softwa

Re: [OMPI devel] Need to know your Github ID

2014-09-10 Thread Edgar Gabriel
hiqing swise tkordenbrock vasily vvenkates vvenkatesan yaeld yosefe -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, U

Re: [OMPI devel] segfault in openib component on trunk

2014-08-28 Thread Edgar Gabriel
of it? Thanks Edgar On 08/28/2014 04:19 PM, Edgar Gabriel wrote: we have recently been having problems running trunk with the openib component enabled on one of our clusters. The problem occurs right in the initialization part, here is the stack right before the segfault: ---snip--- (gdb) where #0

[OMPI devel] segfault in openib component on trunk

2014-08-28 Thread Edgar Gabriel
we have recently been having problems running trunk with the openib component enabled on one of our clusters. The problem occurs right in the initialization part, here is the stack right before the segfault: ---snip--- (gdb) where #0 mca_btl_openib_tune_endpoint (openib_btl=0x762a40,

Re: [OMPI devel] Agenda for next week

2014-06-19 Thread Edgar Gabriel
sorry, let me be more precise for Wednesday, I have time before 12pm on Wednesday. Thanks Edgar On 6/19/2014 2:52 PM, Edgar Gabriel wrote: > the best time for me would be either Wednesday morning (basically any > time), or Thursday morning before 11am central. > > Thanks > Edg

Re: [OMPI devel] Agenda for next week

2014-06-19 Thread Edgar Gabriel
see that on the agenda, Edgar - can you please add it to ensure it >>> gets covered? >>> >>> >>> On Jun 19, 2014, at 10:36 AM, Edgar Gabriel <gabr...@cs.uh.edu> wrote: >>> >>>> If possible, I would like to attend remotely the discussion about

Re: [OMPI devel] Agenda for next week

2014-06-19 Thread Edgar Gabriel
> > Are there any other topics that people wanted to remote in to? Fair warning: > remote attendance is "ok" via webex, but it's no substitute for being there. > -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Departmen

Re: [OMPI devel] RFC: add STCI component to OMPI/RTE framework

2014-05-29 Thread Edgar Gabriel
_ > >> devel mailing list > >> de...@open-mpi.org <mailto:de...@open-mpi.org> > >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > >> Link to this post: > http://www.open-mpi.org/com

Re: [OMPI devel] RFC: add STCI component to OMPI/RTE framework

2014-05-27 Thread Edgar Gabriel
alph Castain <r...@open-mpi.org > <mailto:r...@open-mpi.org>> wrote: > >> >> On May 27, 2014, at 12:50 PM, Edgar Gabriel <gabr...@cs.uh.edu >> <mailto:gabr...@cs.uh.edu>> wrote: >> >>> >>> >>> On 5/27/2014 2:46 PM,

Re: [OMPI devel] RFC: add STCI component to OMPI/RTE framework

2014-05-27 Thread Edgar Gabriel
On 5/27/2014 2:46 PM, Ralph Castain wrote: > > On May 27, 2014, at 12:27 PM, Edgar Gabriel <gabr...@cs.uh.edu> > wrote: > >> I'll let ORNL talk about the STCI component itself (which might >> have additional reasons), but keeping the code in trunk vs. an >>

Re: [OMPI devel] RFC: add STCI component to OMPI/RTE framework

2014-05-27 Thread Edgar Gabriel
rganizations is going to build against it - so what did it > accomplish to push the code into the repo? > > Mind you, I'm not saying I'm staunchly opposed - just trying to > understand how it benefits anyone. > > > On May 27, 2014, at 11:28 AM, Edgar Gabriel <gabr...@cs.uh.edu>

Re: [OMPI devel] RFC: add STCI component to OMPI/RTE framework

2014-05-27 Thread Edgar Gabriel
this >> post: >> http://www.open-mpi.org/community/lists/devel/2014/05/14852.php > > _______ devel mailing list > de...@open-mpi.org Subscription: > http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: > http://www.o

Re: [OMPI devel] Wrong Endianness in Open MPI for external32 representation

2014-04-29 Thread Edgar Gabriel
> > Tel: ++49(0)711-685-87203 > email: nietham...@hlrs.de > http://www.hlrs.de/people/niethammer > > > > ___ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link

Re: [OMPI devel] 1-question developer poll

2014-04-16 Thread Edgar Gabriel
because there's serious discussions afoot to switch > OMPI's main SVN repo to Git, and I want to get a feel for the current > landscape out there. > -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science

Re: [OMPI devel] Annual OMPI membership review: SVN accounts

2013-07-08 Thread Edgar Gabriel
co.com> > > Indiana > == > lums: Andrew Lumsdaine <l...@cs.indiana.edu> **NO COMMITS IN LAST YEAR** > adkulkar: Abhishek Kulkarni <adkul...@osl.iu.edu> > afriedle: Andrew Friedley <afrie...@osl.iu.edu> **NO COMMITS IN LAST YEAR** > timattox: Tim Ma

Re: [OMPI devel] sbgp problem

2012-10-30 Thread Edgar Gabriel
as far as I can tell right now, yes, it's the final thing... Thanks Edgar On 10/30/2012 2:05 PM, Ralph Castain wrote: > Grrbloody verb @##$@$. > > Okay, I'll make that edit. Is that the last thing required to fix this > problem? > > On Oct 30, 2012, at 11:57 AM, E

Re: [OMPI devel] [OMPI svn] svn:open-mpi r26934 - trunk/ompi/mca/io/ompio

2012-08-01 Thread Edgar Gabriel
; @@ -42,6 +42,7 @@ > > sources = \ > io_ompio.h \ > +io_ompio_nbc.h \ > io_ompio.c \ > io_ompio_component.c \ > io_ompio_module.c \ > ___ > svn mailing list > s...@open-mpi.org > http://www.open

Re: [OMPI devel] Warning in fcoll

2012-05-29 Thread Edgar Gabriel
ile_read_all’: > fcoll_static_file_read_all.c:74: warning: ‘sorted_file_offsets’ may be used > uninitialized in this function > > > > > ___ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel --

Re: [OMPI devel] Trunk build problem

2012-02-28 Thread Edgar Gabriel
sorry, should be fixed with the last commit... Thanks Edgar On 2/28/2012 8:37 AM, Edgar Gabriel wrote: > I'll look into this... > > Thanks > Edgar > > On 2/28/2012 8:36 AM, Ralph Castain wrote: >> I tried to build the trunk this morning on a machine where the fcoll

Re: [OMPI devel] Trunk build problem

2012-02-28 Thread Edgar Gabriel
a1/ompi/mca/fcoll/static/fcoll_static_file_write_all.c:929: > first defined here > > > Any suggestions? > Ralph > > > ___ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabri

Re: [OMPI devel] poor btl sm latency

2012-02-16 Thread Edgar Gabriel
___ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Associate Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Hou

Re: [OMPI devel] RFC: adding OMPIO module and new parallel I/O frameworks to trunk

2011-08-25 Thread Edgar Gabriel
the code has been committed in r25079. Let us know if there are any issues, compilation problems etc. I am also working on an FAQ entry, as was suggested on the teleconf last week. Thanks Edgar On 8/12/2011 3:09 PM, Edgar Gabriel wrote: > WHAT: add the ompio io module and new parallel

[OMPI devel] RFC: adding OMPIO module and new parallel I/O frameworks to trunk

2011-08-12 Thread Edgar Gabriel
Feedback is highly welcome. Thanks Edgar -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3857

[OMPI devel] problems compiling new ROMIO with PVFS2 support

2011-05-20 Thread Edgar Gabriel
, not an int ) Thanks Edgar -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3857 Fax: +1 (713

Re: [OMPI devel] MPI_File_get_size fails for files > 2 GB in Fortran

2010-12-20 Thread Edgar Gabriel
values, and there is no implication in any document > I know of that it should be. > > Regards, > Nick Maclaren. > > > > ___ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] MPI_File_get_size fails for files > 2 GB in Fortran

2010-12-20 Thread Edgar Gabriel
vision, Stop 8911 >> 100 Bureau Drive >> Gaithersburg, MD 20899-8911 >> >> >> ___ >> devel mailing list >> de...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/devel > > > _

Re: [OMPI devel] OMPI #2477

2010-07-13 Thread Edgar Gabriel
be > inside the communicator code). > > https://svn.open-mpi.org/trac/ompi/ticket/2477 > -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Ho

Re: [OMPI devel] RFC: revamp topo framework

2009-10-28 Thread Edgar Gabriel
to (but simpler than) the coll framework. I've started an hg for this work: http://bitbucket.org/jsquyres/ompi-topo-fixes/ Comments? -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G

Re: [OMPI devel] application hangs with multiple dup

2009-09-22 Thread Edgar Gabriel
it will be available in 1.3.4... Thanks Edgar Chris Samuel wrote: Hi Edgar, - "Edgar Gabriel" <gabr...@cs.uh.edu> wrote: just wanted to give a heads-up that I *think* I know what the problem is. I should have a fix (with a description) either later today or tomorrow mo

Re: [OMPI devel] Deadlock with comm_create since cid allocator change

2009-09-21 Thread Edgar Gabriel
0; } ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University

Re: [OMPI devel] application hangs with multiple dup

2009-09-16 Thread Edgar Gabriel
just wanted to give a heads-up that I *think* I know what the problem is. I should have a fix (with a description) either later today or tomorrow morning... Thanks Edgar Edgar Gabriel wrote: so I can confirm that I can reproduce the hang, and we (George, Rainer and me) have looked

Re: [OMPI devel] application hangs with multiple dup

2009-09-16 Thread Edgar Gabriel
there is a ticket on that topic already (#2009), and I just added some comments to that... Jeff Squyres wrote: On Sep 10, 2009, at 7:12 PM, Edgar Gabriel wrote: so I can confirm that I can reproduce the hang, and we (George, Rainer and me) have looked into that and are continuing to dig. I

Re: [OMPI devel] application hangs with multiple dup

2009-09-10 Thread Edgar Gabriel
), which seems to be consistent with two other bug reports currently being discussed on the mailing list. I could reproduce the hang with both sm and tcp, so it's probably not a btl issue but something higher up. Thanks Edgar Thomas Ropars wrote: Edgar Gabriel wrote: Two short questions: do you have

Re: [OMPI devel] application hangs with multiple dup

2009-09-10 Thread Edgar Gabriel
.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Te

[OMPI devel] compile error on trunk

2009-05-08 Thread Edgar Gabriel
am at the newest revision. What am I doing wrong? Thanks Edgar -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713)

Re: [OMPI devel] MPI_Group_compare is broken

2009-05-07 Thread Edgar Gabriel
Geoffrey ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip

Re: [OMPI devel] Inherent limit on #communicators?

2009-05-05 Thread Edgar Gabriel
ementation. Personally I find this advice rather strange and (if nobody can give a good reason for it) would encourage its removal... Best regards, Christian ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] Inherent limit on #communicators?

2009-05-02 Thread Edgar Gabriel
solved. Thanks Ralph On Fri, May 1, 2009 at 6:08 AM, Edgar Gabriel <gabr...@cs.uh.edu> wrote: David, is this code representative for what your app is doing? E.g. you have a base communicator (e.g. MPI_COMM_WORLD) which is being 'split', freed again, split,

Re: [OMPI devel] Inherent limit on #communicators?

2009-05-01 Thread Edgar Gabriel
of OMPI. It only hangs on the 1.3 line when the cid reaches 65536. -david -- David Gunter HPC-3: Parallel Tools Team Los Alamos National Laboratory On Apr 30, 2009, at 12:28 PM, Edgar Gabriel wrote: cid's are in fact not recycled in the block algorithm. The problem is that comm_free

Re: [OMPI devel] Inherent limit on #communicators?

2009-04-30 Thread Edgar Gabriel
, Edgar Gabriel wrote: Brian W. Barrett wrote: When we added the CM PML, we added a pml_max_contextid field to the PML structure, which is the max size cid the PML can handle (because the matching interfaces don't allow 32 bits to be used for the cid). At the same time, the max cid for OB1

Re: [OMPI devel] Inherent limit on #communicators?

2009-04-30 Thread Edgar Gabriel
on, that is? remember that it's not the communicator id allocation that is failing at this point, so the question is do we have to 'validate' a cid with the pml before we declare it to be ok? Thanks Edgar Brian On Thu, 30 Apr 2009, Edgar Gabriel wrote: cid's are in fact not recycled in the block
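
The question in this thread is whether a cid has to be validated against what the PML can carry. The following is a minimal C sketch of that idea, not the Open MPI implementation. It assumes a 16-bit cid field in the message header, which is only a guess consistent with the hang reported at cid 65536 elsewhere in the thread. The function names are hypothetical; `pml_max_contextid` is borrowed from the discussion as a plain parameter.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* True if the cid is within the limit advertised by the PML. */
bool sketch_cid_fits(uint32_t cid, uint32_t pml_max_contextid)
{
    return cid <= pml_max_contextid;
}

/* What an (assumed) 16-bit header field would carry if nobody checked:
 * only the low 16 bits of the cid survive. */
uint16_t sketch_cid_truncated(uint32_t cid)
{
    return (uint16_t) (cid & 0xFFFFu);
}

/* Checked variant: refuse to pack a cid that the header cannot represent. */
uint16_t sketch_cid_pack(uint32_t cid)
{
    assert(sketch_cid_fits(cid, UINT16_MAX));
    return (uint16_t) cid;
}
```

Under that assumption, cid 65536 would go out on the wire as 0 and be matched against the wrong communicator, so a check like `sketch_cid_fits` at allocation time turns a silent mismatch into an explicit failure.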

Re: [OMPI devel] Inherent limit on #communicators?

2009-04-30 Thread Edgar Gabriel
recycled properly. Brian ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science

Re: [OMPI devel] reduce_scatter bug with hierarch

2009-01-14 Thread Edgar Gabriel
, whether for 1.3.0 or 1.3.1 I leave that up to others to decide... Thanks Edgar Edgar Gabriel wrote: I'm already debugging it. the good news is that it only seems to appear with trunk, with 1.3 (after copying the new tuned module over), all the tests pass. Now if somebody can tell me a trick on how

Re: [OMPI devel] reduce_scatter bug with hierarch

2009-01-14 Thread Edgar Gabriel
:09 PM, Edgar Gabriel wrote: I just debugged the Reduce_scatter bug mentioned previously. The bug is unfortunately not in hierarch, but in tuned. Here is the code snippet causing the problems: int reduce_scatter (, mca_coll_base_module_t *module) { ... err = comm->c_coll.coll_reduce (...

[OMPI devel] reduce_scatter bug with hierarch

2009-01-13 Thread Edgar Gabriel
eem to do that mostly correctly. Thanks Edgar -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3857

Re: [OMPI devel] Dropped message for the non-existing communicator

2008-11-08 Thread Edgar Gabriel
next steps are to try to run this with the latest trunk and to narrow down the failing case. --td ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Techno

Re: [OMPI devel] Affect of compression on modex and launch messages

2008-04-04 Thread Edgar Gabriel
s that are out there. - R. ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Ha

Re: [OMPI devel] [RFC] Default hostfile MCA param

2008-03-04 Thread Edgar Gabriel
tes "all nodes below are -not- to be used". Implementation isn't too hard once I have that... On 3/3/08 9:44 AM, "Edgar Gabriel" <gabr...@cs.uh.edu> wrote: Ralph, could this mechanism be used also to exclude a node, indicating to never run a job there? Here is the pro

Re: [OMPI devel] [OMPI svn] svn:open-mpi r17366

2008-02-04 Thread Edgar Gabriel
) { err = MPI_ERR_ARG; } else { OMPI_CHECK_DATATYPE_FOR_SEND(err, datatype, count); ___ svn mailing list s...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/svn -- Edgar Gabriel Assistant Professor Parallel S

Re: [OMPI devel] Hierarchical Collectives Query

2008-01-24 Thread Edgar Gabriel
://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-77204, USA Tel: +1 (713) 743-3857

Re: [OMPI devel] GROUP_EMPTY fixes break intel tests :-(

2007-12-06 Thread Edgar Gabriel
it's bad to free GROUP_EMPTY. I agree that this is somewhat counter to the rest of the MPI philosophy of not freeing predefined handles, though. -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science

Re: [OMPI devel] Hybrid examples

2007-10-17 Thread Edgar Gabriel
___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G

Re: [OMPI devel] VampirTrace integration / bundling 3rd party software in OMPI

2007-10-09 Thread Edgar Gabriel
nough away from v1.3 release that this does not impact any release plans with VT. -- Edgar Gabriel Assistant Professor Parallel Software Technologies Lab http://pstl.cs.uh.edu Department of Computer Science University of Houston Philip G. Hoffman Hall, Room 524 Houston, TX-

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r16088

2007-09-11 Thread Edgar Gabriel
Gleb Natapov wrote: On Tue, Sep 11, 2007 at 10:00:07AM -0500, Edgar Gabriel wrote: Gleb, in the scenario which you describe in the comment to the patch, what should happen is that the communicator with the cid which already started the allreduce will basically 'hang' until the other

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r16088

2007-09-11 Thread Edgar Gabriel
devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Gleb. ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Edgar Gabriel Assist

Re: [OMPI devel] some possible bugs

2006-09-27 Thread Edgar Gabriel
Lisandro, do you have an example for the extended collective operations tests which fail? It would help track down the problem. I had a quick look at our implementation but I cannot find an obvious problem, so an example would be extremely helpful. Thanks Edgar - Some extended

Re: [OMPI devel] MPI_Comm_spawn_multiple() and MPI_ERRORCODES_IGNORE

2006-05-15 Thread Edgar Gabriel
to come out. if ( NULL == array_of_errcodes ) { return OMPI_ERRHANDLER_INVOKE(comm, MPI_ERR_ARG, FUNC_NAME); } Looks like this has already been fixed for MPI_Comm_spawn(). Rolf -- Edgar Gabriel Assistant Professor
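
The check quoted in this excerpt rejects a NULL array_of_errcodes, which is a problem if the "ignore" constant from the thread title is itself a null pointer. Below is a minimal C sketch of the alternative, with no MPI headers. All `SKETCH_` and `sketch_` names are hypothetical stand-ins, and the null-pointer value of the ignore constant is an assumption suggested by the quoted check, not something confirmed here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for the MPI constants discussed in the thread. */
#define SKETCH_ERRCODES_IGNORE ((int *) 0)
#define SKETCH_SUCCESS 0
#define SKETCH_ERR_ARG 1

/* Parameter check: a null errcodes array is the ignore sentinel, so it must
 * not be reported as an argument error. Only the count is validated here. */
int sketch_check_spawn_args(int count, int *array_of_errcodes)
{
    (void) array_of_errcodes;
    return (count < 0) ? SKETCH_ERR_ARG : SKETCH_SUCCESS;
}

/* Write the per-process error codes only if the caller asked for them. */
void sketch_store_errcodes(int *array_of_errcodes, int count, int code)
{
    assert(count >= 0);
    if (SKETCH_ERRCODES_IGNORE == array_of_errcodes) {
        return;  /* caller passed the ignore sentinel: nothing to fill in */
    }
    for (int i = 0; i < count; i++) {
        array_of_errcodes[i] = code;
    }
}
```

The design choice is to move the null test from the argument check to the point where the array is written, so that both a real array and the sentinel are accepted.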

Re: [OMPI devel] Repost: MPI_Comm_spawn_multiple() and MPI_ERRORCODES_IGNORE

2006-05-11 Thread Edgar Gabriel
Rolf, thanks for catching that, it is now fixed on the trunk and Jeff is moving it right now to v1.0 and v1.1 Best regards Edgar Rolf Vandevaart wrote: Repost because I did not see it in the archives after a day. Rolf Vandevaart wrote On 05/09/06 17:32,: We believe there is a minor bug

Re: [O-MPI devel] collectives discussion @LANL

2005-07-18 Thread Edgar Gabriel
of the site) Are there others? -- == Dr.-Ing. Edgar Gabriel Clusters and Distributed Units High Performance Computing Center Stuttgart (HLRS) University of Stuttgart Tel: +49 711 685 8039 http://www.hlrs.de