Re: [OMPI devel] [EXTERNAL] trunk install failure [brbarret]

2013-01-29 Thread Barrett, Brian W
Thanks for noticing this. Fixed in the trunk. Brian On 1/28/13 11:15 PM, "Paul Hargrove" wrote: > Using tonight's trunk tarball (r27954) configured using "--with-devel-headers" > it looks like "make install" is trying to install rte_orte.h TWICE: > >> /usr/bin/install -c -m 644 ../../../../

Re: [OMPI devel] RFC: opal_list iteration macros

2013-01-29 Thread Jeff Squyres (jsquyres)
Agreed. I like the idea, and recognize that it is inspired by Linux kernel macros. But I would prefer them to be upper case to match our conventions. Also, please be sure to put in good comments explaining their use in the .h file. Thanks! On Jan 29, 2013, at 12:18 PM, Ralph Castain wrote:

Re: [OMPI devel] RFC: opal_list iteration macros

2013-01-29 Thread Ralph Castain
Ja, I've considered a similar addition on occasion. +1 from me Only comment: you should change these to match our convention by making the macros be capital letters: e.g., OPAL_LIST_FOREACH On Jan 29, 2013, at 9:08 AM, Nathan Hjelm wrote: > What: Add two new macros to opal_list.h: > > #define

[OMPI devel] RFC: opal_list iteration macros

2013-01-29 Thread Nathan Hjelm
What: Add two new macros to opal_list.h: #define opal_list_foreach(item, list, type) \ for (item = (type *) (list)->opal_list_sentinel.opal_list_next ; \ item != (type *) &(list)->opal_list_sentinel ; \ item = (type *) ((opal_list_

Re: [OMPI devel] Open MPI on Cray XC30 - suspicous configury

2013-01-29 Thread Nathan Hjelm
Opps, that was my mistake. I wrote a fix for the CLE5 and --with-alps= code but I never pushed it. r27962 should fix the issue. -Nathan On Mon, Jan 28, 2013 at 09:05:32PM -0800, Ralph Castain wrote: > Thanks Paul - appreciate the help! I chatted with Nathan this evening and now > have a much be

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Steve Wise
Tests good on 1.6.3 too. Thanks Josh! On 1/29/2013 9:17 AM, Steve Wise wrote: I applied it to 1.6.4rc2 and it fixed the seg fault issue. Lemme try 1.6.3 too. On 1/29/2013 9:11 AM, Joshua Ladd wrote: It should apply cleanly to 1.6.3 branch, I tested it this morning. From top level OMPI dire

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Steve Wise
I applied it to 1.6.4rc2 and it fixed the seg fault issue. Lemme try 1.6.3 too. On 1/29/2013 9:11 AM, Joshua Ladd wrote: It should apply cleanly to 1.6.3 branch, I tested it this morning. From top level OMPI directory do patch -p1 < yossi_openib.patch Best, Josh Josh -Original Me

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Joshua Ladd
It should apply cleanly to 1.6.3 branch, I tested it this morning. From top level OMPI directory do patch -p1 < yossi_openib.patch Best, Josh Josh -Original Message- From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres (jsquyres) Sent: Tue

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Jeff Squyres (jsquyres)
It's on the ticket that I just assigned to you. :-) On Jan 29, 2013, at 10:03 AM, Steve Wise wrote: > Will do...once I get a patch. > > STeve > On 1/29/2013 7:40 AM, Jeff Squyres (jsquyres) wrote: >> Thanks Josh. >> >> Steve -- if you can confirm that this fixes your problem in the v1.6 seri

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Steve Wise
Will do...once I get a patch. STeve On 1/29/2013 7:40 AM, Jeff Squyres (jsquyres) wrote: Thanks Josh. Steve -- if you can confirm that this fixes your problem in the v1.6 series, we'll go ahead and commit the patch. FWIW: the OpenFabrics startup code got a little cleanup/revamp on the trunk/

Re: [OMPI devel] New ARM patch

2013-01-29 Thread Jeff Squyres (jsquyres)
On Jan 28, 2013, at 8:46 AM, Leif Lindholm wrote: > But giving some flexibility for roadblocks, can we say "this quarter"? Cool. > Apart from our *cough* convoluted architecture vs. processor naming scheme... > It should be ARMv4, ARMv5, ARMv6 and ARMv7. Fixed in the README; thanks. >> --> D

[OMPI devel] RFC: Remove (broken) heterogeneous support

2013-01-29 Thread Jeff Squyres (jsquyres)
WHAT: Remove the configure command line option to enable heterogeneous support WHY: The heterogeneous conversion code isn't working, very few people use this feature WHERE: README and config/opal_configure_options.m4. See attached patch. TIMEOUT: Next Tuesday teleconf, 5 Feb, 2013 MORE DETAIL

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Jeff Squyres (jsquyres)
Thanks Josh. Steve -- if you can confirm that this fixes your problem in the v1.6 series, we'll go ahead and commit the patch. FWIW: the OpenFabrics startup code got a little cleanup/revamp on the trunk/v1.7 -- I suspect that's why you're not seeing the problem on trunk/v1.7 (e.g., look at the

[OMPI devel] Open MPI on Cray XC30 status

2013-01-29 Thread Paul Hargrove
OK, I am now on the openmpi-1.9a1r27954 tarball. In order to build OMPI and compile apps on this machine I must 1) edit the xe6 platform to --disable-shared/--enable-static (site-specific) 2) edit the xe6 platform file to provide a full path to the alps headers because the logic in orte_check_alp

Re: [OMPI devel] openib unloaded before last mem dereg

2013-01-29 Thread Joshua Ladd
So, we (Mellanox) have observed this ourselves when no suitable CPC can be found. Seems the BTL associated with this port is not destroyed and the ref count is not decreased. Not sure why you don't see the problem in 1.7. But we have a patch that I'll CMR today. Please review our symptoms, diag

[OMPI devel] trunk install failure [brbarret]

2013-01-29 Thread Paul Hargrove
Using tonight's trunk tarball (r27954) configured using "--with-devel-headers" it looks like "make install" is trying to install rte_orte.h TWICE: /usr/bin/install -c -m 644 ../../../../../ompi/mca/rte/orte/rte_orte.h > ../../../../../ompi/mca/rte/orte/rte_orte.h > '/global/homes/h/hargrove/GSCR

Re: [OMPI devel] Open MPI on Cray XC30 - suspicous configury

2013-01-29 Thread Ralph Castain
Thanks Paul - appreciate the help! I chatted with Nathan this evening and now have a much better understanding of the problem driving the code. We are going to review it tomorrow. Hope to have a fix shortly. On Jan 28, 2013, at 9:01 PM, Paul Hargrove wrote: > It looks now like the very first

Re: [OMPI devel] Open MPI on Cray XC30 - suspicous configury

2013-01-29 Thread Paul Hargrove
It looks now like the very first line of ORTE_CHECK_ALPS is actually the one that is preventing $1_CPPFLAGS from getting set for any caller other than the first: if test -z "$orte_check_alps_happy"; then So, my previous patch (tested by editing configure directly) didn't do the job. Again, th