[OMPI devel] master nightly tarballs stopped on 11/21

2016-11-23 Thread Pritchard Jr., Howard
Hi Brian, Could you check what’s going on with the nightly tarball builds? Nothing new has been built since 11/21 even though a number of PR’s have been merged in since then. Thanks, Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory _

[OMPI devel] Follow-up to Open MPI SC'16 BOF

2016-11-22 Thread Pritchard Jr., Howard
Hello Folks, This is a followup to the question posed at the SC’16 Open MPI BOF: Would the community prefer to have a v2.2.x limited feature but backwards compatible release sometime in 2017, or would the community prefer a v3.x (not backwards compatible but potentially more features) sometime

Re: [OMPI devel] Developing MPI program without mpirun

2016-11-18 Thread Pritchard Jr., Howard
Hello Rui, Note there is no standard for the format of the port_name so don’t read much what it looks like when printed out. Could you provide some more information about your particular setup: - characteristics of the system you are using, e.g. a Linux cluster, laptop running os-x, etc. - wha

Re: [OMPI devel] Failure while loading shmem module

2016-11-16 Thread Pritchard Jr., Howard
m>>: Hi Pritchard, thank you for replying. Nothing changed adding the parameter you suggested. Can it depend on the fact that I'm running v.1.10.0rc7? It's a custom version, we didn't modify spml or sm related code though. 2016-11-15 14:12 GMT+01:00 Pritchard Jr., Howard mailt

Re: [OMPI devel] Failure while loading shmem module

2016-11-15 Thread Pritchard Jr., Howard
HI Gianmario, Probably something went wrong at the spml layer. Could you also add —mac spml_base_verbose 10 to the job launch line? Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory From: devel mailto:devel-boun...@lists.open-mpi.org>> on behalf of Gianmario Pozzi mailto:pozz

Re: [OMPI devel] regression: MPI_Neighbor_alltoall with empty graph

2016-11-12 Thread Pritchard Jr., Howard
Hi Daniel, Thanks very much for trying 2.0.1 out and reporting this. Looks like this may be a case of not checking “edge” cases in the use of this collective. Would you please open an issue? There is currently an issue 2324 related to neighbor collectives, but it isn’t directly related to the pr

[OMPI devel] direct launch problem with master

2016-10-30 Thread Pritchard Jr., Howard
Hi Folks, While trying to solve a different problem, I optimistically tried to use head-of –master to work on that problem. Now I’ve found a new problem with master when trying to do a direct launch with SLURM, srun: [nid00012:09456] [[27960,0],0] ERROR: Failed to identify the local daemon's UR

Re: [OMPI devel] OpenGrok (?)

2016-10-19 Thread Pritchard Jr., Howard
Hi Jeff, I’m not using it. Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory On 10/19/16, 9:21 AM, "devel on behalf of Jeff Squyres (jsquyres)" wrote: >Looking through the OpenGrok requirements, I have to admit that I'm not >excited about running and maintaining a Tomcat

Re: [OMPI devel] toward a unique session directory

2016-09-15 Thread Pritchard Jr., Howard
HI Gilles, From what point in the job launch are you needed to determine whether or not the job was direct launched? Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory On 9/15/16, 7:38 AM, "devel on behalf of Gilles Gouaillardet" wrote: >Ralph, > >that looks good to me. >

Re: [OMPI devel] Lots of new features rolled out on github.com today

2016-09-14 Thread Pritchard Jr., Howard
Ralph, I know with older versions of git you may have problems since you can’t use https. I think with newer versions it will prompt not just for passed but also 2-factor. That’s one problem I hit anyway when first enabling 2-factor. Howard -- Howard Pritchard HPC-DES Los Alamos National Lab

Re: [OMPI devel] OpenMPI 2.x: bug: violent break at beginning with (sequential) runs...

2016-09-13 Thread Pritchard Jr., Howard
Hello Eric, Is the failure seen with the same two tests? Or is it random which tests fail? If its not random, would you be able to post the tests to the list? Also, if possible, it would be great if you could test against a master snapshot: https://www.open-mpi.org/nightly/master/ Thanks,

Re: [OMPI devel] PMIx Language Bindings

2016-08-08 Thread Pritchard Jr., Howard
HI Ralph, If the java bindings are of use, I could see if my student how did a lot of the recent work in the Open MPI java bindings would be interested. He doesn¹t have a lot of extra cycles at the moment though. Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory On 8/7/16,

Re: [OMPI devel] tcp btl rendezvous performance question

2016-07-26 Thread Pritchard Jr., Howard
Hi Sreenidhi Only partial resolution. By pushing out the eager path to 4 MB we were able to get around 2GB/sec per socket connection with osu bw test. The kernel is quite old though - 2.6.x - and being a summer student project with a focus on IB vs rout able ROCE we've moved on. Howard _

[OMPI devel] LANL jenkins update

2016-07-21 Thread Pritchard Jr., Howard
Hi Folks, The LANL/(soon to not be iu) jenkins should now work with bot:lanl:retest Also, NERSC Cori system went down this morning for maintenance during CI check of PR 1896 on master. I didn't see any others impacted by the cori maintenance. Howard -- Howard Pritchard HPC-DES Los Alamos Nat

Re: [OMPI devel] 2.0.0rc4 Crash in MPI_File_write_all_end

2016-07-13 Thread Pritchard Jr., Howard
Jeff, I think this was fixed in PR 1227 on v2.x Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory On 7/13/16, 1:47 PM, "devel on behalf of Jeff Squyres (jsquyres)" wrote: >I literally just noticed that this morning (that singleton was broken on >master), but hadn't gotte

Re: [OMPI devel] Issue with 2.0.0rc3, singleton init

2016-06-16 Thread Pritchard Jr., Howard
Hi Ralph, thanks! does this impact particular systems or is it general problem. Howard -- Howard Pritchard HPC-DES Los Alamos National Laboratory From: devel mailto:devel-boun...@open-mpi.org>> on behalf of Ralph Castain mailto:r...@open-mpi.org>> Reply-To: Open MPI Developers mailto:de...@o

Re: [OMPI devel] Github migration: tomorrow

2014-09-30 Thread Pritchard Jr., Howard
Hi Jeff, When's the latest today that we can do checkins without causing problems? Howard -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres (jsquyres) Sent: Tuesday, September 30, 2014 9:57 AM To: Open MPI Developers List Subject: [OMPI devel]

Re: [OMPI devel] [patch] libnbc intercommunicator iallgather bug

2014-09-30 Thread Pritchard Jr., Howard
Hi Takahiro, Thanks very much for the patch and the test! After the git migration we'll open an issue and patch nbc_iallgather. This will get pushed to 1.8.4. Howard -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Kawashima, Takahiro Sent: Monday, Sept

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r32814 - trunk/ompi/mca/coll/ml

2014-09-29 Thread Pritchard Jr., Howard
Hi Jeff, Sure if that's the preferred check inside ompi itself. Howard -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres (jsquyres) Sent: Monday, September 29, 2014 3:59 PM To: Open MPI Developers List Subject: Re: [OMPI devel] [OMPI svn-full]

[OMPI devel] release 1.9

2014-09-29 Thread Pritchard Jr., Howard
Hi Folks, The release managers for the 1.9/2.0 stream have been putting together notes on features for this series, what sort of code pruning to do, etc. See https://github.com/open-mpi/ompi/wiki/Releasev19 We will be discussing the contents of the table(s) at the bottom of the wiki at tomorrow

[OMPI devel] dfference between my_node_rank and my_local_rank in orte proc_info_t

2014-09-26 Thread Pritchard Jr., Howard
Hi Folks, I'm trying to figure out something about the kind of info pmi's are suppose to be feeding back up in to orte/ompi, partly because native launch doesn't seem to work too well in trunk. One of the things I'm puzzling about is the purpose of the my_node_rank field in orte_proc_info_t. I'm

Re: [OMPI devel] RFC: "v1.9.0" (vs. "v1.9")

2014-09-22 Thread Pritchard Jr., Howard
Hi Folks, I thought that 1.9.X release would at some point become the 2.0 release. I thought trunk would go to 2.1 once we branch 1.9 from trunk, no? What Jeff and I don't like is using 1.9 with implicit 0, then having 1.9.1,1.9.2, etc. Howard -Original Message- From: devel [mailto:d

Re: [OMPI devel] RFC: remove the --with-threads configure option

2014-09-18 Thread Pritchard Jr., Howard
I second the motion. -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Thursday, September 18, 2014 8:43 AM To: Open MPI Developers Subject: Re: [OMPI devel] RFC: remove the --with-threads configure option None that I can think of - I'd re

[OMPI devel] PSM MTl work with srun launch?

2014-09-16 Thread Pritchard Jr., Howard
Hi Folks, I'm getting questions about the 1.8.2 and mtl psm and slurm-direct (srun) launch. User is hitting the problem that the global id is not being set. I've suggested for now that the user just set the OMPI_MCA_orte_precondition_transports environment variable for now to something like ex

Re: [OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Pritchard Jr., Howard
cking collectives Confirmed that trunk version r32658 does pass the test. From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Pritchard Jr., Howard Sent: Monday, September 15, 2014 4:16 PM To: Open MPI Developers Subject: Re: [OMPI devel] coll ml error with some nonblocking collectives Hi Rolf,

[OMPI devel] removing cnos support from ompi

2014-09-15 Thread Pritchard Jr., Howard
Hi Folks, I'd like to rip out the cnos ess/alps code from ompi. Its dead - no one is using CNOS (old cray xt systems) - and its very confusing to leave around. Any objections? Howard - Howard Pritchard HPC-5 Los Alamos National Laboratory

Re: [OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Pritchard Jr., Howard
Hi Rolf, This may be related to change set 32659. If you back this change out, do the tests pass? Howard From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart Sent: Monday, September 15, 2014 8:55 AM To: de...@open-mpi.org Subject: [OMPI devel] coll ml error with some

[OMPI devel] mpirun + aprun question

2014-09-12 Thread Pritchard Jr., Howard
Hi Folks, So, I've got a testbed cray system with no batch scheduler, just use the native alps both as the resource manager and as the job launcher for the orte daemons. What I'm noticing is that the mpirun command and -host option, or otherwise trying to specify via an mpirun way, the nodes to r

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r32711 - trunk/opal/mca/pmix/cray

2014-09-11 Thread Pritchard Jr., Howard
thanks, it was bad cut/paste From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Tim Mattox Sent: Thursday, September 11, 2014 2:54 PM To: Open MPI Developers Subject: Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r32711 - trunk/opal/mca/pmix/cray I'm sure that is not what you meant to

Re: [OMPI devel] Need to know your Github ID

2014-09-10 Thread Pritchard Jr., Howard
hppritcha->hppritcha -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres (jsquyres) Sent: Wednesday, September 10, 2014 4:46 AM To: Open MPI Developers List Subject: [OMPI devel] Need to know your Github ID As the next step of the planned migrati

Re: [OMPI devel] segfault in openib component on trunk

2014-08-28 Thread Pritchard Jr., Howard
Hi Edgar, Could you send me your conf file? I'll try to reproduce it. Maybe run with --mca btl_base_verbose 20 or something to see what the code that is parsing this field in the conf file is finding. Howard -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf

Re: [OMPI devel] RFC: Merge PMIx branch to trunk

2014-08-15 Thread Pritchard Jr., Howard
Hi Ralph, I would also add that the branch has the advantage that one can actually launch jobs on crays using aprun. That direct launch capability is busted in trunk at this point. Howard From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Thursday, August 14, 20

[OMPI devel] btl thread safety question

2014-08-11 Thread Pritchard Jr., Howard
Hi Folks, Has anyone checked about ompi thread safety support since the BTL move? I can only get the osu latency mt test to work using sm/shmem/vader. With TCP I see it hang after 32KB messages. Howard - Howard Pritchard HPC-5 Los Alamos Nationa

Re: [OMPI devel] v1.8.2 still held up...

2014-08-07 Thread Pritchard Jr., Howard
Hi Ralph, I'll review 4826 as proxy for hjelmn. I'm just checking that it builds on my system before saying okay. Howard From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Thursday, August 07, 2014 11:55 AM To: Open MPI Developers Subject: [OMPI devel] v1.8.2 stil

Re: [OMPI devel] opal_config_bottom.h question again

2014-08-04 Thread Pritchard Jr., Howard
der file. #ifndef OPAL_MALLOC_H #define OPAL_MALLOC_H #include "opal_config.h" #include /* * THIS FILE CANNOT INCLUDE ANY OTHER OPAL HEADER FILES!!! * * It is included via . Hence, it should not * include ANY other files, nor should it include "opal_config.h". * */ Don&

[OMPI devel] opal_config_bottom.h question again

2014-08-04 Thread Pritchard Jr., Howard
Hi Folks, As I said last week, I'm noticing now that on my opensuse 13.1 system and gcc 4.8.1, when I do a fresh checkout of trunk ompi and try to build, without any configure options, mca_base_mpool_frame.c does not compile. The reason is there is a conflict in opal_config_bottom.h and the co

Re: [OMPI devel] openmpi-1.8.2rc2 and f08 interface built with PGI-14.7 causes link error

2014-08-01 Thread Pritchard Jr., Howard
Sorry, finally got through all this ompi email and see this problem was fixed. -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Pritchard Jr., Howard Sent: Friday, August 01, 2014 8:59 AM To: Open MPI Developers Subject: Re: [OMPI devel] openmpi-1.8.2rc2

Re: [OMPI devel] openmpi-1.8.2rc2 and f08 interface built with PGI-14.7 causes link error

2014-08-01 Thread Pritchard Jr., Howard
Hi Jeff, Finally got info yesterday about where the newer PGI compilers are hiding out at LANL. I'll check this out today. Howard -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres (jsquyres) Sent: Tuesday, July 29, 2014 5:24 PM To: Open MPI D

Re: [OMPI devel] RFC: job size info in OPAL

2014-07-31 Thread Pritchard Jr., Howard
elopers Subject: Re: [OMPI devel] RFC: job size info in OPAL What is your definition of "global job size"? George. On Jul 31, 2014, at 11:06 , Pritchard Jr., Howard wrote: > Hi Folks, > > I think given the way we want to use the btl's in lower levels like > opal,

Re: [OMPI devel] RFC: job size info in OPAL

2014-07-31 Thread Pritchard Jr., Howard
Hi Folks, I think given the way we want to use the btl's in lower levels like opal, it is pretty disgusting for a btl to need to figure out on its own something like a "global job size". That's not its business. Can't we add some attributes to the component's initialization method that provides

Re: [OMPI devel] OMPI_XXX defines in opal_config.h.in question

2014-07-29 Thread Pritchard Jr., Howard
2014 at 3:58 PM, Pritchard Jr., Howard mailto:howa...@lanl.gov>> wrote: Hi Folks, So I'm trying to get my pmix project back in order after making the big mistake of pulling in the BTL move stuff. I get bizillions of warning messages from the compiler now about undefined macros. A nu

[OMPI devel] OMPI_XXX defines in opal_config.h.in question

2014-07-29 Thread Pritchard Jr., Howard
Hi Folks, So I'm trying to get my pmix project back in order after making the big mistake of pulling in the BTL move stuff. I get bizillions of warning messages from the compiler now about undefined macros. A number of the culprits are in opal_config.h.in, and begin with OMPI, like OMPI_WANT_MEM

Re: [OMPI devel] SHMEM symmetric objects in shared libraries

2014-07-29 Thread Pritchard Jr., Howard
Hi Pasha, It can be done. Cray Fortran 2008 compiler had to do this to be fully compliant with the Fortran 2008 co-array feature. Howard -Original Message- From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Shamis, Pavel Sent: Tuesday, July 29, 2014 12:02 PM To: Open MPI Dev

Re: [OMPI devel] opal_config_bottom.h problem with trunk

2014-07-29 Thread Pritchard Jr., Howard
i.org] On Behalf Of Jeff Squyres (jsquyres) Sent: Monday, July 28, 2014 5:56 PM To: Open MPI Developers List Subject: Re: [OMPI devel] opal_config_bottom.h problem with trunk Nope, haven't seen that before... On Jul 28, 2014, at 6:43 PM, Pritchard Jr., Howard wrote: > Hi Folks, >

Re: [OMPI devel] SHMEM symmetric objects in shared libraries

2014-07-29 Thread Pritchard Jr., Howard
Hi Josh, No, I think what Bert is talking about is having something like foobar.c == #include “shmem.h” long my_dso_val = 0; int foobar(void) { printf(“Hello from a shared library\n”); } == cc –fPIC –c foobar.c cc –shared –o libfoobar.so foobar.o cc –o berts_cod

[OMPI devel] opal_config_bottom.h problem with trunk

2014-07-28 Thread Pritchard Jr., Howard
Hi Folks, I was feeling lucky and decided to a fresh svn checkout of trunk and simple ./autogen.pl, ./configure make on a opensuse 13.1. I get a blowup in opal_config_bottom.h: pp@hagel-vm:~>../../opal/include/opal_config_bottom.h:383:38: error: expected declaration specifiers or '...' before