Re: [OMPI devel] RFC: revamp btl rdma interface

2014-11-06 Thread Howard Pritchard
arget addr being in flight. Howard 2014-11-06 12:08 GMT-07:00 Nathan Hjelm : > I haven't looked at that yet. Would be great to get the new osc component > working over both btls and mtls. I know portals supports atomics but I > don't know whether psm does. > > -Nathan >

Re: [OMPI devel] Pull requests on the trunk

2014-11-06 Thread Howard Pritchard
Hi Ralph, We should discuss this on Tuesday. I thought we'd decided for master to use a model where developers would directly push to ompi/master. I'd be willing to pull the requests from Gilles marked as bugs tomorrow. Howard 2014-11-06 13:16 GMT-07:00 Ralph Castain : > Hey folk

Re: [OMPI devel] OMPI devel] Pull requests on the trunk

2014-11-07 Thread Howard Pritchard
ry, we'll be in good shape. Howard 2014-11-06 20:11 GMT-07:00 Ralph Castain : > Yeah - to be clear, I had no problem with anything you did, Gilles. I was > only noting that several of them had positive comments, but they weren’t > being merged. Hate to see the good work lost or forgott

Re: [OMPI devel] 1.8.3 and PSM errors

2014-11-11 Thread Howard Pritchard
ready "connected". I notice for the psm mtl in ompi, a mask array is not provided, just a NULL. Howard 2014-11-11 16:00 GMT-07:00 George Bosilca : > > > On Nov 11, 2014, at 17:13 , Jeff Squyres (jsquyres) > wrote: > > > >> More particularly, it looks like add_pr

Re: [OMPI devel] 1.8.3 and PSM errors

2014-11-12 Thread Howard Pritchard
Hi Folks, I think this is a bug in the PSM MTL add_procs. The call to psm_ep_connect needs to take previously connected eps into account, much like what is done in the libfabric psm provider code. Howard 2014-11-12 3:12 GMT-07:00 Rainer Keller : > Dear Andrew, > no, this
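The idea in the message above (skip endpoints that are already connected instead of passing NULL for the mask) can be sketched as follows. This is only an illustration in Python, not the Open MPI or PSM code: the function name `build_connect_mask` and the use of plain integers as endpoint ids are assumptions made for the example.

```python
def build_connect_mask(epids, connected):
    """Return one mask entry per endpoint id: 1 = connect, 0 = skip.

    `epids` is the full list of peer endpoint ids (hypothetical integers here);
    `connected` is the set of ids that an earlier add_procs call already connected.
    """
    return [0 if epid in connected else 1 for epid in epids]


def record_connected(epids, mask, connected):
    """After a successful connect call, remember the newly connected ids."""
    for epid, wanted in zip(epids, mask):
        if wanted:
            connected.add(epid)
    return connected
```

A second add_procs call over an overlapping peer list would then produce a mask that only selects the peers not seen before.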

Re: [OMPI devel] 1.8.3 and PSM errors

2014-11-13 Thread Howard Pritchard
Hi Adrian, Please do your PSM results in the database. Would be very much appreciated. Howard 2014-11-13 7:46 GMT-07:00 Adrian Reber : > I applied the fix committed on master and described in > > https://github.com/open-mpi/ompi/issues/268 > > on 1.8.3 and 1.8.4rc1 and thi

[OMPI devel] RTLD_GLOBAL question

2014-12-01 Thread Howard Pritchard
the vm_open - modulo detection of the definition of RTLD_GLOBAL at compile time. Perhaps adding a way with an env. or config option to not enable RTLD_GLOBAL by default? Thanks, Howard

[OMPI devel] jenkins runtime failures

2014-12-03 Thread Howard Pritchard
Hi Folks, I can't reproduce the runtime error (looks like in MPI_Finalize) that the mlnx jenkins is hitting with our pull requests. Has anyone figured out the problem yet? I would prefer to have green checks on our pull requests before they get merged in. Thanks, Howard

Re: [OMPI devel] RTLD_GLOBAL question

2014-12-03 Thread Howard Pritchard
Hello Artem, No, but I was also told by schedmd that the slurm we have on our systems is ancient. So I'm no longer considering this problem very important. We have a workaround of always configuring with --disable-dlopen. Thanks, Howard 2014-12-02 20:59 GMT-07:00 Artem Polyakov : >

Re: [OMPI devel] jenkins runtime failures

2014-12-03 Thread Howard Pritchard
aster. I'd prefer to first have master fixed, then retest the PR, then merge if we get the green check. Howard 2014-12-03 12:47 GMT-07:00 Ralph Castain : > As for the checks before merge - I suspect this was done exactly that way, > if I am right as to the cause. The problem

Re: [OMPI devel] (no subject)

2014-12-08 Thread Howard Pritchard
Hello Kevin, Could you try testing with Open MPI 1.8.3? There was a bug in 1.8.1 that you are likely hitting in your testing. Thanks, Howard 2014-12-07 17:18 GMT-07:00 Kevin Buckley < kevin.buckley.ecs.vuw.ac...@gmail.com>: > Apologies for the lack of a subject line: cut and pasted

Re: [OMPI devel] [OMPI commits] Git: open-mpi/ompi branch master updated. dev-477-g09d03a1

2014-12-09 Thread Howard Pritchard
c40fd09d2a0575e493137158644fd2b610a48aca > > > > Howard's here at the Forum with me; I'll consult with him in person > later this morning. > > > > > > > > > > On Dec 9, 2014, at 7:15 AM, Ralph Castain wrote: > > > >> I

Re: [OMPI devel] Update to usnic BTL / libfabric

2014-12-09 Thread Howard Pritchard
Hi Ralph, Jeff fixed this in c40fd09. That's the problem I hit, in addition to later not having psm_infinipath. After that commit, and commit cd0a54d, you should be able to config and make again. Howard 2014-12-09 13:45 GMT-08:00 Ralph Castain : > Just as an FYI: we discove

[OMPI devel] opal_lifo/opal_fifo fail with make distcheck

2014-12-09 Thread Howard Pritchard
s' make[5]: Entering directory `/global/u2/h/hpp/ompi/openmpi-gitclone/_build/test/class' FAIL: opal_lifo FAIL: opal_fifo Has anyone else seen this? Howard

[OMPI devel] still supporting pgi?

2014-12-11 Thread Howard Pritchard
supporting building open mpi with pgi compilers? I'm using pgi 14.4. Thanks, Howard

Re: [OMPI devel] still supporting pgi?

2014-12-11 Thread Howard Pritchard
sks for it. Howard 2014-12-11 7:42 GMT-08:00 Jeff Squyres (jsquyres) : > > On Dec 11, 2014, at 7:40 AM, Ralph Castain wrote: > > > I’m unaware of any conscious decision to cut pgi off - I think it has > been more a case of nobody having a license to use for testing. > >

Re: [OMPI devel] OpenIB has some borked code

2014-12-12 Thread Howard Pritchard
Nathan, Please make sure the fix for this problem is contained in its own commit. Howard 2014-12-12 9:38 GMT-07:00 Nathan Hjelm : > > > Yeah, that code is completely wrong. I have a fix in my btl > modifications branch. > > > https://github.c

Re: [OMPI devel] [OMPI commits] Git: open-mpi/ompi branch master updated. dev-509-g38d6627

2014-12-15 Thread Howard Pritchard
I'd prefer Paul's suggestion to disable xpmem for sgi/uv for 1.8.X. Is anyone actually supporting this? Howard 2014-12-15 8:56 GMT-07:00 Nathan Hjelm : > > > Not yet. I am still trying to pinpoint the problem. From what I can tell > the SGI version of XPMEM should be nearly

[OMPI devel] ofi/mtl causing problems

2014-12-17 Thread Howard Pritchard
/openmpi/mca_mtl_ofi.soundefined symbol: fi_getinfo: undefined symbol: fi_getinfo [NID 05538] 2014-12-17 05:58:50 Apid 9226246: initiated application termination Application 9226246 exit codes: 127 *Stderr* Any ideas on how to fix this? Thanks, Howard

Re: [OMPI devel] ofi/mtl causing problems

2014-12-17 Thread Howard Pritchard
12-17 10:09 GMT-07:00 Jeff Squyres (jsquyres) : > > Is this on a PSM-enabled cluster? > > Can you send the full output from configure, the config.log, and the > output from "make"? > > Are you building statically (i.e., libmpi.a)? > > > > On Dec 17, 2014,

Re: [OMPI devel] ofi/mtl causing problems

2014-12-17 Thread Howard Pritchard
cally (i.e., libmpi.a)? > > > > On Dec 17, 2014, at 12:04 PM, Howard Pritchard > wrote: > > > I noticed my MTT smoke test failed with todays master build: > > > > name=PMI_process_mapping, (val=(vector,(0,4,4))) > > ./c_hello./c_hello: : symbol lookup

Re: [OMPI devel] ofi/mtl causing problems

2014-12-17 Thread Howard Pritchard
Hi Josh, I did another mtt run with --disable-libfabric included on the configure line and still failed with the same problem, mtl/ofi thinks it's okay to build... Howard 2014-12-17 11:48 GMT-07:00 Joshua Ladd : > > Seems to me this should be disabled by default until folks can quiet the

Re: [OMPI devel] ofi/mtl causing problems

2014-12-17 Thread Howard Pritchard
c 17, 2014, at 2:19 PM, Howard Pritchard wrote: > > > I did another mtt run with --disable-libfabric included on the configure > line and still failed with the same problem, mtl/ofi thinks it's okay to > build... > > FWIW: this problem is because the switch is --without-libfabr

Re: [OMPI devel] [OMPI commits] Git: open-mpi/ompi branch master updated. dev-564-g6c468b8

2014-12-17 Thread Howard Pritchard
Hi Jeff, Why did you delete the il libmca_common_alps_so_version? That's going to break my stuff. 2014-12-17 14:36 GMT-07:00 : > > This is an automated email from the git hooks/post-receive script. It was > generated because a ref change was pushed to the repository containing > the project "ope

[OMPI devel] ABI compatibility proposal for 1.9/2.0 release stream

2014-12-18 Thread Howard Pritchard
<https://github.com/open-mpi/ompi/wiki/Releasev19> on the wiki to include this proposal. Please let us know if you think that it might be problematic to relax the ABI compatibility promise in the features release series. This will be on the agenda for the developers' workshop next month. Thanks, Howard

[OMPI devel] simple ./configure doesn't work on master/HEAD

2014-12-18 Thread Howard Pritchard
implementation header configure: error: Cannot continue This is on a linux/x86_64/open suse box. Howard

[OMPI devel] commit be6d4649

2014-12-18 Thread Howard Pritchard
Hi Folks, commit be6d4649 <https://github.com/open-mpi/ompi/commit/be6d46490f7b80d4f5ea90c859ccbebe96bdaaba> broke simple ./configure of master. I'd like to revert this commit unless someone can figure out a better solution to Gilles' --without-hwloc issue soon. Howard

Re: [OMPI devel] BUG in ADIOI_NFS_WriteStrided

2014-12-19 Thread Howard Pritchard
Hi Eric, Does your app also work with MPICH? The romio in Open MPI is getting a bit old, so it would be useful to know if you see the same valgrind error using a recent MPICH. Howard 2014-12-19 9:50 GMT-07:00 Eric Chamberland : > > Hi, > > I encountered a new bug while testing ou

Re: [OMPI devel] [Open MPI Announce] Open MPI 1.8.4 released

2014-12-22 Thread Howard Pritchard
eir GRU api. I'm pretty sure that's the way the sgi mpi delivers small messages efficiently. Howard 2014-12-22 8:43 GMT-07:00 Nathan Hjelm : > > Yeah, I figured out why XPMEM is failing on SGI UV but have not figured > out a fix. If possible can we remove the check for sn/xpmem

Re: [OMPI devel] RFC: remove --disable-smp-locks

2015-01-06 Thread Howard Pritchard
I agree. Please remove this config option. 2015-01-06 9:44 GMT-07:00 Nathan Hjelm : > > What: Remove the --disable-smp-locks configure option from master. > > Why: Use of this option produces incorrect results/undefined behavior > when any shared memory BTL is in use. Since BTL usage is enabled

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
se_verbose 100 and check to see that in fact that env. variable isn't in the list of passed env. variables? Would one of you mind opening an issue to track this problem? Thanks, Howard 2015-01-09 7:52 GMT-07:00 Friedley, Andrew : > No this is not expected behavior. > > The PSM

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
Hi Folks, Sorry for my stupidity. I now see the problem. App is calling pmi_init twice because of the new ofiwg libfabric mtl. You can try mpirun blah blah blah --mca btl and things should work. Howard 2015-01-09 7:52 GMT-07:00 Friedley, Andrew : > No this is not expected behav

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
Hi Adrian, Andrew, Sorry, try again: both the libfabric psm provider and the open mpi psm mtl are trying to use psm_init. So, to avoid this problem, add --mca mtl psm to your mpirun command line. Sorry for the confusion. Howard 2015-01-09 7:52 GMT-07:00 Friedley, Andrew : > No this is

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
Hi Adrian, Please open an issue. We don't want users having to explicitly specify the mtl to use just to get a job to run on an intel/infinipath system. Howard 2015-01-09 13:04 GMT-07:00 Adrian Reber : > Should I still open a ticket? Will these be changed or do I always have >

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-11 Thread Howard Pritchard
'd have non-trivial build environments/runtime environments which would be better at testing whether we broke something. Howard 2015-01-09 17:36 GMT-07:00 Burette, Yohann : > Hi, > > For those of you who don't know me, my name is Yohann Burette, I work for > Intel and I c

Re: [OMPI devel] #327

2015-01-11 Thread Howard Pritchard
e aware of particular RDMA networks' capabilities, and avoid having to extend the PML interface with unnecessary methods. Hope this helps, Howard 2015-01-09 15:30 GMT-07:00 George Bosilca : > I have some comments about this ticket and the corresponding patch. > Honestly, the patch lacks

Re: [OMPI devel] Another Open MPI <-> PSM question (MPI_Isend()/MPI_Cancel())

2015-01-15 Thread Howard Pritchard
thanks George! 2015-01-15 11:43 GMT-07:00 George Bosilca : > From the MPI standard perspective MPI_Cancel doesn't have to succeed, it > can also gracefully fail. However, the PSM MTL diverges from the MPI > standard and if a request cannot be canceled an error is returned. Here is > a patch to f

Re: [OMPI devel] Coll ML issues

2015-01-25 Thread Howard Pritchard
Hi George, I put this on the agenda for this week's meeting. Howard 2015-01-23 16:43 GMT-07:00 George Bosilca : > During some experiments we have identified several major issues with coll > ML with a very recent version of Open MPI master (22ab638 Jan 20 13:21:44). > Based on t

Re: [OMPI devel] When libltdl is not your friend

2015-02-02 Thread Howard Pritchard
Hi Paul, Thanks for checking in depth into this. Just to help in determining how to proceed, which national center is this? Howard 2015-02-02 19:35 GMT-07:00 Paul Hargrove : > Below is one example of what happens when you assume that you can trust > the libltdl installed an otherwis

Re: [OMPI devel] omni-release Github comment bot

2015-02-04 Thread Howard Pritchard
+1 great stuff 2015-02-04 5:55 GMT-07:00 Jeff Squyres (jsquyres) : > OMPI devs -- > > Per lots of previous discussions, you all know that you can't assign > labels, milestones, or users to issues/pull requests on the ompi-release > repo. > > Gilles has written a Github bot that will allow you to

Re: [OMPI devel] omni-release Github comment bot

2015-02-05 Thread Howard Pritchard
Hi Jeff, Gilles' ideas are great. I agree with your RM stamp of approval policy. No removal of rm approved in the event of subsequent commits. Howard On Feb 5, 2015 5:04 AM, "Jeff Squyres (jsquyres)" wrote: > Gilles came up with a cool idea for the OMPIBot (see below). We can d

[OMPI devel] turning the bot on for ompi-release?

2015-02-05 Thread Howard Pritchard
Hi Jeff and Gilles, Do we have an ETA for enabling the bot on ompi-release? I think it will be a great help. Howard

Re: [OMPI devel] ess:alps build failure with PGI

2015-02-09 Thread Howard Pritchard
Hi Paul, I'll fix this. Howard 2015-02-06 17:38 GMT-07:00 Paul Hargrove : > The following in orte/mca/ess/alps/Makefile.am assumes a GNU (or GNU-like) > compiler: > > mca_ess_alps_la_CPPFLAGS = $(ess_alps_CPPFLAGS) -fno-ident > > If building with PGI, the result is

Re: [OMPI devel] OMPI devel] RoCE plus QDR IB tunable parameters

2015-02-10 Thread Howard Pritchard
Hi George, I'd say commit cf377db82 explains the vanishing of the bandwidth metric as well as the mis-labeling of the latency metric. Howard 2015-02-10 18:41 GMT-07:00 George Bosilca : > Somehow one of the most basic information about the capabilities of the > BTLs (bandwidth)

[OMPI devel] opal_dss.load question

2015-02-11 Thread Howard Pritchard
do the above technique, the heap allocator blows up in OBJ_RELEASE of buffer. Thanks, Howard

[OMPI devel] MTT failures

2015-02-18 Thread Howard Pritchard
/u2/h/hpp/mtt_carver_tmp/installs/8v68/install ./c_hello Before people begin blaming this as a cray thing, this is from the NERSC carver system which is an ibm dataplex system running redhat and using MLNX connectX HCAs. Anyone else seeing these failures? Howard

Re: [OMPI devel] git commit id in coverity

2015-02-19 Thread Howard Pritchard
Hi Ralph, How does one get this "MPI Create success" message? Is there a mailing list specifically for the nightly builds? Thanks, Howard 2015-02-16 21:48 GMT-07:00 Ralph Castain : > It's the git id of the nightly tarball - which you should get via the MPI > Create

Re: [OMPI devel] Tues Mar 3rd telecon

2015-02-26 Thread Howard Pritchard
I will also be available but suggest we skip next Tuesday. On Feb 25, 2015 5:04 PM, "Ralph Castain" wrote: > Hey folks > > Given that some number of us will be at the MPI Forum next week, do we > have a quorum available for the weekly telecon? Who would be able to make > it? > > Me: available >

[OMPI devel] opal_verbs_want_fork_support question

2015-02-26 Thread Howard Pritchard
Hi Folks, Just tried to build a fresh head of master and am getting opal_verbs_want_fork_support as undefined symbol when trying to build opal lib. Any ideas on where this should go? It would be nice to get jenkins checking everything, or at least a light weight travis check. Howard

[OMPI devel] psm and process affinity in open mpi

2015-03-03 Thread Howard Pritchard
es this problem. Could Open MPI also potentially have this same problem? If so, I'd want to add an mca param to set this option before calling psm_ep_open within psm mtl. Hmm.. maybe the ofi mtl supporter should talk with the libfabric psm provider folks about this. Thanks for any help, Howard

Re: [OMPI devel] psm and process affinity in open mpi

2015-03-03 Thread Howard Pritchard
Thanks Andrew, I was getting confused with the libfabric psm provider code inside open mpi. 2015-03-03 9:35 GMT-07:00 Friedley, Andrew : > Hi Howard, > > > > The PSM MTL sets PSM_EP_OPEN_AFFINITY_SKIP, so if I understand right, OMPI > already has the fix for you. > >

Re: [OMPI devel] libfabric code does not build with pgi-{10,11}

2015-03-05 Thread Howard Pritchard
just fine. If somebody does care, let me know who and I'll send logs off-list. However, this can be reproduced on carver.nersc.gov where at least Howard and Nathan have accounts. -Paul -- Paul H. Hargrove phhargr...@lbl.gov Computer Languages & Systems Software

Re: [OMPI devel] libfabric code does not build with pgi-{10,11}

2015-03-05 Thread Howard Pritchard
Hi Paul, For the 10.9 and 11.9 does the libfabric get configured to build for you on carver? I get a failure at config. I don't think this should be high priority since the libfabric embedding within open mpi should hopefully soon be a thing of the past. Howard 2015-03-04 14:28 GMT-07:00

[OMPI devel] mpi_test_suite question

2015-03-06 Thread Howard Pritchard
Are there known issues with the io tests? My first guess is yes since I notice esslingen is excluding io from their MTT runs. Thanks for any info, Howard

Re: [OMPI devel] mpi_test_suite question

2015-03-06 Thread Howard Pritchard
HI Edgar, Thanks for the explanation. Howard 2015-03-06 12:43 GMT-07:00 Edgar Gabriel : > this error message comes from ompio, the split collective are not properly > implemented at this point in time, they are basically just a printf > statement. Once I have the non-blocking colle

Re: [OMPI devel] jenkins and openmpi

2015-03-09 Thread Howard Pritchard
The Batman lamp, a great idea! 2015-03-09 10:11 GMT-06:00 Mike Dubman : > > Hello, > > Please check updated OMPI wiki page for detailed information for Jenkins > testing of OMPI repositories. > > https://github.com/open-mpi/ompi/wiki/PRJenkins > > Comments and suggestions are welcome. >

[OMPI devel] f08ts

2015-03-10 Thread Howard Pritchard
don't define them? If not, I'll open an issue. Thanks, Howard

Re: [OMPI devel] BML changes

2015-03-11 Thread Howard Pritchard
haswell node, not using HT), using the DMA engine of the NIC is not such a good idea. Howard 2015-03-11 10:57 GMT-06:00 Nathan Hjelm : > > Definitely a side-effect though it could be beneficial in some cases as > the RDMA engine in the HCA may be faster than using memcpy (larger

[OMPI devel] dlclose of libmpi, java gc, and pthread_key destructors

2015-04-06 Thread Howard Pritchard
alled, and then make sure there is an appropriate opal_tsd_key_destroy for the key during the MPI_Finalize procedure. Alternately, since this is basically a dso problem, one could define fini functions to run the destructors during the dlclose procedure. Any thoughts? Howard

Re: [OMPI devel] v1.8.5 NEWS and README

2015-04-17 Thread Howard Pritchard
Hi Jeff Minor cray corrections below On Apr 17, 2015 6:57 AM, "Jeff Squyres (jsquyres)" wrote: > > The v1.8 branch NEWS, README, and VERSION files have been updated in preparation for the v1.8.5 release. Please double check them -- especially NEWS, particularly to ensure that we are giving cred

[OMPI devel] mtt failures from last nite

2015-04-17 Thread Howard Pritchard
Hi Folks, I'm seeing build failures on both carver/pgi at nersc and on a cray internal machine with the nightly build of master. From the cray box: ommon_ugni.c:30:5: error: 'MCA_BASE_VERSION_2_0_0' undeclared here (not in a function) MCA_BASE_VERSION_2_0_0, common_ugni.c:31:5: warning: in

Re: [OMPI devel] v1.8.5 NEWS and README

2015-04-17 Thread Howard Pritchard
h work well for the slurm/pmi systems at trilabs than for the Cray's. I strongly encourage anyone wanting to use open mpi on cray systems to use master (on good days, today is not such a day) at this point in time. Sorry for the confusion. Howard 2015-04-17 8:18 GMT-06:00 Jeff Squyres (j

Re: [OMPI devel] v1.8.5 NEWS and README

2015-04-17 Thread Howard Pritchard
try and patch up the 1.8 release to work on the cray systems. Way too late in the game for that branch. But don't worry, for cory things will be running great using the 1.9/2.0. Of course, maybe for cory beyond-mpi program models might be more appropriate. Howard 2015-04-17 14:16 GMT-06:0

[OMPI devel] noticing odd message

2015-04-20 Thread Howard Pritchard
"orte", NULL, NULL, "server", 0); this just recently started appearing, perhaps today, but I've not been running anything over the weekend. Howard

Re: [OMPI devel] Fwd: OpenIB module initialisation causes segmentation fault when locked memory limit too low

2015-04-22 Thread Howard Pritchard
Hi Raphael, Thanks very much for the patches. Would one of the developers on the list have a system where they can make these kernel limit changes and which have HCAs installed? I don't have access to any system where I have such permissions. Howard 2015-04-22 8:55 GMT-06:00 Ra

Re: [OMPI devel] Fwd: OpenIB module initialisation causes segmentation fault when locked memory limit too low

2015-04-22 Thread Howard Pritchard
Hi Paul, silly me. forgot this was a ulimit thing. I'll test on carver. Howard 2015-04-22 10:45 GMT-06:00 Paul Hargrove : > And here is the backtrace I probably should have provided in the previous > email. > -Paul > > #0 0x2b4107ce9265 in raise () from /

Re: [OMPI devel] Fwd: OpenIB module initialisation causes segmentation fault when locked memory limit too low

2015-04-22 Thread Howard Pritchard
Hi Rafael, I give you an A+ for effort. We always appreciate patches. Howard 2015-04-22 12:43 GMT-06:00 Nathan Hjelm : > > Umm, why are you cleaning up this way. The allocated resources *should* > be freed by the udcm_module_finalize call. If there is a bug in that > path it sho

Re: [OMPI devel] Suggested README changes

2015-04-23 Thread Howard Pritchard
Hi Paul, Portals4 may be able to work on cray XE/XC on top of IAA (ibverbs simulation), but it absolutely is not the support library for Cray interconnects since XE days. Never was on Cray XT either, as you point out that was portals 3.X. Howard 2015-04-23 12:29 GMT-06:00 Paul Hargrove

[OMPI devel] romio refresh on master

2015-05-01 Thread Howard Pritchard
Hi Folks, I merged in the refresh of romio 3.1.4, special thanks to Gilles for doing this! I did some testing, but can't say it was extensive. If others would have time to run some of the MTT setups requesting romio rather than ompio for a bit that would be great. Thanks, Howard

[OMPI devel] is anyone seeing this on their intel/inifinipath cluster?

2015-05-01 Thread Howard Pritchard
bizarre is that psmx_eq_open shouldn't be visible outside of the libfabric.so itself. So having libfabric internal symbols required in an ompi mca lib seems to be incorrect. Howard

[OMPI devel] oops, jenkins mishap

2015-05-11 Thread Howard Pritchard
Hi Folks, Sorry for the comments pushed to PRs just a while ago. I was supposed to be configuring jenkins for a different project, not ompi. Sorry for the confusion. Howard

Re: [OMPI devel] [OMPI commits] Git: open-mpi/ompi branch master updated. dev-1731-g8e30579

2015-05-14 Thread Howard Pritchard
Is this by any chance associated with issue 579? 2015-05-14 20:49 GMT-06:00 Ralph Castain : > I'll look at the lines you cite, but that clearly isn't the problem we are > seeing here. I can verify that because the test case: > > mpirun -n 1 sleep 1000 > > does not open up any connections at all.

Re: [OMPI devel] Open MPI collectives algorithm selection

2015-05-19 Thread Howard Pritchard
ected. Following the KISS principle I would go with 2) returning a NULL rule when there is no matching size in the rule file for the communicator in question. Howard 2015-05-19 20:05 GMT-06:00 Gilles Gouaillardet : > Folks, > > this is a follow-up of a discussion on the user ML sta
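Option 2) from the message above, returning no rule when the rule file has no entry for the communicator's size, might look roughly like the sketch below. It is written in Python purely for illustration. The rule table, the algorithm names, and the functions `find_rule` and `choose_algorithm` are all hypothetical and are not taken from Open MPI's collective selection code.

```python
from typing import Dict, Optional

# Hypothetical rule table: communicator size -> algorithm name.
# Both the layout and the names are assumptions made for this sketch.
RULES: Dict[int, str] = {4: "binomial", 16: "pipeline"}


def find_rule(rules: Dict[int, str], comm_size: int) -> Optional[str]:
    """Return the rule for exactly this communicator size, or None (the NULL rule)."""
    return rules.get(comm_size)


def choose_algorithm(rules: Dict[int, str], comm_size: int,
                     default: str = "default") -> str:
    """Use the matching rule if there is one, otherwise fall back to the default."""
    rule = find_rule(rules, comm_size)
    return default if rule is None else rule
```

The point of the design is that the lookup never guesses a "nearest" size: a missing entry yields no rule, and the caller decides what the fallback is.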

Re: [OMPI devel] Proposal: update Open MPI's version number and release process

2015-05-20 Thread Howard Pritchard
If the smoke test fails, send a naughty-gram to the committer and copy devel. Pretty soon the developer will get trained to use the PR process, unless they are that engineer I've yet to meet who always writes flawless code. Howard > > Back in the SVN days it was nice to have a trunk

Re: [OMPI devel] README updates for new version number scheme

2015-06-03 Thread Howard Pritchard
looks good to me. 2015-05-28 12:54 GMT-06:00 Jeff Squyres (jsquyres) : > I'd appreciate some feedback on > https://github.com/open-mpi/ompi/commit/85f0fff1899ca8f4785776d0b301be513c33e675, > where I updated README on master to describe the new version number scheme > (note: this change is pending

[OMPI devel] ompi forking tomorrow

2015-06-15 Thread Howard Pritchard
Hi Folks, The plan is to fork ompi master tomorrow to a 2.0 branch. Last chance for any really good case that we should delay this by a day or two. Thanks, Howard

Re: [OMPI devel] ompi forking tomorrow

2015-06-15 Thread Howard Pritchard
Thanks for the heads up. taking a look. 2015-06-15 11:39 GMT-06:00 Ralph Castain : > You might take a gander at the MTT results first - they don’t look very > good on master :-( > > > > On Jun 15, 2015, at 10:14 AM, Howard Pritchard > wrote: > > > > Hi Folks,

[OMPI devel] Fwd: MTT test has completed, status: failed

2015-06-24 Thread Howard Pritchard
l: Aborted (6) [c1477:19137] Signal: Aborted (6) [c1477:19137] Signal code: (-6) [c1476:07375] Signal: Aborted (6) c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id

Re: [OMPI devel] === CREATE FAILURE (dev-1979-g13425e7) ===

2015-06-25 Thread Howard Pritchard
he PR process? We shouldn't have this kind of build failure anymore as long as people have stopped bypassing PR process. Howard 2015-06-25 19:22 GMT-06:00 MPI Team : > > ERROR: Command returned a non-zero exist status (dev-1979-g13425e7): >make -j 8 distcheck > > Sta

Re: [OMPI devel] === CREATE FAILURE (dev-1979-g13425e7) ===

2015-06-26 Thread Howard Pritchard
sorry, not true. look at the logs on IU. runs at 3:07 and 4:08 IU time. 2015-06-25 21:46 GMT-06:00 Jeff Squyres (jsquyres) : > Howard -- > > The LANL distcheck jenkins hasn't been running all day. > > > > On Jun 25, 2015, at 8:33 PM, Howard Pritchard > wrote:

[OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
Hi Folks, I'm seeing an error I've not seen before in the MTT runs on the ibm dataplex at NERSC. The mpirun launched jobs are failing with OMPI_PROC_BIND value is invalid errors. This is for the trivial ring tests. Is anyone else seeing these types of errors? Howard

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
laki is also showing the errors: Here's the shortened url: http://goo.gl/Ra264U looks like the badness started with the latest nightly. I think there was some activity in the orte binding area recently. Howard 2015-06-29 9:52 GMT-06:00 Jeff Squyres (jsquyres) : > Can you provid

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
1:19 PM, Jeff Squyres (jsquyres) < > jsquy...@cisco.com> wrote: > >> Ahh... it's OMP_PROC_BIND, not OMPI_PROC_BIND. >> >> Yes, Ralph just added this. >> >> I chatted with him about this on the phone moments ago; he's pretty sure >> h

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
I decided just to disable the carver/pgi mtt runs. 2015-06-29 15:10 GMT-06:00 Ralph Castain : > Very strange then - again, can you run it with the verbose flag and send > me the output? I can't replicate what you are seeing. > > > On Mon, Jun 29, 2015 at 4:05 PM, Howar

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-30 Thread Howard Pritchard
e put in place around the v2.x release to avoid these kinds of surprises there. Needless to say I will not be admitting this PR in to v2.x until it's cleaned up enough to work with all major compilers, or else is only activated when OMPI is compiled with an Intel compiler. Howard 2015-06-30 16:00 G

[OMPI devel] getting v1.10 and v2.x nightly tarballs where?

2015-07-15 Thread Howard Pritchard
Hi Folks, I'm trying to locate on jaguar/www.open-mpi.org where the nightly tarballs are for v1.10 and v2.x. I'm needing these tarballs for our installs on some new systems at LANL where we want to start out with these versions. Thanks for any help, Howard

[OMPI devel] anyone built master on qlogic system today?

2015-07-22 Thread Howard Pritchard
't get past configure. Not sure if this is specific to systems with psm installed yet. Anyone else seen this? Howard

Re: [OMPI devel] anyone built master on qlogic system today?

2015-07-22 Thread Howard Pritchard
Hi Folks, Found the problem, had to do a hard reset to origin/master for some reason to get missing files back. Howard 2015-07-22 12:17 GMT-06:00 Jeff Squyres (jsquyres) : > On Jul 22, 2015, at 1:46 PM, Howard Pritchard wrote: > > > > Hello Folks, > > > > I&qu

Re: [OMPI devel] 1.10.0rc2

2015-07-24 Thread Howard Pritchard
looks like ofi mtl is being naughty. it's the only mtl which registers with opal progress in component init method. -- sent from my smart phonr so no good type. Howard On Jul 23, 2015 7:03 PM, "Ralph Castain" wrote: > It looks like one of the MTL components is registeri

Re: [OMPI devel] 1.10.0rc2

2015-07-24 Thread Howard Pritchard
Paul Could you rerun with --mca mtl_base_verbose 10 added to cmd line and send output? Howard -- sent from my smart phonr so no good type. Howard On Jul 23, 2015 6:06 PM, "Paul Hargrove" wrote: > Yohann, > > With PR409 as it stands right now (commit 6daef310) I s

Re: [OMPI devel] 1.10.0rc2

2015-07-24 Thread Howard Pritchard
Jeff I was wrong about this. all the mtls except for portals4 register with opal progress in their comp init. I don't see how this is a problem though as base select only invokes comp init on the selected mtl. Howard -- sent from my smart phonr so no good type. Howard On Jul 24, 2015

Re: [OMPI devel] 1.10.0rc2

2015-07-24 Thread Howard Pritchard
Hi Jeff, Nathan and I think this is generic to all the mtl's and masked by the stuff in the cm select method for upping the priority of the mtl. We'd see this behavior for all mtl's if this priority upping code wasn't there and we fell back to ob1. Howard 2015-07-24

[OMPI devel] mca_pml_cm_component_init

2015-07-24 Thread Howard Pritchard
Hi Folks, Should we do something better than what is done currently in the mca_pml_cm_component_init method around lines 158-162? That's what's causing a bunch of problems right now in 1.10. I'd like to see a better approach taken in the v2.x Howard

[OMPI devel] new IU jenkins project

2015-07-29 Thread Howard Pritchard
test script. The jenkins VM is an x86_64 (Intel E5-2690 cpus) VM running rhel 6.7. Thanks, Howard

[OMPI devel] new branch on open-mpi/ompi?

2015-08-05 Thread Howard Pritchard
HI Folks, There's a new branch on open-mpi/ompi repo. Is this intentional? Howard

Re: [OMPI devel] orte-dvm startup fails on HEAD

2015-08-21 Thread Howard Pritchard
I will check if i can reproduce on nersc systems. -- sent from my smart phonr so no good type. Howard On Aug 21, 2015 7:51 AM, "Ralph Castain" wrote: > I’ll take a look at it > > > On Aug 20, 2015, at 11:34 PM, Mark Santcroos > wrote: > > > > Hi a

Re: [OMPI devel] mca_mtl_psm and java

2015-08-25 Thread Howard Pritchard
I think rather than trying workarounds of dubious robustness inside open mpi we - document the issue on either the somewhat aged open mpi website faq or add it to a wiki page on github - file a bug against intel psm -- sent from my smart phonr so no good type. Howard On Aug 25, 2015 6

Re: [OMPI devel] [OMPI commits] Git: open-mpi/ompi branch master updated. dev-2362-ge2124c6

2015-08-25 Thread Howard Pritchard
is this going in to v2.x? -- sent from my smart phonr so no good type. Howard On Aug 25, 2015 7:54 AM, wrote: > This is an automated email from the git hooks/post-receive script. It was > generated because a ref change was pushed to the repository containing > the project

Re: [OMPI devel] mca_mtl_psm and java

2015-08-25 Thread Howard Pritchard
I'll update the java FAQ. 2015-08-25 8:36 GMT-06:00 Jeff Squyres (jsquyres) : > On Aug 25, 2015, at 10:00 AM, Howard Pritchard > wrote: > > > > I think rather than trying workarounds of dubious robustness inside open > mpi we > > > > - document the issu

Re: [OMPI devel] mca_mtl_psm and java

2015-08-25 Thread Howard Pritchard
the mpirun script to include the --mca mtl ^psm tag if > java is in the run string? > > -Nathan > > On Tue, Aug 25, 2015 at 9:47 AM, Howard Pritchard > wrote: > >> I'll update the java FAQ. >> >> 2015-08-25 8:36 GMT-06:00 Jeff Squyres (jsquyres) :
