arget addr being in flight.
Howard
2014-11-06 12:08 GMT-07:00 Nathan Hjelm :
> I haven't look at that yet. Would be great to get the new osc component
> working over both btls and mtls. I know portals supports atomics but I
> don't know whether psm does.
>
> -Nathan
>
HI Ralph,
We should discuss this on Tuesday. I thought we'd decided for master to
use a model where developers would directly push to ompi/master.
I'd be willing to pull the request from Giles marked as bugs tomorrow.
Howard
2014-11-06 13:16 GMT-07:00 Ralph Castain :
> Hey folk
ry, we'll be
in good shape.
Howard
2014-11-06 20:11 GMT-07:00 Ralph Castain :
> Yeah - to be clear, I had no problem with anything you did, Gilles. I was
> only noting that several of them had positive comments, but they weren’t
> being merged. Hate to see the good work lost or forgott
ready
"connected". I notice for the psm mtl
in ompi, a mask array is not provided, just a NULL.
Howard
2014-11-11 16:00 GMT-07:00 George Bosilca :
>
> > On Nov 11, 2014, at 17:13 , Jeff Squyres (jsquyres)
> wrote:
> >
> >> More particularly, it looks like add_pr
HI Folks,
I think this is a bug in the PSM MTL add_procs. The call to psm_ep_connect
needs to be taking previously connected ep's into account,
much like what is done in the libfabric psm provider code.
Howard
2014-11-12 3:12 GMT-07:00 Rainer Keller :
> Dear Andrew,
> no, this
Hi Adrian,
Please do your PSM results in the database. Would be
very much appreciated.
Howard
2014-11-13 7:46 GMT-07:00 Adrian Reber :
> I applied the fix committed on master and described in
>
> https://github.com/open-mpi/ompi/issues/268
>
> on 1.8.3 and 1.8.4rc1 and thi
the vm_open - modulo detection of the definition of RTLD_GLOBAL at
compile time. Perhaps adding a way with an env. or config option to not
enable RTLD_GLOBAL by default?
Thanks,
Howard
Hi Folks,
I can't reproduce the runtime error (looks like in MPI_Finalize) that
the mlnx jenkins is hitting with our pull requests.
Has anyone figured out the problem yet?
I would prefer to have green checks on our pull requests before
they get merged in.
Thanks,
Howard
Hello Artem,
No, but I was also told by schedmd that the slurm we have on our systems is
ancient.
So I'm no longer considering this problem very important. We have a
workaround of always configuring
with --disable-dlopen.
Thanks,
Howard
2014-12-02 20:59 GMT-07:00 Artem Polyakov :
>
aster. I'd prefer
to first have master fixed, then retest the PR, then merge if we get the
green check.
Howard
2014-12-03 12:47 GMT-07:00 Ralph Castain :
> As for the checks before merge - I suspect this was done exactly that way,
> if I am right as to the cause. The problem
Hello Kevin,
Could you try testing with Open MPI 1.8.3? There was a bug in 1.8.1
that you are likely hitting in your testing.
Thanks,
Howard
2014-12-07 17:18 GMT-07:00 Kevin Buckley <
kevin.buckley.ecs.vuw.ac...@gmail.com>:
> Apologies for the lack of a subject line: cut and pasted
c40fd09d2a0575e493137158644fd2b610a48aca
> >
> > Howard's here at the Forum with me; I'll consult with him in person
> later this morning.
> >
> >
> >
> >
> > On Dec 9, 2014, at 7:15 AM, Ralph Castain wrote:
> >
> >> I
HI Ralph,
Jeff fixed this in c40fd09. That's the problem I hit, in addition to
later not having psm_infinipath. After that commit,and commit cd0a54d
you should be able to config and make again.
Howard
2014-12-09 13:45 GMT-08:00 Ralph Castain :
> Just as an FYI: we discove
s'
make[5]: Entering directory
`/global/u2/h/hpp/ompi/openmpi-gitclone/_build/test/class'
FAIL: opal_lifo
FAIL: opal_fifo
Has anyone else seen this?
Howard
supporting building open mpi with pgi compilers? I'm
using pgi
14.4.
Thanks,
Howard
sks for
it.
Howard
2014-12-11 7:42 GMT-08:00 Jeff Squyres (jsquyres) :
>
> On Dec 11, 2014, at 7:40 AM, Ralph Castain wrote:
>
> > I’m unaware of any conscious decision to cut pgi off - I think it has
> been more a case of nobody having a license to use for testing.
>
>
Nathan,
Please make sure the fix for this problem is contained in its own commit.
Howard
2014-12-12 9:38 GMT-07:00 Nathan Hjelm :
>
>
> Yeah, that code is completely wrong. I have a fix in my btl
> modifications branch.
>
>
> https://github.c
I'd prefer Paul's suggestion to disable xpmem for sgi/uv for 1.8.X
Is anyone actually supporting this?
Howard
2014-12-15 8:56 GMT-07:00 Nathan Hjelm :
>
>
> Not yet. I am still trying to pinpoint the problem. From what I can tell
> the SGI version of XPMEM should be nearly
/openmpi/mca_mtl_ofi.soundefined
symbol:
fi_getinfo: undefined symbol: fi_getinfo
[NID 05538] 2014-12-17 05:58:50 Apid 9226246: initiated application termination
Application 9226246 exit codes: 127
*Stderr*
Any ideas on how to fix this?
Thanks,
Howard
12-17 10:09 GMT-07:00 Jeff Squyres (jsquyres) :
>
> Is this on a PSM-enabled cluster?
>
> Can you send the full output from configure, the config.log, and the
> output from "make"?
>
> Are you building statically (i.e., libmpi.a)?
>
>
>
> On Dec 17, 2014,
cally (i.e., libmpi.a)?
>
>
>
> On Dec 17, 2014, at 12:04 PM, Howard Pritchard
> wrote:
>
> > I noticed my MTT smoke test failed with todays master build:
> >
> > name=PMI_process_mapping, (val=(vector,(0,4,4)))
> > ./c_hello./c_hello: : symbol lookup
Hi Josh,
I did another mtt run with --disable-libfabric included on the configure
line and still failed with the same problem, mtl/ofi thinks its okay to
build...
Howard
2014-12-17 11:48 GMT-07:00 Joshua Ladd :
>
> Seem to me this should be disabled by default until folks can quiet the
&
c 17, 2014, at 2:19 PM, Howard Pritchard wrote:
>
> > I did another mtt run with --disable-libfabric included on the configure
> line and still failed with the same problem, mtl/ofi thinks its okay to
> build...
>
> FWIW: this problem is because the switch is --without-libfabr
Hi Jeff,
Why did you delete the il
libmca_common_alps_so_version
thats going to break my stuff.
2014-12-17 14:36 GMT-07:00 :
>
> This is an automated email from the git hooks/post-receive script. It was
> generated because a ref change was pushed to the repository containing
> the project "ope
lt;https://github.com/open-mpi/ompi/wiki/Releasev19> on the wiki to include
this proposal.
Please let us know if you think that it might be problematic to relax
the ABI compatibility promise in the features release series.
This will be on the agenda for the developers' workshop next month.
Thanks,
Howard
implementation header
configure: error: Cannot continue
This is on a linux/x86_64/open suse box.
Howard
Hi Folks,
commit be6d4649
<https://github.com/open-mpi/ompi/commit/be6d46490f7b80d4f5ea90c859ccbebe96bdaaba>
broke
simple ./configure of master.
I'd like to revert this commit unless someone can figure
out a better solution to Gilles --without-hwloc issue
soon.
Howard
HI Eric,
Does your app also work with MPICH? The romio in Open MPI is getting a bit
old, so it would be useful to know if you see the same valgrind error using
a recent MPICH.
Howard
2014-12-19 9:50 GMT-07:00 Eric Chamberland :
>
> Hi,
>
> I encountered a new bug while testing ou
eir GRU api. I'm pretty sure that's the way the
sgi mpi delivers small messages efficiently.
Howard
2014-12-22 8:43 GMT-07:00 Nathan Hjelm :
>
> Yeah, I figured out why XPMEM is failing on SGI UV but have not figured
> out a fix. If possible can we remove the check for sn/xpmem
I agree. Please remove this config option.
2015-01-06 9:44 GMT-07:00 Nathan Hjelm :
>
> What: Remove the --disable-smp-locks configure option from master.
>
> Why: Use of this option produces incorrect results/undefined behavior
> when any shared memory BTL is in use. Since BTL usage is enabled
se_verbose 100
and check to see that in fact that env. variable isn't in the list
of passed env. variables?
Would one of you mind opening an issue to track this problem?
Thanks,
Howard
2015-01-09 7:52 GMT-07:00 Friedley, Andrew :
> No this is not expected behavior.
>
> The PSM
HI Folks,
Sorry for my stupidity. I now see the problem. App is calling pmi_init
twice because
of the new ofiwg libfabric mtl.
You can try
mpirun blah blah blah --mca btl
and things should work.
Howard
2015-01-09 7:52 GMT-07:00 Friedley, Andrew :
> No this is not expected behav
HI Adrian, Andrew,
Sorry try again, both the libfabric psm provider and the open mpi psm
mtl are trying to use psm_init.
So, to avoid this problem, add
--mca mtl psm
to your mpirun command line.
Sorry for the confusion.
Howard
2015-01-09 7:52 GMT-07:00 Friedley, Andrew :
> No this is
HI Adrian,
Please open an issue. We don't want users having to explicitly specify
the mtl to use just to get a job to run on a intel/infinipath system.
Howard
2015-01-09 13:04 GMT-07:00 Adrian Reber :
> Should I still open a ticket? Will these be changed or do I always have
>
;d have
non-trivial build
environments/runtime environments which would be better at testing
if something we broke something.
Howard
2015-01-09 17:36 GMT-07:00 Burette, Yohann :
> Hi,
>
> For those of you who don't know me, my name is Yohann Burette, I work for
> Intel and I c
e
aware of particular RDMA networks' capabilities, and avoid having to extend
the PML
interface with unnecessary methods.
Hope this helps,
Howard
2015-01-09 15:30 GMT-07:00 George Bosilca :
> I have some comments about this ticket and the corresponding patch.
> Honestly, the patch lacks
thanks George!
2015-01-15 11:43 GMT-07:00 George Bosilca :
> From the MPI standard perspective MPI_Cancel doesn't have to succeed, it
> can also gracefully fail. However, the PSM MTL diverges from the MPI
> standard and if a request cannot be canceled an error is returned. Here is
> a patch to f
Hi George,
I put this on the agenda for this week's meeting.
Howard
2015-01-23 16:43 GMT-07:00 George Bosilca :
> During some experiments we have identified several major issues with coll
> ML with a very recent version of Open MPI master (22ab638 Jan 20 13:21:44).
> Based on t
Hi Paul,
Thanks for checking in depth into this. Just to help in determining how to
proceed, which national center is this?
Howard
2015-02-02 19:35 GMT-07:00 Paul Hargrove :
> Below is one example of what happens when you assume that you can trust
> the libltdl installed an otherwis
+1
great stuff
2015-02-04 5:55 GMT-07:00 Jeff Squyres (jsquyres) :
> OMPI devs --
>
> Per lots of previous discussions, you all know that you can't assign
> labels, milestones, or users to issues/pull requests on the ompi-release
> repo.
>
> Gilles has written a Github bot that will allow you to
Hi Jeff
Gilles ideas are great.
I agree with your RM stamp of approval policy. No removal of rm approved in
the event of subsequent commits.
Howard
On Feb 5, 2015 5:04 AM, "Jeff Squyres (jsquyres)"
wrote:
> Gilles came up with a cool idea for the OMPIBot (see below). We can d
HI Jeff and Gilles
Do we have an ETA for enabling the bot on ompi-release?
I think it will be a great help.
Howard
HI Paul,
I'll fix this.
Howard
2015-02-06 17:38 GMT-07:00 Paul Hargrove :
> The following in orte/mca/ess/alps/Makefile.am assumes a GNU (or GNU-like)
> compiler:
>
> mca_ess_alps_la_CPPFLAGS = $(ess_alps_CPPFLAGS) -fno-ident
>
> If building with PGI, the result is
HI George,
I'd say commit cf377db82 explains the vanishing of the bandwidth metric as
well as the mis-labeling of the latency metric.
Howard
2015-02-10 18:41 GMT-07:00 George Bosilca :
> Somehow one of the most basic information about the capabilities of the
> BTLs (bandwidth)
do the above technique, the heap allocator blows up
in OBJ_RELEASE of buffer.
Thanks,
Howard
/u2/h/hpp/mtt_carver_tmp/installs/8v68/install ./c_hello
Before people begin blaming this as a cray thing, this is from the
NERSC carver system which is an ibm dataplex system running redhat and
using MLNX connectX HCAs.
Anyone else seeing these failures?
Howard
HI Ralph,
How does one get this "MPI Create success" message? Is there a mailing list
specifically for the nightly builds?
Thanks,
Howard
2015-02-16 21:48 GMT-07:00 Ralph Castain :
> It's the git id of the nightly tarball - which you should get via the MPI
> Create
I will also be available but suggest we skip next Tuesday.
On Feb 25, 2015 5:04 PM, "Ralph Castain" wrote:
> Hey folks
>
> Given that some number of us will be at the MPI Forum next week, do we
> have a quorum available for the weekly telecon? Who would be able to make
> it?
>
> Me: available
>
Hi Folks,
Just tried to build a fresh head of master and am getting
opal_verbs_want_fork_support as undefined symbol when trying to build opal
lib.
Any ideas on where this should go?
It would be nice to get jenkins checking everything, or at least a light
weight travis check.
Howard
es this problem.
Could Open MPI also potentially have this same problem? If so, I'd want to
add an mca param
to set this option before calling psm_ep_open within psm mtl. Hmm.. maybe
the ofi mtl
supporter should talk with the libfabric psm provider folks about this.
Thanks for any help,
Howard
Thanks Andrew, I was getting confused with the libfabric psm provider code
inside open mpi.
2015-03-03 9:35 GMT-07:00 Friedley, Andrew :
> Hi Howard,
>
>
>
> The PSM MTL sets PSM_EP_OPEN_AFFINITY_SKIP, so if I understand right, OMPI
> already has the fix for you.
>
>
just fine.
If somebody does care, let me know who and I'll send logs off-list.
However, this can be reproduced on carver.nersc.gov where at least Howard
and Nathan have accounts.
-Paul
--
Paul H. Hargrove phhargr...@lbl.gov
Computer Languages & Systems Software
HI Paul,
For the 10.9 and 11.9 does the libfabric get configured to build for you on
carver?
I get a failure at config.
I don't think this should be high priority since the libfabric embedding
within
open mpi should hopefully soon be a thing of the past.
Howard
2015-03-04 14:28 GMT-07:00
Are there known issues with the io tests? My first guess is yes
since I notice esslingen is excluding io from their MTT runs.
Thanks for any info,
Howard
HI Edgar,
Thanks for the explanation.
Howard
2015-03-06 12:43 GMT-07:00 Edgar Gabriel :
> this error message comes from ompio, the split collective are not properly
> implemented at this point in time, they are basically just a printf
> statement. Once I have the non-blocking colle
Die batman Lampe, ein toller Einfall!
!
2015-03-09 10:11 GMT-06:00 Mike Dubman :
>
> Hello,
>
> Please check updated OMPI wiki page for detailed information for Jenkins
> testing of OMPI repositories.
>
> https://github.com/open-mpi/ompi/wiki/PRJenkins
>
> Comments and suggestions are welcome.
>
don't define them? If not, I'll open an
issue.
Thanks,
Howard
haswell node, not using HT), using the DMA engine of the NIC
is not such a good idea.
Howard
2015-03-11 10:57 GMT-06:00 Nathan Hjelm :
>
> Definitely a side-effect though it could be beneficial in some cases as
> the RDMA engine in the HCA may be faster than using memcpy (larger
alled, and then make
sure there is an appropriate opal_tsd_key_destroy for the key during the
MPI_Finalize procedure. Alternately, since this is basically a dso
problem, one could define fini functions to run the destructors during the
dlclose procedure.
Any thoughts?
Howard
Hi Jeff
Minor cray corrections below
On Apr 17, 2015 6:57 AM, "Jeff Squyres (jsquyres)"
wrote:
>
> The v1.8 branch NEWS, README, and VERSION files have been updated in
preparation for the v1.8.5 release. Please double check them -- especially
NEWS, particularly to ensure that we are giving cred
HI Folks,
I'm seeing build failures on both carver/pgi at nersc and on a cray
internal machine
with the nightly build of master.
>From the cray box:
ommon_ugni.c:30:5: error: 'MCA_BASE_VERSION_2_0_0' undeclared here
(not in a function)
MCA_BASE_VERSION_2_0_0,
common_ugni.c:31:5: warning: in
h work well
for
the slurm/pmi systems at trilabs than for the Cray's.
I strongly encourage anyone wanting to use open mpi on cray systems
to use master (on good days, today is not such a day) at this point in time.
Sorry for the confusion.
Howard
2015-04-17 8:18 GMT-06:00 Jeff Squyres (j
try and patch up the 1.8 release to work on the cray
systems. Way too late in the game
for that branch.
But don't worry, for cory things will be running great using the 1.9/2.0.
Of course, maybe for cory beyond-mpi program
models might be more appropriate.
Howard
2015-04-17 14:16 GMT-06:0
"orte", NULL, NULL, "server", 0);
this just recently started appearing, perhaps today, but I've not been
running
anything over the weekend.
Howard
Hi Raphael,
Thanks very much for the patches.
Would one of the developers on the list have a system where they
can make these kernel limit changes and which have HCAs installed?
I don't have access to any system where I have such permissions.
Howard
2015-04-22 8:55 GMT-06:00 Ra
Hi Paul,
silly me. forgot this was a ulimit thing. I'll test on carver.
Howard
2015-04-22 10:45 GMT-06:00 Paul Hargrove :
> And here is the backtrace I probably should have provided in the previous
> email.
> -Paul
>
> #0 0x2b4107ce9265 in raise () from /
Hi Rafael,
I give you an A+ for effort. We always appreciate patches.
Howard
2015-04-22 12:43 GMT-06:00 Nathan Hjelm :
>
> Umm, why are you cleaning up this way. The allocated resources *should*
> be freed by the udcm_module_finalize call. If there is a bug in that
> path it sho
Hi Paul,
Portals4 may be able to work on cray XE/XC on top of IAA (ibverbs
simulation), but
it absolutely is not the support library for Cray interconnects since XE
days. Never was
on Cray XT either, as you point out that was portals 3.X.
Howard
2015-04-23 12:29 GMT-06:00 Paul Hargrove
Hi Folks,
I merged in the refresh of romio 3.1.4, special thanks to Gilles for doing
this!
I did some testing, but can't say it was extensive. If others would have
time
to run some of the MTT setups requesting romio rather than ompio for a bit
that would be great.
Thanks,
Howard
bizarre is that psmx_eq_open shouldn't be visible outside of the
libfabric.so itself. So
having libfabric internal symbols required in a ompi mca lib seems to be
incorrect.
Howard
Hi Folks,
Sorry for the comments pushed to PRs just a while ago. I was suppose to be
configuring jenkins for a different project, not ompi.
Sorry for the confusion.
Howard
Is this by any chance associated with issue 579?
2015-05-14 20:49 GMT-06:00 Ralph Castain :
> I'll look at the lines you cite, but that clearly isn't the problem we are
> seeing here. I can verify that because the test case:
>
> mpirun -n 1 sleep 1000
>
> does not open up any connections at all.
ected.
Following the KISS principal I would go with 2) returning a NULL rule when
there is no matching size in the rule file for the communicator in question.
Howard
2015-05-19 20:05 GMT-06:00 Gilles Gouaillardet :
> Folks,
>
> this is a follow-up of a discussion on the user ML sta
If the smoke test fails, send a naughty-gram to the committer and copy
devel. Pretty soon the developer will get trained to use the PR process,
unless they are that engineer I've yet to meet who always writes flawless
code.
Howard
>
> Back in the SVN days it was nice to have a trunk
looks good to me.
2015-05-28 12:54 GMT-06:00 Jeff Squyres (jsquyres) :
> I'd appreciate some feedback on
> https://github.com/open-mpi/ompi/commit/85f0fff1899ca8f4785776d0b301be513c33e675,
> where I updated README on master to describe the new version number scheme
> (note: this change is pending
Hi Folks,
The plan is to fork ompi master tomorrow to a 2.0 branch.
Last chance for any really good case that we should delay
this by a day or two.
Thanks,
Howard
Thanks for the heads up. taking a look.
2015-06-15 11:39 GMT-06:00 Ralph Castain :
> You might take a gander at the MTT results first - they don’t look very
> good on master :-(
>
>
> > On Jun 15, 2015, at 10:14 AM, Howard Pritchard
> wrote:
> >
> > Hi Folks,
l: Aborted (6)
[c1477:19137] Signal: Aborted (6)
[c1477:19137] Signal code: (-6)
[c1476:07375] Signal: Aborted (6)
c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion
`((0xdeafbeedULL << 32) +
0xdeafbeedULL) == ((opal_object_t *)
(mca_pml_ob1_recvreq))->obj_magic_id
he PR process?
We shouldn't have this kind of build failure anymore as long as people have
stopped bypassing
PR process.
Howard
2015-06-25 19:22 GMT-06:00 MPI Team :
>
> ERROR: Command returned a non-zero exist status (dev-1979-g13425e7):
>make -j 8 distcheck
>
> Sta
sorry, not true. look at the logs on IU.
runs at 3:07 and 4:08 IU time.
2015-06-25 21:46 GMT-06:00 Jeff Squyres (jsquyres) :
> Howard --
>
> The LANL distcheck jenkins hasn't been running all day.
>
>
> > On Jun 25, 2015, at 8:33 PM, Howard Pritchard
> wrote:
Hi Folks,
I'm seeing an error I've not seen before in the MTT runs on the ibm dataplex
at NERSC. The mpirun launched jobs are failing with
OMPI_PROC_BIND value is invalid
errors.
This is is for the trivial ring tests.
Is anyone else seeing these types of errors?
Howard
laki is also showing the errors:
Here's the shortened url:
http://goo.gl/Ra264U
looks like the badness started with the latest nightly.
I think there was some activity in the orte binding area recently.
Howard
2015-06-29 9:52 GMT-06:00 Jeff Squyres (jsquyres) :
> Can you provid
1:19 PM, Jeff Squyres (jsquyres) <
> jsquy...@cisco.com> wrote:
>
>> Ahh... it's OMP_PROC_BIND, not OMPI_PROC_BIND.
>>
>> Yes, Ralph just added this.
>>
>> I chatted with him about this on the phone moments ago; he's pretty sure
>> h
I decided just to disable the carver/pgi mtt runs.
2015-06-29 15:10 GMT-06:00 Ralph Castain :
> Very strange then - again, can you run it with the verbose flag and send
> me the output? I can't replicate what you are seeing.
>
>
> On Mon, Jun 29, 2015 at 4:05 PM, Howar
e put in place around the v2.x release to avoid these
kind of surprises there.
Needless to say I will not be admitting this PR in to v2.x until its cleaned
up enough to work with all major compilers, or else is only activated when
OMPI is compiled with an Intel compiler.
Howard
2015-06-30 16:00 G
Hi Folks,
I'm trying to locate on jaguar/www.open-mpi.org where the nightly
tarballs are for v1.10 and v2.x.
I'm needing these tarballs for our installs on some new systems
at LANL where we want to start out with these versions.
Thanks for any help,
Howard
#x27;t get past configure.
Not sure if this is specific to systems with psm installed yet.
Anyone else seen this?
Howard
Hi Folks,
Found the problem, had to do a hard reset to origin/master for some reason
to get missing files back.
Howard
2015-07-22 12:17 GMT-06:00 Jeff Squyres (jsquyres) :
> On Jul 22, 2015, at 1:46 PM, Howard Pritchard wrote:
> >
> > Hello Folks,
> >
> > I&qu
looks like ofi mtl is being naughty. its tje onlx mtl which registers with
opal progress in component init method.
--
sent from my smart phonr so no good type.
Howard
On Jul 23, 2015 7:03 PM, "Ralph Castain" wrote:
> It looks like one of the MTL components is registeri
Paul
Could you rerun with --mca mtl_base_verbose 10 added to cmd line and send
output?
Howard
--
sent from my smart phonr so no good type.
Howard
On Jul 23, 2015 6:06 PM, "Paul Hargrove" wrote:
> Yohann,
>
> With PR409 as it stands right now (commit 6daef310) I s
Jeff
I was wrong about this. all the mtls except for portals4 register with
opal progress in their comp init.
I dont see how this is a problem though as base select only invokes comp
init on the selected mtl.
Howard
--
sent from my smart phonr so no good type.
Howard
On Jul 24, 2015
Hi Jeff,
Nathan and I think this is generic to all the mtl's and masked by the stuff
in the cm select method for upping the priority
of the mtl. We'd see this behavior for all mtl's if this priority upping
code wasn't there and we fell back to ob1.
Howard
2015-07-24
Hi Folks,
Should we do something better than what is done currently in the
mca_pml_cm_component_init method around lines 158-162?
That's what's causing a bunch of problems right now in 1.10.
I'd like to see a better approach taken in the v2.x
Howard
test script.
The jenkin VM is a x86_64 (Intel E5-2690 cpus) VM running rhel 6.7.
Thanks,
Howard
HI Folks,
There's a new branch on open-mpi/ompi repo.
Is this intentional?
Howard
I will check if i can reproduce on nersc systems.
--
sent from my smart phonr so no good type.
Howard
On Aug 21, 2015 7:51 AM, "Ralph Castain" wrote:
> I’ll take a look at it
>
> > On Aug 20, 2015, at 11:34 PM, Mark Santcroos
> wrote:
> >
> > Hi a
I think rather than trying workarounds of dubious robustness inside open
mpi we
- dicument the issue on either the somewhat aged open mpi website faq or
add it to a wiki page on github
- file a bug against intel psm
--
sent from my smart phonr so no good type.
Howard
On Aug 25, 2015 6
is this going in to v2.x?
--
sent from my smart phonr so no good type.
Howard
On Aug 25, 2015 7:54 AM, wrote:
> This is an automated email from the git hooks/post-receive script. It was
> generated because a ref change was pushed to the repository containing
> the project
I'll update the java FAQ.
2015-08-25 8:36 GMT-06:00 Jeff Squyres (jsquyres) :
> On Aug 25, 2015, at 10:00 AM, Howard Pritchard
> wrote:
> >
> > I think rather than trying workarounds of dubious robustness inside open
> mpi we
> >
> > - dicument the issu
the mpirun script to include the --mca mtl ^psm tag if
> java is in the run string?
>
> -Nathan
>
> On Tue, Aug 25, 2015 at 9:47 AM, Howard Pritchard
> wrote:
>
>> I'll update the java FAQ.
>>
>> 2015-08-25 8:36 GMT-06:00 Jeff Squyres (jsquyres) :
101 - 200 of 286 matches
Mail list logo