[OMPI devel] [PATCH] iof/hnp: daemon part of the sink structure is not initialized when forwarding stdin to all ranks

2012-03-06 Thread nadia.derbey
Hi, When forwarding stdin to all ranks in the job (mpirun --stdin all), the following error message is output: -- [berlin73:02223] [[56600,0],0] ORTE_ERROR_LOG: A message is attempting to be sent to a process whose contact information is unknown in file ../../../../../orte/mca/rml

[OMPI devel] btl/openib: get_ib_dev_distance doesn't see processes as bound if the job has been launched by srun

2012-01-27 Thread nadia.derbey
Hi, If a job is launched using "srun --resv-ports --cpu_bind:..." and slurm is configured with: TaskPlugin=task/affinity TaskPluginParam=Cpusets each rank of that job is in a cpuset that contains a single CPU. Now, if we use carto on top of this, the following happens in get_ib_dev_distanc

[OMPI devel] Segfault in odls_fork_local_procs() for some values of npersocket

2011-11-08 Thread nadia.derbey
Hi, In v1.5, when mpirun is called with both the "-bind-to-core" and "-npersocket" options, and the npersocket value leads to fewer procs than sockets allocated on one node, we get a segfault. Testing environment: openmpi v1.5; 2 nodes with 4 8-core sockets each; mpirun -n 10 -bind-to-core -npersock

[OMPI devel] known limitation or bug in hwloc?

2011-08-29 Thread nadia.derbey
Hi list, I'm hitting a limitation with paffinity/hwloc with cpu numbers >= 64. In opal/mca/paffinity/hwloc/paffinity_hwloc_module.c, module_set() is the routine that sets the calling process affinity to the mask given as parameter. Note that "mask" is an opal_paffinity_base_cpu_set_t (so we allow

[OMPI devel] Fix a hang in carto_base_select() if carto_module_init() fails

2011-06-28 Thread nadia.derbey
Hi, When using the carto/file module with a syntactically incorrect carto file, we get stuck in opal_carto_base_select(). The attached trivial patch fixes the issue. Regards, Nadia -- nadia.derbey Fix a hang in carto_base_select if carto_module_init fails diff -r 94299d729b95 opal/mca

[OMPI devel] mpirun options in the MPMD model

2010-11-09 Thread nadia.derbey
like --output-filename is global to the job: even if it is given on any single line of an application context, with different values, the last value is the one that is actually taken as an output file prefix. Regards, Nadia -- nadia.derbey

[OMPI devel] potential memory leak in mpi_type_create_struct_f()?

2010-07-23 Thread nadia.derbey
Hi, In ompi/mpi/f77/type_create_struct_f.c, routine mpi_type_create_struct_f() allocates c_type_old_array with malloc(), but never frees it. Regards, Nadia -- nadia.derbey
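
The kind of leak reported above can be illustrated with a minimal C sketch. This is not the Open MPI source: the types f_handle_t and c_handle_t and the functions handle_f2c(), consume_types(), and create_struct_wrapper() are hypothetical stand-ins, assumed only to show a wrapper that builds a temporary converted array and releases it before returning.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-ins for Fortran and C datatype handles. */
typedef int  f_handle_t;
typedef long c_handle_t;

/* Hypothetical conversion from a Fortran handle to a C handle. */
static c_handle_t handle_f2c(f_handle_t f)
{
    return (c_handle_t)f * 10;
}

/* Hypothetical consumer of the converted array: sums the handles. */
static long consume_types(int count, const c_handle_t *types)
{
    long sum = 0;
    for (int i = 0; i < count; ++i) {
        sum += types[i];
    }
    return sum;
}

/*
 * Wrapper that allocates a temporary array, converts each handle,
 * calls the consumer, and frees the temporary array before returning.
 * Returns 0 on success and -1 if the allocation fails.
 */
int create_struct_wrapper(int count, const f_handle_t *f_types, long *out)
{
    assert(out != NULL);

    if (count <= 0) {
        *out = 0;
        return 0;
    }

    c_handle_t *c_types = malloc((size_t)count * sizeof *c_types);
    if (NULL == c_types) {
        return -1;
    }

    for (int i = 0; i < count; ++i) {
        c_types[i] = handle_f2c(f_types[i]);
    }

    *out = consume_types(count, c_types);

    /* Without this call, every invocation would leak the temporary array. */
    free(c_types);
    return 0;
}
```

In this sketch the temporary array is needed only for the duration of the call, so freeing it right after the consumer returns is sufficient; a tool such as valgrind could be used to check that no allocation is left outstanding.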

Re: [OMPI devel] Committing to release branches

2010-07-22 Thread nadia.derbey
-btl-openib.txt", "of unknown > > event", > > true,orte_process_info.nodename, > > orte_process_info.pid, > > - event.event_type, xrc_event ? "true" : "false"); > > +event_type, xrc_event ? "true" : "false"); > > } > > ibv_ack_async_event(&event); > > } else { > > ___ > > svn-full mailing list > > svn-f...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/svn-full > > > > -- nadia.derbey

Re: [OMPI devel] PATCH: Wrong event_type value passed in to show_help when getting xrc async events

2010-07-15 Thread nadia.derbey
On Thu, 2010-07-15 at 08:21 -0400, Jeff Squyres wrote: > On Jul 15, 2010, at 8:22 AM, nadia.derbey wrote: > > > So the solution is: > > 1. leave the intermediate event_type declared as an int. > > 2. then: > > . either cast it to ibv_event_type when c

Re: [OMPI devel] PATCH: Wrong event_type value passed in to show_help when getting xrc async events

2010-07-15 Thread nadia.derbey
On Thu, 2010-07-15 at 07:21 -0400, Jeff Squyres wrote: > On Jul 15, 2010, at 2:14 AM, nadia.derbey wrote: > > > The only warning I'm getting in the part of the code impacted by the > > patch is: > > - > > ../../../../../ompi/mca/btl/open

Re: [OMPI devel] PATCH: Wrong event_type value passed in to show_help when getting xrc async events

2010-07-15 Thread nadia.derbey
gards, On Wed, 2010-07-14 at 14:10 -0400, Jeff Squyres wrote: > Do you get additional warnings when compiling with the intel compiler (about > printf'ing an enum type)? I seem to recall that there's already a truckload > of those kinds of warnings... > > > On Jul 1

[OMPI devel] PATCH: Wrong event_type value passed in to show_help when getting xrc async events

2010-07-13 Thread nadia.derbey
this issue. Regards, Nadia -- nadia.derbey Wrong event_type value passed in to show_help when getting xrc async events diff -r e4bab4451664 ompi/mca/btl/openib/btl_openib_async.c --- a/ompi/mca/btl/openib/btl_openib_async.c Tue May 25 01:30:35 2010 +0200 +++ b/ompi/mca/btl/openib

[OMPI devel] v1.5: sigsegv in case of extremely low settings in the SRQs

2010-06-18 Thread nadia.derbey
Hi, Reference is the v1.5 branch. If an SRQ has the following settings: S,,4,2,1 1) setup_qps() sets the following: mca_btl_openib_component.qp_infos[qp].u.srq_qp.rd_num=4 mca_btl_openib_component.qp_infos[qp].u.srq_qp.rd_init=rd_num/4=1 2) create_srq() sets the following: openib_btl->qps[qp].u.