On Mon, 2006-11-27 at 21:11 -0500, George Bosilca wrote:
> Which version of Open MPI are you using ? We can figure out what's
> wrong if we have the output of "ompi_info" and "ompi_info --param all
> all".
Forgot the "ompi_info --param all all". It's attached.
- Matt
>
> I wonder if some of the memory is not related to the size of the
> shared memory file. The default way to compute the size of the shared
> memory file is defined by the MCA parameter mpool_sm_per_peer_size.
> By default is set to 128MB for each local peer. Therefore using 2048
> procs on 256 nodes lead to using 8 procs by node i.e. at least 1GB
> only for the SM file. The problem right now with the SM file is that
> we're not reusing the buffers multiple times, instead we're using a
> new fragment each time we send a message, forcing the OS to map the
> entire file at one point.
>
> george.
>
> On Nov 27, 2006, at 8:21 PM, Matt Leininger wrote:
>
> > On Mon, 2006-11-27 at 16:45 -0800, Matt Leininger wrote:
> >> Has anyone testing OMPI's alltoall at > 2000 MPI tasks? I'm
> >> seeing each
> >> MPI task eat up > 1GB of memory (just for OMPI - not the app).
> >
> > I gathered some more data using the alltoall benchmark in mpiBench.
> > mpiBench is pretty smart about how large its buffers are. I set it to
> > use <= 100MB.
> >
> > num nodes num MPI tasks system mem mpibench buffer mem
> > 128 1024 1 GB 65 MB
> > 160 1280 1.2 GB 82 MB
> > 192 1536 1.4 GB 98 MB
> > 224 1792 1.6 GB 57 MB
> > 256 2048 1.6-1.8 GB < 100 MB
> >
> > The 256 node run was killed by the OOM for using too much memory. For
> > all these tests the OMPI alltoall is using 1 GB or more of system
> > memory. I know LANL is looking into optimized alltoall, but is anyone
> > looking into the scalability of the memory footprint?
> >
> > Thanks,
> >
> > - Matt
> >
> >>
> >> Thanks,
> >>
> >> - Matt
> >>
> >>
> >>
> >> _______________________________________________
> >> devel mailing list
> >> [email protected]
> >> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> >>
> >
> >
> > _______________________________________________
> > devel mailing list
> > [email protected]
> > http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
> _______________________________________________
> devel mailing list
> [email protected]
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
MCA mca: parameter "mca_param_files" (current value:
"/g/g12/mlleinin/.openmpi/mca-params.conf:/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/etc/openmpi-mca-params.conf")
Path for MCA configuration files containing default
parameter values
MCA mca: parameter "mca_component_path" (current value:
"/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/lib/openmpi:/g/g12/mlleinin/.openmpi/components")
Path where to look for Open MPI and ORTE components
MCA mca: parameter "mca_verbose" (current value: <none>)
Top-level verbosity parameter
MCA mca: parameter "mca_component_show_load_errors" (current
value: "1")
Whether to show errors for components that failed to
load or not
MCA mca: parameter "mca_component_disable_dlopen" (current
value: "0")
Whether to attempt to disable opening dynamic
components or not
MCA mpi: parameter "mpi_param_check" (current value: "1")
Whether you want MPI API parameters checked at
run-time or not. Possible values are 0 (no checking) and 1 (perform checking
at run-time)
MCA mpi: parameter "mpi_yield_when_idle" (current value: "0")
Yield the processor when waiting for MPI
communication (for MPI processes, will default to 1 when oversubscribing nodes)
MCA mpi: parameter "mpi_event_tick_rate" (current value: "-1")
How often to progress TCP communications (0 = never,
otherwise specified in microseconds)
MCA mpi: parameter "mpi_show_handle_leaks" (current value: "0")
Whether MPI_FINALIZE shows all MPI handles that were
not freed or not
MCA mpi: parameter "mpi_no_free_handles" (current value: "0")
Whether to actually free MPI objects when their
handles are freed
MCA mpi: parameter "mpi_show_mca_params" (current value: "0")
Whether to show all MCA parameter value during
MPI_INIT or not (good for reproducability of MPI jobs)
MCA mpi: parameter "mpi_show_mca_params_file" (current value:
<none>)
If mpi_show_mca_params is true, setting this string
to a valid filename tells Open MPI to dump all the MCA parameter values into a
file suitable for reading via the mca_param_files parameter (good for
reproducability of MPI jobs)
MCA mpi: parameter "mpi_paffinity_alone" (current value: "0")
If nonzero, assume that this job is the only (set of)
process(es) running on each node and bind processes to processors, starting
with processor ID 0
MCA mpi: parameter "mpi_keep_peer_hostnames" (current value:
"1")
If nonzero, save the string hostnames of all MPI peer
processes (mostly for error / debugging output messages). This can add quite a
bit of memory usage to each MPI process.
MCA mpi: parameter "mpi_abort_delay" (current value: "0")
If nonzero, print out an identifying message when
MPI_ABORT is invoked (hostname, PID of the process that called MPI_ABORT) and
delay for that many seconds before exiting (a negative delay value means to
never abort). This allows attaching of a debugger before quitting the job.
MCA mpi: parameter "mpi_abort_print_stack" (current value: "0")
If nonzero, print out a stack trace when MPI_ABORT is
invoked
MCA mpi: parameter "mpi_preconnect_all" (current value: "0")
Whether to force MPI processes to create connections
/ warmup with *all* peers during MPI_INIT (vs. making connections lazily --
upon the first MPI traffic between each process peer pair)
MCA mpi: parameter "mpi_ddt_unpack_debug" (current value: "0")
Whether to output debugging information in the ddt
unpack functions (nonzero = enabled)
MCA mpi: parameter "mpi_ddt_pack_debug" (current value: "0")
Whether to output debugging information in the ddt
pack functions (nonzero = enabled)
MCA mpi: parameter "mpi_ddt_position_debug" (current value:
"0")
Non zero lead to output generated by the datatype
position functions
MCA mpi: parameter "mpi_ddt_copy_debug" (current value: "0")
Whether to output debugging information in the ddt
copy functions (nonzero = enabled)
MCA mpi: parameter "mpi_leave_pinned" (current value: "0")
leave_pinned
MCA mpi: parameter "mpi_leave_pinned_pipeline" (current value:
"0")
leave_pinned_pipeline
MCA orte: parameter "orte_base_user_debugger" (current value:
"totalview @mpirun@ -a @mpirun_args@ : fxp @mpirun@ -a @mpirun_args@")
Sequence of user-level debuggers to search for in
orterun
MCA orte: parameter "orte_debug" (current value: "0")
Whether or not to enable debugging output for all
ORTE components (0 or 1)
MCA orte: parameter "orte_debug_daemons" (current value: "0")
Whether or not to enable debugging of daemons (0 or 1)
MCA orte: parameter "orte_timing" (current value: "0")
Request that critical timing loops be measured
MCA opal: parameter "opal_signal" (current value: "6,7,8,11")
If a signal is received, display the stack trace frame
MCA backtrace: parameter "backtrace" (current value: <none>)
Default selection set of components for the backtrace
framework (<none> means "use all components that can be found")
MCA backtrace: parameter "backtrace_base_verbose" (current value:
"0")
Verbosity level for the backtrace framework (0 = no
verbosity)
MCA backtrace: parameter "backtrace_execinfo_priority" (current
value: "0")
MCA memory: parameter "memory" (current value: <none>)
Default selection set of components for the memory
framework (<none> means "use all components that can be found")
MCA memory: parameter "memory_base_verbose" (current value: "0")
Verbosity level for the memory framework (0 = no
verbosity)
MCA memory: parameter "memory_ptmalloc2_priority" (current value:
"0")
MCA paffinity: parameter "paffinity" (current value: <none>)
Default selection set of components for the paffinity
framework (<none> means "use all components that can be found")
MCA paffinity: parameter "paffinity_linux_priority" (current value:
"10")
Priority of the linux paffinity component
MCA paffinity: information "paffinity_linux_have_cpu_set_t" (value:
"1")
Whether this component was compiled on a system with
the type cpu_set_t or not (1 = yes, 0 = no)
MCA paffinity: information "paffinity_linux_CPU_ZERO_ok" (value: "1")
Whether this component was compiled on a system where
CPU_ZERO() is functional or broken (1 = functional, 0 = broken/not available)
MCA paffinity: information
"paffinity_linux_sched_setaffinity_num_params" (value: "3")
The number of parameters that sched_set_affinity()
takes on the machine where this component was compiled
MCA maffinity: parameter "maffinity" (current value: <none>)
Default selection set of components for the maffinity
framework (<none> means "use all components that can be found")
MCA maffinity: parameter "maffinity_first_use_priority" (current
value: "10")
Priority of the first_use maffinity component
MCA timer: parameter "timer" (current value: <none>)
Default selection set of components for the timer
framework (<none> means "use all components that can be found")
MCA timer: parameter "timer_base_verbose" (current value: "0")
Verbosity level for the timer framework (0 = no
verbosity)
MCA timer: parameter "timer_linux_priority" (current value: "0")
MCA allocator: parameter "allocator" (current value: <none>)
Default selection set of components for the allocator
framework (<none> means "use all components that can be found")
MCA allocator: parameter "allocator_base_verbose" (current value:
"0")
Verbosity level for the allocator framework (0 = no
verbosity)
MCA allocator: parameter "allocator_basic_priority" (current value:
"0")
MCA allocator: parameter "allocator_bucket_num_buckets" (current
value: "30")
MCA allocator: parameter "allocator_bucket_priority" (current value:
"0")
MCA coll: parameter "coll" (current value: <none>)
Default selection set of components for the coll
framework (<none> means "use all components that can be found")
MCA coll: parameter "coll_base_verbose" (current value: "0")
Verbosity level for the coll framework (0 = no
verbosity)
MCA coll: parameter "coll_basic_priority" (current value: "10")
Priority of the basic coll component
MCA coll: parameter "coll_basic_crossover" (current value: "4")
Minimum number of processes in a communicator before
using the logarithmic algorithms
MCA coll: parameter "coll_self_priority" (current value: "75")
MCA coll: parameter "coll_sm_priority" (current value: "0")
Priority of the sm coll component
MCA coll: parameter "coll_sm_control_size" (current value:
"4096")
Length of the control data -- should usually be
either the length of a cache line on most SMPs, or the size of a page on
machines that support direct memory affinity page placement (in bytes)
MCA coll: parameter "coll_sm_bootstrap_filename" (current
value: "shared_mem_sm_bootstrap")
Filename (in the Open MPI session directory) of the
coll sm component bootstrap rendezvous mmap file
MCA coll: parameter "coll_sm_bootstrap_num_segments" (current
value: "8")
Number of segments in the bootstrap file
MCA coll: parameter "coll_sm_fragment_size" (current value:
"8192")
Fragment size (in bytes) used for passing data
through shared memory (will be rounded up to the nearest control_size size)
MCA coll: parameter "coll_sm_mpool" (current value: "sm")
Name of the mpool component to use
MCA coll: parameter "coll_sm_comm_in_use_flags" (current value:
"2")
Number of "in use" flags, used to mark a message
passing area segment as currently being used or not (must be >= 2 and <=
comm_num_segments)
MCA coll: parameter "coll_sm_comm_num_segments" (current value:
"8")
Number of segments in each communicator's shared
memory message passing area (must be >= 2, and must be a multiple of
comm_in_use_flags)
MCA coll: parameter "coll_sm_tree_degree" (current value: "4")
Degree of the tree for tree-based operations (must be
=> 1 and <= min(control_size, 255))
MCA coll: information "coll_sm_shared_mem_used_bootstrap"
(value: "216")
Amount of shared memory used in the shared memory
bootstrap area (in bytes)
MCA coll: parameter "coll_sm_info_num_procs" (current value:
"4")
Number of processes to use for the calculation of the
shared_mem_size MCA information parameter (must be => 2)
MCA coll: information "coll_sm_shared_mem_used_data" (value:
"548864")
Amount of shared memory used in the shared memory
data area for info_num_procs processes (in bytes)
MCA coll: parameter "coll_tuned_priority" (current value: "30")
Priority of the tuned coll component
MCA coll: parameter
"coll_tuned_pre_allocate_memory_comm_size_limit" (current value: "32768")
Size of communicator were we stop pre-allocating
memory for the fixed internal buffer used for message requests etc that is hung
off the communicator data segment. I.e. if you have a 100'000 nodes you might
not want to pre-allocate 200'000 request handle slots per communicator instance!
MCA coll: parameter "coll_tuned_use_dynamic_rules" (current
value: "0")
Switch used to decide if we use static (compiled/if
statements) or dynamic (built at runtime) decision function rules
MCA coll: parameter "coll_tuned_init_tree_fanout" (current
value: "4")
Inital fanout used in the tree topologies for each
communicator. This is only an initial guess, if a tuned collective needs a
different fanout for an operation, it build it dynamically. This parameter is
only for the first guess and might save a little time
MCA coll: parameter "coll_tuned_init_chain_fanout" (current
value: "4")
Inital fanout used in the chain (fanout followed by
pipeline) topologies for each communicator. This is only an initial guess, if a
tuned collective needs a different fanout for an operation, it build it
dynamically. This parameter is only for the first guess and might save a little
time
MCA io: parameter "io_base_freelist_initial_size" (current
value: "16")
Initial MPI-2 IO request freelist size
MCA io: parameter "io_base_freelist_max_size" (current value:
"64")
Max size of the MPI-2 IO request freelist
MCA io: parameter "io_base_freelist_increment" (current
value: "16")
Increment size of the MPI-2 IO request freelist
MCA io: parameter "io" (current value: <none>)
Default selection set of components for the io
framework (<none> means "use all components that can be found")
MCA io: parameter "io_base_verbose" (current value: "0")
Verbosity level for the io framework (0 = no
verbosity)
MCA io: parameter "io_romio_priority" (current value: "10")
Priority of the io romio component
MCA io: parameter "io_romio_delete_priority" (current value:
"10")
Delete priority of the io romio component
MCA io: parameter "io_romio_enable_parallel_optimizations"
(current value: "0")
Enable set of Open MPI-added options to improve
collective file i/o performance
MCA mpool: parameter "mpool" (current value: <none>)
Default selection set of components for the mpool
framework (<none> means "use all components that can be found")
MCA mpool: parameter "mpool_base_verbose" (current value: "0")
Verbosity level for the mpool framework (0 = no
verbosity)
MCA mpool: parameter "mpool_openib_rcache_name" (current value:
"rb")
The name of the registration cache the mpool should
use
MCA mpool: parameter "mpool_openib_priority" (current value: "0")
MCA mpool: parameter "mpool_sm_allocator" (current value:
"bucket")
Name of allocator component to use with sm mpool
MCA mpool: parameter "mpool_sm_max_size" (current value:
"536870912")
Maximum size of the sm mpool shared memory file
MCA mpool: parameter "mpool_sm_min_size" (current value:
"134217728")
Minimum size of the sm mpool shared memory file
MCA mpool: parameter "mpool_sm_per_peer_size" (current value:
"33554432")
Size (in bytes) to allocate per local peer in the sm
mpool shared memory file, bounded by min_size and max_size
MCA mpool: parameter "mpool_sm_priority" (current value: "0")
MCA mpool: parameter "mpool_udapl_priority" (current value: "0")
MCA mpool: parameter "mpool_base_use_mem_hooks" (current value:
"0")
use memory hooks for deregistering freed memory
MCA mpool: parameter "mpool_use_mem_hooks" (current value: "0")
(deprecated, use mpool_base_use_mem_hooks)
MCA pml: parameter "pml" (current value: "ob1")
Default selection set of components for the pml
framework (<none> means "use all components that can be found")
MCA pml: parameter "pml_base_verbose" (current value: "0")
Verbosity level for the pml framework (0 = no
verbosity)
MCA pml: parameter "pml_cm_free_list_num" (current value: "4")
Initial size of request free lists
MCA pml: parameter "pml_cm_free_list_max" (current value: "-1")
Maximum size of request free lists
MCA pml: parameter "pml_cm_free_list_inc" (current value: "64")
Number of elements to add when growing request free
lists
MCA pml: parameter "pml_cm_priority" (current value: "1")
CM PML selection priority
MCA pml: parameter "pml_dr_free_list_num" (current value: "4")
MCA pml: parameter "pml_dr_free_list_max" (current value: "-1")
MCA pml: parameter "pml_dr_free_list_inc" (current value: "64")
MCA pml: parameter "pml_dr_priority" (current value: "1")
MCA pml: parameter "pml_dr_eager_limit" (current value:
"131072")
MCA pml: parameter "pml_dr_send_pipeline_depth" (current
value: "3")
MCA pml: parameter "pml_dr_wdog_timer_sec" (current value: "5")
MCA pml: parameter "pml_dr_wdog_timer_usec" (current value:
"0")
MCA pml: parameter "pml_dr_wdog_timer_multiplier" (current
value: "1")
MCA pml: parameter "pml_dr_wdog_retry_max" (current value: "1")
MCA pml: parameter "pml_dr_ack_timer_sec" (current value: "10")
MCA pml: parameter "pml_dr_ack_timer_usec" (current value: "0")
MCA pml: parameter "pml_dr_ack_timer_multiplier" (current
value: "1")
MCA pml: parameter "pml_dr_ack_retry_max" (current value: "3")
MCA pml: parameter "pml_dr_enable_csum" (current value: "1")
MCA pml: parameter "pml_ob1_free_list_num" (current value: "4")
MCA pml: parameter "pml_ob1_free_list_max" (current value:
"-1")
MCA pml: parameter "pml_ob1_free_list_inc" (current value:
"64")
MCA pml: parameter "pml_ob1_priority" (current value: "1")
MCA pml: parameter "pml_ob1_eager_limit" (current value:
"131072")
MCA pml: parameter "pml_ob1_send_pipeline_depth" (current
value: "3")
MCA pml: parameter "pml_ob1_recv_pipeline_depth" (current
value: "4")
MCA bml: parameter "bml" (current value: <none>)
Default selection set of components for the bml
framework (<none> means "use all components that can be found")
MCA bml: parameter "bml_base_verbose" (current value: "0")
Verbosity level for the bml framework (0 = no
verbosity)
MCA bml: parameter "bml_r2_show_unreach_errors" (current
value: "1")
Show error message when procs are unreachable
MCA bml: parameter "bml_r2_priority" (current value: "0")
MCA rcache: parameter "rcache" (current value: <none>)
Default selection set of components for the rcache
framework (<none> means "use all components that can be found")
MCA rcache: parameter "rcache_base_verbose" (current value: "0")
Verbosity level for the rcache framework (0 = no
verbosity)
MCA rcache: parameter "rcache_rb_priority" (current value: "0")
MCA rcache: parameter "rcache_vma_mru_len" (current value: "256")
The maximum size IN ENTRIES of the MRU (most recently
used) rcache list
MCA rcache: parameter "rcache_vma_mru_size" (current value:
"1073741824")
The maximum size IN BYTES of the MRU (most recently
used) rcache list
MCA rcache: parameter "rcache_vma_priority" (current value: "0")
MCA btl: parameter "btl_base_debug" (current value: "0")
If btl_base_debug is 1 standard debug is output, if >
1 verbose debug is output
MCA btl: parameter "btl" (current value: <none>)
Default selection set of components for the btl
framework (<none> means "use all components that can be found")
MCA btl: parameter "btl_base_verbose" (current value: "0")
Verbosity level for the btl framework (0 = no
verbosity)
MCA btl: parameter "btl_openib_verbose" (current value: "0")
Output some verbose OpenIB BTL information (0 = no
output, nonzero = output)
MCA btl: parameter "btl_openib_warn_no_hca_params_found"
(current value: "1")
Warn when no HCA-specific parameters are found in the
INI file specified by the btl_openib_hca_param_files MCA parameter (0 = do not
warn; any other value = warn)
MCA btl: parameter "btl_openib_warn_default_gid_prefix"
(current value: "1")
Warn when there is more than one active ports and at
least one of them connected to the network with only default GID prefix
configured (0 = do not warn; any other value = warn)
MCA btl: parameter "btl_openib_hca_param_files" (current
value:
"/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/share/openmpi/mca-btl-openib-hca-params.ini")
Colon-delimited list of INI-style files that contain
HCA vendor/part-specific parameters
MCA btl: parameter "btl_openib_max_btls" (current value: "-1")
Maximum number of HCA ports to use (-1 = use all
available, otherwise must be >= 1)
MCA btl: parameter "btl_openib_free_list_num" (current value:
"8")
Intial size of free lists (must be >= 1)
MCA btl: parameter "btl_openib_free_list_max" (current value:
"-1")
Maximum size of free lists (-1 = infinite, otherwise
must be >= 0)
MCA btl: parameter "btl_openib_free_list_inc" (current value:
"32")
Increment size of free lists (must be >= 1)
MCA btl: parameter "btl_openib_mpool" (current value: "openib")
Name of the memory pool to be used (it is unlikely
that you will ever want to change this
MCA btl: parameter "btl_openib_reg_mru_len" (current value:
"16")
Length of the registration cache most recently used
list (must be >= 1)
MCA btl: parameter "btl_openib_ib_cq_size" (current value:
"1000")
Size of the IB completion queue (will automatically
be set to a minimum of (2 * number_of_peers * btl_openib_rd_num))
MCA btl: parameter "btl_openib_ib_sg_list_size" (current
value: "4")
Size of IB segment list (must be >= 1)
MCA btl: parameter "btl_openib_ib_pkey_ix" (current value: "0")
InfiniBand pkey index (must be >= 0)
MCA btl: parameter "btl_openib_ib_psn" (current value: "0")
InfiniBand packet sequence starting number (must be
>= 0)
MCA btl: parameter "btl_openib_ib_qp_ous_rd_atom" (current
value: "4")
InfiniBand outstanding atomic reads (must be >= 0)
MCA btl: parameter "btl_openib_ib_mtu" (current value: "3")
IB MTU, in bytes (if not specified in INI files).
Valid values are: 1=256 bytes, 2=512 bytes, 3=1024 bytes, 4=2048 bytes, 5=4096
bytes
MCA btl: parameter "btl_openib_ib_min_rnr_timer" (current
value: "5")
InfiniBand minimum "receiver not ready" timer, in
seconds (must be >= 1)
MCA btl: parameter "btl_openib_ib_timeout" (current value:
"10")
InfiniBand transmit timeout, in seconds(must be >= 1)
MCA btl: parameter "btl_openib_ib_retry_count" (current value:
"7")
InfiniBand transmit retry count (must be >= 1)
MCA btl: parameter "btl_openib_ib_rnr_retry" (current value:
"7")
InfiniBand "receiver not ready" retry count (must be
>= 1)
MCA btl: parameter "btl_openib_ib_max_rdma_dst_ops" (current
value: "4")
InfiniBand maximum pending RDMA destination
operations (must be >= 1)
MCA btl: parameter "btl_openib_ib_service_level" (current
value: "0")
InfiniBand service level (must be >= 0)
MCA btl: parameter "btl_openib_ib_static_rate" (current value:
"0")
InfiniBand static rate (must be >= 0; defulat: %d)
MCA btl: parameter "btl_openib_exclusivity" (current value:
"1024")
OpenIB BTL exclusivity (must be >= 0)
MCA btl: parameter "btl_openib_rd_num" (current value: "8")
Number of receive descriptors to post to a queue pair
(must be >= 1)
MCA btl: parameter "btl_openib_rd_low" (current value: "6")
Low water mark before reposting occurs (must be >= 1)
MCA btl: parameter "btl_openib_rd_win" (current value: "4")
Window size at which generate explicit credit message
(must be >= 1)
MCA btl: parameter "btl_openib_use_srq" (current value: "0")
If nonzero, use the InfiniBand shared receive queue
("SRQ")
MCA btl: parameter "btl_openib_srq_rd_max" (current value:
"1000")
Maxium number of receive descriptors posted per SRQ
(only relevant if btl_openib_use_srq is true; must be >= 1)
MCA btl: parameter "btl_openib_srq_rd_per_peer" (current
value: "16")
Number of receive descriptors posted per peer in the
SRQ (only relevant if btl_openib_use_srq is true; must be >= 1)
MCA btl: parameter "btl_openib_srq_sd_max" (current value: "8")
Maximum number of send descriptors posted (only
relevant if btl_openib_use_srq is true; must be >= 1)
MCA btl: parameter "btl_openib_use_eager_rdma" (current value:
"1")
Use RDMA for eager messages
MCA btl: parameter "btl_openib_eager_rdma_threshold" (current
value: "16")
Use RDMA for short messages after this number of
messages are received from a given peer (must be >= 1)
MCA btl: parameter "btl_openib_max_eager_rdma" (current value:
"16")
Maximum number of peers allowed to use RDMA for short
messages (RDMA is used for all long messages, except if explicitly disabled,
such as with the "dr" pml) (must be >= 0)
MCA btl: parameter "btl_openib_eager_rdma_num" (current value:
"16")
Number of RDMA buffers to allocate for small
messages(must be >= 1)
MCA btl: parameter "btl_openib_btls_per_lid" (current value:
"1")
Number of BTLs to create for each InfiniBand LID
(must be >= 1)
MCA btl: parameter "btl_openib_max_lmc" (current value: "0")
Maximum number of LIDs to use for each HCA port (must
be >= 0, where 0 = use all available)
MCA btl: parameter "btl_openib_buffer_alignment" (current
value: "64")
Prefered communication buffer alignment, in bytes
(must be >= 0)
MCA btl: parameter "btl_openib_eager_limit" (current value:
"12288")
Eager send limit, in bytes (must be >= 1)
MCA btl: parameter "btl_openib_min_send_size" (current value:
"32768")
Minimum send size, in bytes (must be >= 1)
MCA btl: parameter "btl_openib_max_send_size" (current value:
"65536")
Maximum send size, in bytes (must be >= 1)
MCA btl: parameter "btl_openib_min_rdma_size" (current value:
"1048576")
Minimum RDMA size, in bytes (must be >= 1)
MCA btl: parameter "btl_openib_max_rdma_size" (current value:
"1048576")
Maximium RDMA size, in bytes (must be >= 1)
MCA btl: parameter "btl_openib_flags" (current value: "54")
BTL flags, added together: SEND=1, PUT=2, GET=4
(cannot be 0)
MCA btl: parameter "btl_openib_bandwidth" (current value:
"800")
Approximate maximum bandwidth of network (must be >=
1)
MCA btl: parameter "btl_openib_priority" (current value: "0")
MCA btl: parameter "btl_self_free_list_num" (current value:
"0")
Number of fragments by default
MCA btl: parameter "btl_self_free_list_max" (current value:
"-1")
Maximum number of fragments
MCA btl: parameter "btl_self_free_list_inc" (current value:
"32")
Increment by this number of fragments
MCA btl: parameter "btl_self_eager_limit" (current value:
"131072")
Eager size fragmeng (before the rendez-vous ptotocol)
MCA btl: parameter "btl_self_min_send_size" (current value:
"262144")
Minimum fragment size after the rendez-vous
MCA btl: parameter "btl_self_max_send_size" (current value:
"262144")
Maximum fragment size after the rendez-vous
MCA btl: parameter "btl_self_min_rdma_size" (current value:
"2147483647")
Maximum fragment size for the RDMA transfer
MCA btl: parameter "btl_self_max_rdma_size" (current value:
"2147483647")
Maximum fragment size for the RDMA transfer
MCA btl: parameter "btl_self_exclusivity" (current value:
"65536")
Device exclusivity
MCA btl: parameter "btl_self_flags" (current value: "10")
Active behavior flags
MCA btl: parameter "btl_self_priority" (current value: "0")
MCA btl: parameter "btl_sm_free_list_num" (current value: "8")
MCA btl: parameter "btl_sm_free_list_max" (current value: "-1")
MCA btl: parameter "btl_sm_free_list_inc" (current value: "64")
MCA btl: parameter "btl_sm_exclusivity" (current value:
"65535")
MCA btl: parameter "btl_sm_latency" (current value: "100")
MCA btl: parameter "btl_sm_max_procs" (current value: "-1")
MCA btl: parameter "btl_sm_sm_extra_procs" (current value: "2")
MCA btl: parameter "btl_sm_mpool" (current value: "sm")
MCA btl: parameter "btl_sm_eager_limit" (current value: "4096")
MCA btl: parameter "btl_sm_max_frag_size" (current value:
"32768")
MCA btl: parameter "btl_sm_size_of_cb_queue" (current value:
"128")
MCA btl: parameter "btl_sm_cb_lazy_free_freq" (current value:
"120")
MCA btl: parameter "btl_sm_priority" (current value: "0")
MCA btl: parameter "btl_tcp_if_include" (current value: <none>)
MCA btl: parameter "btl_tcp_if_exclude" (current value: "lo")
MCA btl: parameter "btl_tcp_free_list_num" (current value: "8")
MCA btl: parameter "btl_tcp_free_list_max" (current value:
"-1")
MCA btl: parameter "btl_tcp_free_list_inc" (current value:
"32")
MCA btl: parameter "btl_tcp_sndbuf" (current value: "131072")
MCA btl: parameter "btl_tcp_rcvbuf" (current value: "131072")
MCA btl: parameter "btl_tcp_endpoint_cache" (current value:
"30720")
MCA btl: parameter "btl_tcp_exclusivity" (current value: "0")
MCA btl: parameter "btl_tcp_eager_limit" (current value:
"65536")
MCA btl: parameter "btl_tcp_min_send_size" (current value:
"65536")
MCA btl: parameter "btl_tcp_max_send_size" (current value:
"131072")
MCA btl: parameter "btl_tcp_min_rdma_size" (current value:
"131072")
MCA btl: parameter "btl_tcp_max_rdma_size" (current value:
"2147483647")
MCA btl: parameter "btl_tcp_flags" (current value: "58")
MCA btl: parameter "btl_tcp_priority" (current value: "0")
MCA btl: parameter "btl_udapl_free_list_num" (current value:
"8")
MCA btl: parameter "btl_udapl_free_list_max" (current value:
"-1")
MCA btl: parameter "btl_udapl_free_list_inc" (current value:
"8")
MCA btl: parameter "btl_udapl_mpool" (current value: "udapl")
MCA btl: parameter "btl_udapl_max_modules" (current value: "8")
MCA btl: parameter "btl_udapl_evd_qlen" (current value: "32")
MCA btl: parameter "btl_udapl_num_recvs" (current value: "8")
MCA btl: parameter "btl_udapl_num_sends" (current value: "8")
MCA btl: parameter "btl_udapl_timeout" (current value:
"10000000")
MCA btl: parameter "btl_udapl_exclusivity" (current value:
"1014")
MCA btl: parameter "btl_udapl_eager_limit" (current value:
"32768")
MCA btl: parameter "btl_udapl_min_send_size" (current value:
"16384")
MCA btl: parameter "btl_udapl_max_send_size" (current value:
"65536")
MCA btl: parameter "btl_udapl_min_rdma_size" (current value:
"524288")
MCA btl: parameter "btl_udapl_max_rdma_size" (current value:
"131072")
MCA btl: parameter "btl_udapl_bandwidth" (current value: "225")
MCA btl: parameter "btl_udapl_priority" (current value: "0")
MCA btl: parameter "btl_base_include" (current value: <none>)
MCA btl: parameter "btl_base_exclude" (current value: <none>)
MCA btl: parameter "btl_base_warn_component_unused" (current
value: "1")
This parameter is used to turn on warning messages
when certain NICs are not used
MCA mtl: parameter "mtl" (current value: <none>)
Default selection set of components for the mtl
framework (<none> means "use all components that can be found")
MCA mtl: parameter "mtl_base_verbose" (current value: "0")
Verbosity level for the mtl framework (0 = no
verbosity)
MCA topo: parameter "topo" (current value: <none>)
Default selection set of components for the topo
framework (<none> means "use all components that can be found")
MCA topo: parameter "topo_base_verbose" (current value: "0")
Verbosity level for the topo framework (0 = no
verbosity)
MCA osc: parameter "osc" (current value: <none>)
Default selection set of components for the osc
framework (<none> means "use all components that can be found")
MCA osc: parameter "osc_base_verbose" (current value: "0")
Verbosity level for the osc framework (0 = no
verbosity)
MCA osc: parameter "osc_pt2pt_no_locks" (current value: "0")
Enable optimizations available only if MPI_LOCK is
not used.
MCA osc: parameter "osc_pt2pt_eager_limit" (current value:
"16384")
Max size of eagerly sent data
MCA osc: parameter "osc_pt2pt_priority" (current value: "0")
MCA osc: parameter "osc_rdma_fence_sync_method" (current
value: "reduce_scatter")
How to synchronize fence: reduce_scatter, allreduce,
alltoall
MCA osc: parameter "osc_rdma_eager_send" (current value: "0")
Attempt to start data movement during communication
call, instead of at synchrnoization time. Info key of same name overrides this
value, if info key given.
MCA osc: parameter "osc_rdma_no_locks" (current value: "0")
Enable optimizations available only if MPI_LOCK is
not used.
MCA osc: parameter "osc_rdma_priority" (current value: "0")
MCA errmgr: parameter "errmgr" (current value: <none>)
Default selection set of components for the errmgr
framework (<none> means "use all components that can be found")
MCA errmgr: parameter "errmgr_hnp_debug" (current value: "0")
MCA errmgr: parameter "errmgr_hnp_priority" (current value: "0")
MCA errmgr: parameter "errmgr_orted_debug" (current value: "0")
MCA errmgr: parameter "errmgr_orted_priority" (current value: "0")
MCA errmgr: parameter "errmgr_proxy_debug" (current value: "0")
MCA errmgr: parameter "errmgr_proxy_priority" (current value: "0")
MCA gpr: parameter "gpr_base_maxsize" (current value:
"2147483647")
MCA gpr: parameter "gpr_base_blocksize" (current value: "512")
MCA gpr: parameter "gpr" (current value: <none>)
Default selection set of components for the gpr
framework (<none> means "use all components that can be found")
MCA gpr: parameter "gpr_null_priority" (current value: "0")
MCA gpr: parameter "gpr_proxy_debug" (current value: "0")
MCA gpr: parameter "gpr_proxy_priority" (current value: "0")
MCA gpr: parameter "gpr_replica_debug" (current value: "0")
MCA gpr: parameter "gpr_replica_isolate" (current value: "0")
MCA gpr: parameter "gpr_replica_priority" (current value: "0")
MCA iof: parameter "iof_base_window_size" (current value:
"4096")
MCA iof: parameter "iof_base_service" (current value: "0.0.0")
MCA iof: parameter "iof" (current value: <none>)
Default selection set of components for the iof
framework (<none> means "use all components that can be found")
MCA iof: parameter "iof_proxy_debug" (current value: "1")
MCA iof: parameter "iof_proxy_priority" (current value: "0")
MCA iof: parameter "iof_svc_debug" (current value: "1")
MCA iof: parameter "iof_svc_priority" (current value: "0")
MCA ns: parameter "ns" (current value: <none>)
Default selection set of components for the ns
framework (<none> means "use all components that can be found")
MCA ns: parameter "ns_proxy_debug" (current value: "0")
MCA ns: parameter "ns_proxy_maxsize" (current value:
"2147483647")
MCA ns: parameter "ns_proxy_blocksize" (current value: "512")
MCA ns: parameter "ns_proxy_priority" (current value: "0")
MCA ns: parameter "ns_replica_debug" (current value: "0")
MCA ns: parameter "ns_replica_isolate" (current value: "0")
MCA ns: parameter "ns_replica_maxsize" (current value:
"2147483647")
MCA ns: parameter "ns_replica_blocksize" (current value:
"512")
MCA ns: parameter "ns_replica_priority" (current value: "0")
MCA oob: parameter "oob" (current value: <none>)
Default selection set of components for the oob
framework (<none> means "use all components that can be found")
MCA oob: parameter "oob_base_verbose" (current value: "0")
Verbosity level for the oob framework (0 = no
verbosity)
MCA oob: parameter "oob_tcp_peer_limit" (current value: "-1")
MCA oob: parameter "oob_tcp_peer_retries" (current value: "60")
MCA oob: parameter "oob_tcp_debug" (current value: "0")
MCA oob: parameter "oob_tcp_include" (current value: <none>)
MCA oob: parameter "oob_tcp_exclude" (current value: <none>)
MCA oob: parameter "oob_tcp_sndbuf" (current value: "131072")
MCA oob: parameter "oob_tcp_rcvbuf" (current value: "131072")
MCA oob: parameter "oob_tcp_connect_timeout" (current value:
"10")
connect() timeout in seconds, before trying next
interface
MCA oob: parameter "oob_tcp_listen_mode" (current value:
"event")
Mode for HNP to accept incoming connections: event,
listen_thread
MCA oob: parameter "oob_tcp_listen_thread_max_queue" (current
value: "10")
High water mark for queued accepted socket list size
MCA oob: parameter "oob_tcp_listen_thread_max_time" (current
value: "10")
Maximum amount of time (in milliseconds) to wait
between processing accepted socket list
MCA oob: parameter "oob_tcp_accept_spin_count" (current value:
"10")
Number of times to let accept return EWOULDBLOCK
before updating accepted socket list
MCA oob: parameter "oob_tcp_priority" (current value: "0")
MCA ras: parameter "ras" (current value: <none>)
MCA ras: parameter "ras_dash_host_priority" (current value:
"5")
Selection priority for the dash_host RAS component
MCA ras: parameter "ras_gridengine_debug" (current value: "0")
Enable debugging output for the gridengine ras
component
MCA ras: parameter "ras_gridengine_priority" (current value:
"100")
Priority of the gridengine ras component
MCA ras: parameter "ras_gridengine_verbose" (current value:
"0")
Enable verbose output for the gridengine ras component
MCA ras: parameter "ras_gridengine_show_jobid" (current value:
"0")
Show the JOB_ID of the Grid Engine job
MCA ras: parameter "ras_localhost_priority" (current value:
"0")
Selection priority for the localhost RAS component
MCA ras: parameter "ras_slurm_priority" (current value: "75")
Priority of the slurm ras component
MCA rds: parameter "rds" (current value: <none>)
MCA rds: parameter "rds_hostfile_debug" (current value: "0")
Toggle debug output for hostfile RDS component
MCA rds: parameter "rds_hostfile_path" (current value:
"/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/etc/openmpi-default-hostfile")
ORTE Host filename
MCA rds: parameter "rds_hostfile_priority" (current value: "0")
MCA rds: parameter "rds_proxy_priority" (current value: "0")
MCA rds: parameter "rds_resfile_debug" (current value: "0")
Toggle debug output for resfile RDS component
MCA rds: parameter "rds_resfile_name" (current value: <none>)
ORTE Resource filename
MCA rds: parameter "rds_resfile_priority" (current value: "0")
MCA rmaps: parameter "rmaps_base_verbose" (current value: "0")
Verbosity level for the rmaps framework
MCA rmaps: parameter "rmaps_base_schedule_policy" (current
value: "slot")
Scheduling Policy for RMAPS. [slot | node]
MCA rmaps: parameter "rmaps_base_pernode" (current value: "0")
Request one ppn if num procs not specified
MCA rmaps: parameter "rmaps_base_schedule_local" (current value:
"1")
If nonzero, allow scheduling MPI applications on the
same node as mpirun (default). If zero, do not schedule any MPI applications
on the same node as mpirun
MCA rmaps: parameter "rmaps_base_no_oversubscribe" (current
value: "0")
If nonzero, then do not allow oversubscription of
nodes - mpirun will return an error if there aren't enough nodes to launch all
processes without oversubscribing
MCA rmaps: parameter "rmaps" (current value: <none>)
Default selection set of components for the rmaps
framework (<none> means "use all components that can be found")
MCA rmaps: parameter "rmaps_proxy_debug" (current value: "0")
MCA rmaps: parameter "rmaps_proxy_priority" (current value: "0")
MCA rmaps: parameter "rmaps_round_robin_debug" (current value:
"1")
Toggle debug output for Round Robin RMAPS component
MCA rmaps: parameter "rmaps_round_robin_priority" (current
value: "1")
Selection priority for Round Robin RMAPS component
MCA rmgr: parameter "rmgr" (current value: <none>)
Default selection set of components for the rmgr
framework (<none> means "use all components that can be found")
MCA rmgr: parameter "rmgr_proxy_priority" (current value: "0")
MCA rmgr: parameter "rmgr_urm_priority" (current value: "0")
MCA rml: parameter "rml" (current value: <none>)
Default selection set of components for the rml
framework (<none> means "use all components that can be found")
MCA rml: parameter "rml_base_verbose" (current value: "0")
Verbosity level for the rml framework (0 = no
verbosity)
MCA rml: parameter "rml_oob_priority" (current value: "0")
MCA pls: parameter "pls" (current value: <none>)
Default selection set of components for the pls
framework (<none> means "use all components that can be found")
MCA pls: parameter "pls_base_verbose" (current value: "0")
Verbosity level for the pls framework (0 = no
verbosity)
MCA pls: parameter "pls_gridengine_debug" (current value: "0")
Enable debugging of gridengine pls component
MCA pls: parameter "pls_gridengine_verbose" (current value:
"0")
Enable verbose output of the gridengine qrsh -inherit
command
MCA pls: parameter "pls_gridengine_priority" (current value:
"100")
Priority of the gridengine pls component
MCA pls: parameter "pls_gridengine_orted" (current value:
"orted")
The command name that the gridengine pls component
will invoke for the ORTE daemon
MCA pls: parameter "pls_proxy_priority" (current value: "0")
MCA pls: parameter "pls_rsh_debug" (current value: "0")
Whether or not to enable debugging output for the rsh
pls component (0 or 1)
MCA pls: parameter "pls_rsh_num_concurrent" (current value:
"128")
How many pls_rsh_agent instances to invoke
concurrently (must be > 0)
MCA pls: parameter "pls_rsh_orted" (current value: "orted")
The command name that the rsh pls component will
invoke for the ORTE daemon
MCA pls: parameter "pls_rsh_priority" (current value: "10")
Priority of the rsh pls component
MCA pls: parameter "pls_rsh_delay" (current value: "1")
Delay (in seconds) between invocations of the remote
agent, but only used when the "debug" MCA parameter is true, or the top-level
MCA debugging is enabled (otherwise this value is ignored)
MCA pls: parameter "pls_rsh_reap" (current value: "1")
If set to 1, wait for all the processes to complete
before exiting. Otherwise, quit immediately -- without waiting for
confirmation that all other processes in the job have completed.
MCA pls: parameter "pls_rsh_assume_same_shell" (current value:
"1")
If set to 1, assume that the shell on the remote node
is the same as the shell on the local node. Otherwise, probe for what the
remote shell.
MCA pls: parameter "pls_rsh_agent" (current value: "ssh : rsh")
The command used to launch executables on remote
nodes (typically either "ssh" or "rsh")
MCA pls: parameter "pls_slurm_debug" (current value: "0")
Enable debugging of slurm pls
MCA pls: parameter "pls_slurm_priority" (current value: "75")
Default selection priority
MCA pls: parameter "pls_slurm_orted" (current value: "orted")
Command to use to start proxy orted
MCA pls: parameter "pls_slurm_args" (current value: <none>)
Custom arguments to srun
MCA sds: parameter "sds" (current value: <none>)
Default selection set of components for the sds
framework (<none> means "use all components that can be found")
MCA sds: parameter "sds_base_verbose" (current value: "0")
Verbosity level for the sds framework (0 = no
verbosity)
MCA sds: parameter "sds_env_priority" (current value: "0")
MCA sds: parameter "sds_pipe_priority" (current value: "0")
MCA sds: parameter "sds_seed_priority" (current value: "0")
MCA sds: parameter "sds_singleton_priority" (current value:
"0")
MCA sds: parameter "sds_slurm_priority" (current value: "0")