Thank Ralph, i will take a crack at it, and make the error propagatable.
Cheers, Gilles On Wed, Nov 23, 2016 at 3:34 AM, r...@open-mpi.org <r...@open-mpi.org> wrote: > The “correct” answer is, of course, to propagate the error upwards so that > the highest level caller (e.g., MPI_Init or ompi_info) can return it to the > user, who can then decide what to do. > > Disregarding the parameter is not an option as it violates our “do what the > user said to do, else return an error” policy > > On Nov 21, 2016, at 9:23 PM, Gilles Gouaillardet <gil...@rist.or.jp> wrote: > > Paul, > > > SIGSEGV is always a bad idea, even after having displayed a comprehensive > and user friendly error message > > > -------------------------------------------------------------------------- > MCA framework parameters can only take a single negation operator > ("^"), and it must be at the beginning of the value. The following > value violates this rule: > > ^tcp,^ib > > When used, the negation operator sets the "exclusive" behavior mode, > meaning that it will exclude all specified components (and implicitly > include all others). If the negation operator is not specified, the > "inclusive" mode is assumed, meaning that all specified components > will be included (and implicitly exclude all others). > > For example, "^a,b" specifies the exclusive behavior and means "use > all components *except* a and b", while "c,d" specifies the inclusive > behavior and means "use *only* components c and d." > > You cannot mix inclusive and exclusive behavior. > -------------------------------------------------------------------------- > > > > that raises the question, what should we do when we run into this case ? > > - one option is to propagate the error (currently, functions do not return > anything) (and do what after ?) > > - an other option is to brutally exit(1) > > - yet an other option is to disregard the incorrect value of the parameter > and continue > > > any thoughts anyone ? > > Cheers, > > Gilles > > On 11/14/2016 9:28 PM, Paul Kapinos wrote: > > Dear developers, > also the following issue is defintely raised by a misconfiguration of Open > MPI, SIGSEGV's in 'ompi_info' isn'n a good thing, thus this one mail. > > Just call: > $ export OMPI_MCA_mtl="^tcp,^ib" > $ ompi_info --param all all --level 9 > ... and take a look at the below core dump of 'ompi_info' like below one. > > (yes we know that "^tcp,^ib" is a bad idea). > > Have a nice day, > > Paul Kapinos > > P.S. Open MPI: 1.10.4 and 2.0.1 have the same behaviour > > -------------------------------------------------------------------------- > [lnm001:39957] *** Process received signal *** > [lnm001:39957] Signal: Segmentation fault (11) > [lnm001:39957] Signal code: Address not mapped (1) > [lnm001:39957] Failing at address: (nil) > [lnm001:39957] [ 0] /lib64/libpthread.so.0(+0xf100)[0x2b30f1a79100] > [lnm001:39957] [ 1] > /opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(+0x2f11f)[0x2b30f084911f] > [lnm001:39957] [ 2] > /opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(+0x2f265)[0x2b30f0849265] > [lnm001:39957] [ 3] > /opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(opal_info_show_mca_params+0x91)[0x2b30f0849031] > [lnm001:39957] [ 4] > /opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(opal_info_do_params+0x1f4)[0x2b30f0848e84] > [lnm001:39957] [ 5] ompi_info[0x402643] > [lnm001:39957] [ 6] /lib64/libc.so.6(__libc_start_main+0xf5)[0x2b30f1ca7b15] > [lnm001:39957] [ 7] ompi_info[0x4022a9] > [lnm001:39957] *** End of error message *** > zsh: segmentation fault (core dumped) ompi_info --param all all --level 9 > -------------------------------------------------------------------------- > > > > > > _______________________________________________ > users mailing list > us...@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/users > > > _______________________________________________ > devel mailing list > devel@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/devel > > > > _______________________________________________ > devel mailing list > devel@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/devel _______________________________________________ devel mailing list devel@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/devel