Oh, good catch. I'm not sure who updates the platform files or who would have added the "carto" option to the no_build. It's the only difference between the the 1.3.4 platform files and the previous ones, save for some compiler flags.

-david

--
David Gunter
HPC-3: Infrastructure Team
Los Alamos National Laboratory




On Nov 5, 2009, at 3:55 PM, Jeff Squyres wrote:

I see:

enable_mca_no_build=carto,crs,routed-direct,routed-linear,snapc,pml- dr,pml-crcp2,pml-crcpw,pml-v,pml-example,crcp,pml-cm,filem

Which means that you're directing all carto components not to build at all.

It looks like carto is now required...?


On Nov 5, 2009, at 5:38 PM, Samuel K. Gutierrez wrote:

Hi Jeff,

This is how I configured my build.

./configure --with-platform=./contrib/platform/lanl/rr-class/ optimized-
panasas --prefix=/usr/projects/hpctools/samuel/local/rr-dev/apps/
openmpi/gcc/ompi-1.3.4rc4 --libdir=/usr/projects/hpctools/samuel/ local/
rr-dev/apps/openmpi/gcc/ompi-1.3.4rc4/lib64

I'll send the build log shortly.

Thanks!
--
Samuel K. Gutierrez
Los Alamos National Laboratory

On Nov 5, 2009, at 3:07 PM, Jeff Squyres wrote:

> How did you build?
>
> I see one carto component named "auto_detect" in the 1.3.4 source
> tree, but I don't see it in your ompi_info output.
>
> Did that component not build?
>
>
> On Nov 4, 2009, at 7:20 PM, Samuel K. Gutierrez wrote:
>
>> Hi All,
>>
>> I just built OMPI 1.3.4rc4 on one of our Roadrunner machines. When I
>> try to launch a simple MPI job, I get the following:
>>
>> [rra011a.rr.lanl.gov:31601] mca: base: components_open: Looking for
>> carto components
>> [rra011a.rr.lanl.gov:31601] mca: base: components_open: opening carto
>> components
>> [rra011a.rr.lanl.gov:31601] mca:base:select: Auto-selecting carto
>> components
>> [rra011a.rr.lanl.gov:31601] mca:base:select:(carto) No component
>> selected!
>> --------------------------------------------------------------------------
>> It looks like opal_init failed for some reason; your parallel
>> process is
>> likely to abort. There are many reasons that a parallel process can
>> fail during opal_init; some of which are due to configuration or
>> environment problems.  This failure appears to be an internal
>> failure;
>> here's some additional information (which may only be relevant to an
>> Open MPI developer):
>>
>>   opal_carto_base_select failed
>>   --> Returned value -13 instead of OPAL_SUCCESS
>> -------------------------------------------------------------------------- >> [rra011a.rr.lanl.gov:31601] [[INVALID],INVALID] ORTE_ERROR_LOG: Not
>> found in file runtime/orte_init.c at line 77
>> [rra011a.rr.lanl.gov:31601] [[INVALID],INVALID] ORTE_ERROR_LOG: Not
>> found in file orterun.c at line 541
>>
>> This may be an issue on our end regarding a runtime parameter that
>> isn't set correctly. See attached. Please let me know if you need
>> any more info.
>>
>> Thanks!
>> --
>> Samuel K. Gutierrez
>> Los Alamos National Laboratory
>>
>>
>> <lanl-rr-class-1.3.4rc4.tar.gz><ATT2634783.txt><ATT2634785.txt>
>
>
> --
> Jeff Squyres
> jsquy...@cisco.com
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel

_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel



--
Jeff Squyres
jsquy...@cisco.com

_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to