Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]
On Tue, 23 Oct 2007, Brian Barrett wrote: --without-memory-manager --enable-mca-static=btl-mx,mtl-mx From what I understand from the FAQ, the above --enable-mca-static option would include statically linked MX support in libmpi. But I was building all support in (--enable-static --disable-shared) for other reasons anyway, so I guess that it wouldn't make any difference. I've already tried to build all shared (the 'configure' default) and this made no difference; --without-memory-manager is the only option which makes a difference. Given that both Patrick and Brian recommend using "--without-memory-manager", I won't feel bad about doing it :-) Thanks a lot ! -- Bogdan Costescu IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868 E-mail: bogdan.coste...@iwr.uni-heidelberg.de
[OMPI devel] RFC: Add "connect" field to openib BTL INI file
WHAT: Add a "connect" field to the openib BTL INI file to specify which connect pseudo-component(s) (CPCs) can be used for wireup of a given device. WHY: 2 new CPCs are being written (by different parties); certain devices can only use certain CPCs. WHERE: Mainly in btl_openib_ini.*, but probably also touch a few other btl_openib_* files. WHEN: Sometime in the near future; no specific time set. TIMEOUT: Friday, 2 Nov 2007 In addition to the already-existing "oob" connect pseudo-component (CPC), two more are in the works: - XRC: Pasha/Mellanox is working on a new XRC CPC for ConnectX HCAs. - RDMA CM: Welcome to Jon Mason from Open Grid Computing (for Chelsio) who will be doing iWARP-related work in the udapl and/or openib BTLs. Either he or I will be implementing the RDMA CM-based CPC. My proposal is that the "connect" field can be added to the INI file and take a comma-delimited list of values of acceptable CPCs for a given device. For example, the ConnectX HCA can take the following value: connect = xrc, rdma_cm, oob Meaning: - first, try the XRC CPC to try to make the connection - if that fails, try the RDMA CM CPC - if that fails, try the OOB CPC - if that fails, fail the connection iWARP-based NICs can use the following value: connect = rdma_cm If no "connect" value is specified, then default value of "oob, rdma_cm" can be assumed (possibly someday changing to "rdma_cm, oob"). I mention this here on the devel list because disparate groups are working on different CPC's; coordination will be required to implement this arbitration mechanism. Comments? -- Jeff Squyres Cisco Systems
Re: [OMPI devel] PML cm and heterogeneous support
George / Patrick / Rich / Christian -- Any idea why that's there? Is that because portals, MX, and PSM all require homogeneous environments? On Oct 18, 2007, at 3:59 PM, Sajjad Tabib wrote: Hi, I am tried to run an MPI program in a heterogeneous environment using the pml cm component. However, open mpi returned with an error message indicating that PML add procs returned "Not supported". I dived into the cm code to see what was wrong and I came upon the code below, which basically shows that if the processes are running on different architectures, then return "not supported". Now, I'm wondering whether my interpretation is correct or not. Is it true that the cm component does not support a heterogeneous environment? If so, will the developers support this in the future? How could I get around this while still using the cm component? What will happen if I rebuilt openmpi without these statements? I would appreciate your help. Code: mca_pml_cm_add_procs(){ #if OMPI_ENABLE_HETEROGENEOUS_SUPPORT 107 for (i = 0 ; i < nprocs ; ++i) { 108 if (procs[i]->proc_arch != ompi_proc_local()->proc_arch) { 109 return OMPI_ERR_NOT_SUPPORTED; 110 } 111 } 112 #endif . . . } Sajjad Tabib ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Jeff Squyres Cisco Systems
Re: [OMPI devel] PML cm and heterogeneous support
No, it's because the CM PML was never designed to be used in a heterogeneous environment :). While the MX BTL does support heterogeneous operations (at one point, I believe I even had it working), none of the MTLs have ever been tested in heterogeneous environments and it's known the datatype usage in the CM PML won't support heterogeneous operation. Brian On Oct 24, 2007, at 6:21 PM, Jeff Squyres wrote: George / Patrick / Rich / Christian -- Any idea why that's there? Is that because portals, MX, and PSM all require homogeneous environments? On Oct 18, 2007, at 3:59 PM, Sajjad Tabib wrote: Hi, I am tried to run an MPI program in a heterogeneous environment using the pml cm component. However, open mpi returned with an error message indicating that PML add procs returned "Not supported". I dived into the cm code to see what was wrong and I came upon the code below, which basically shows that if the processes are running on different architectures, then return "not supported". Now, I'm wondering whether my interpretation is correct or not. Is it true that the cm component does not support a heterogeneous environment? If so, will the developers support this in the future? How could I get around this while still using the cm component? What will happen if I rebuilt openmpi without these statements? I would appreciate your help. Code: mca_pml_cm_add_procs(){ #if OMPI_ENABLE_HETEROGENEOUS_SUPPORT 107 for (i = 0 ; i < nprocs ; ++i) { 108 if (procs[i]->proc_arch != ompi_proc_local()- >proc_arch) { 109 return OMPI_ERR_NOT_SUPPORTED; 110 } 111 } 112 #endif . . . } Sajjad Tabib ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Jeff Squyres Cisco Systems ___ devel mailing list de...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/devel