Re: [OMPI devel] Failure while loading shmem module

2016-11-16 Thread Gianmario Pozzi
Ok, good news. Running the command as root works, so it seems to be a permissions issue. Which is weird anyway, never happened on other configurations. Thank you for your time. 2016-11-16 11:23 GMT+01:00 Gianmario Pozzi : > Hi Pritchard, thank you for replying. > > Nothing changed a

Re: [OMPI devel] Failure while loading shmem module

2016-11-16 Thread Gianmario Pozzi
> HI Gianmario, > > Probably something went wrong at the spml layer. > Could you also add —mac spml_base_verbose 10 > to the job launch line? > > Howard > > -- > Howard Pritchard > HPC-DES > Los Alamos National Laboratory > > > From: devel on behalf of Gianmar

[OMPI devel] Failure while loading shmem module

2016-11-15 Thread Gianmario Pozzi
enced something similar? Thank you. -- *Gianmario Pozzi* *M.Sc. @ Politecnico di Milano* ___ devel mailing list devel@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/devel

[OMPI devel] InfiniBand BTL structure

2016-10-05 Thread Gianmario Pozzi
capable of working also with InfiniBand-based BTL. Is there anyone who could help me to better understand how it works? It looks way more complex than TCP. Does it use only RDMA? If not, what else? Any hint is appreciated. Thank you guys, have a nice day. -- *Gianmario Pozzi* *M.Sc. @ Politecnico

Re: [OMPI devel] orted-children communication

2016-01-26 Thread Gianmario Pozzi
as migration, and Annu Dasari and Dave Solt are working on the error > response APIs to the client can request specific responses by the resource > manager. > > > On Jan 19, 2016, at 4:57 AM, Gianmario Pozzi > wrote: > > Hi everyone! > > I'm one of Federico's col

[OMPI devel] orted-children communication

2016-01-19 Thread Gianmario Pozzi
Hi everyone! I'm one of Federico's colleagues working on orted migration. In this post ( http://www.open-mpi.org/community/lists/devel/2015/10/18268.php) he made a summary of all the communication channels involved in each run. I need to know which framework manages communications between orted an

Re: [OMPI devel] Checkpoint/restart + migration

2015-10-27 Thread Gianmario Pozzi
Thank you guys, your help is really appriciated! We'll keep in touch for further information. Gianmario Il 23/ott/2015 12:44 "Jeff Squyres (jsquyres)" ha scritto: > On Oct 22, 2015, at 7:17 AM, Gilles Gouaillardet < > gilles.gouaillar...@gmail.com> wrote: > > > > Gianmario, > > > > there was c/

[OMPI devel] Checkpoint/restart + migration

2015-10-22 Thread Gianmario Pozzi
Hi everyone! My team and I are working on the possibility to checkpoint a process and restarting it on another node. We are using CRIU framework for the checkpoint/restart part, but we are facing some issues related to migration. First of all: we found out that some attempts to C/R an OMPI proces

[OMPI devel] Info about ORTE structure

2015-03-26 Thread Gianmario Pozzi
Hi everyone, I'm an italian M.Sc. student in Computer Engineering at Politecnico di Milano. My team and I are trying to integrate OpenMPI with a real time resource manager written by a group of students named BBQ ( http://bosp.dei.polimi.it/ ). We are encountering some troubles, though. Our main