> On Jun 11, 2015, at 1:13 AM, Federico Reghenzani 
> <federico1.reghenz...@mail.polimi.it> wrote:
> 
> Just some other questions on ras functions:
> - we found this old thread 
> <http://www.open-mpi.org/community/lists/users/2007/04/2974.php> in which you 
> said that orte doesn't allow to spawn processes outside ras allocation (e.g. 
> with MPI_Comm_spawn). Is it still  like that? So, allocate() function is 
> called only once time?

Not really - you can use an MPI_Info key to “add-host” or “add-hostfile”

> - where can we found the number of requested processes for the task? We 
> searched in orte_job_t passed to allocate() but we cannot find it. We don't 
> know if take it from argv ("mpirun -np 4 ...") it's a good strategy.

It is in the orte_job_t:

    /* number of procs in this job */
    orte_vpid_t num_procs;


> 
> 
> We really appreciate your help guys :)
> 
> Cheers,
> Federico Reghenzani
> 
> 2015-04-13 21:08 GMT+02:00 Ralph Castain <r...@open-mpi.org 
> <mailto:r...@open-mpi.org>>:
> Yes - but the processes must stay in the same location
> 
>> On Apr 13, 2015, at 12:02 PM, Federico Reghenzani 
>> <federico1.reghenz...@mail.polimi.it 
>> <mailto:federico1.reghenz...@mail.polimi.it>> wrote:
>> 
>> Thank you.
>> 
>> And, to workaround, is it possible to temporary suspend processes on a node 
>> and later resume it (requested by RM)?  I saw in the code that orted can 
>> receive SIGTSTP and SIGCONT to suspend/resume processes.
>> 
>> 
>> Cheers,
>> Federico Reghenzani
>> 
>> 2015-04-10 16:58 GMT+02:00 Ralph Castain <r...@open-mpi.org 
>> <mailto:r...@open-mpi.org>>:
>> I’m afraid not. The MPI job would not be very happy to suddenly lose some 
>> nodes during execution, and relocating MPI processes during execution is 
>> something we don’t currently support.
>> 
>> There is work underway to integrate the RM more fully into that procedure so 
>> it could tell the MPI job to checkpoint, wait until that completed, 
>> terminate the job, and then fast-restart it on the new nodes - but that 
>> isn’t here yet.
>> 
>> 
>>> On Apr 10, 2015, at 7:54 AM, Federico Reghenzani 
>>> <federico1.reghenz...@mail.polimi.it 
>>> <mailto:federico1.reghenz...@mail.polimi.it>> wrote:
>>> 
>>> The RM can ask for deallocation of some nodes?
>>> 
>>> For example, mpirun asks to the RM which resources are available (let 
>>> node1, node2, node3) and spawns orted in the nodes. After some time during 
>>> the elaboration, can the RM ask to deassign node3 or  reassign jobs on 
>>> node3 to node4?
>>> 
>>> Cheers,
>>> Federico Reghenzani
>>> 
>>> 2015-03-26 18:09:22 GMT+06:00 Artem Polyakov <artpol84_at_[hidden]>:  
>>> 
>>> P.S. also check ESS (orte/mca/ess) for environment setup. 
>>> 2015-03-26 18:06 GMT+06:00 Artem Polyakov <artpol84_at_[hidden]>: 
>>> > 
>>> > 2015-03-26 17:58 GMT+06:00 Gianmario Pozzi <pozzigmario_at_[hidden]>: 
>>> > 
>>> >> Hi everyone, 
>>> >> I'm an italian M.Sc. student in Computer Engineering at Politecnico di 
>>> >> Milano. 
>>> >> 
>>> >> My team and I are trying to integrate OpenMPI with a real time resource 
>>> >> manager written by a group of students named BBQ ( 
>>> >> http://bosp.dei.polimi.it/ <http://bosp.dei.polimi.it/> ). We are 
>>> >> encountering some troubles, though. 
>>> >> 
>>> >> Our main issue is to understand how ORTE interacts with the resource 
>>> >> manager, which parts of the code (if any) are executed on the "slave" 
>>> >> nodes 
>>> >> and which ones on the "master". 
>>> >> We spent some time looking at the source code but we still have many 
>>> >> doubts. 
>>> >> 
>>> > 
>>> > Hello, 
>>> > check orte/mca/plm and orte/mca/ras 
>>> > PLM - process lifecycle manager 
>>> > RAS - resource allocation subsystem. 
>>> > 
>>> > In RAS mpirun detects under which RM it works and gets the allocation. 
>>> > in PLM spawn of remote processes is done. 
>>> > mpirun spawns orted daemons on the slave nodes and all the rest is done 
>>> > without RM intervention (IMHO). 
>>> > 
>>> > 
>>> >> 
>>> >> Thank you. 
>>> >> 
>>> >> _______________________________________________ 
>>> >> devel mailing list 
>>> >> devel_at_[hidden] 
>>> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>>> >> <http://www.open-mpi.org/mailman/listinfo.cgi/devel> 
>>> >> Link to this post: 
>>> >> http://www.open-mpi.org/community/lists/devel/2015/03/17157.php 
>>> >> <http://www.open-mpi.org/community/lists/devel/2015/03/17157.php> 
>>> >> 
>>> > 
>>> > 
>>> > 
>>> > -- 
>>> > С Уважением, ÐŸÐ¾Ð»Ñ ÐºÐ¾Ð² Рртем Юрьевич 
>>> > Best regards, Artem Y. Polyakov 
>>> > 
>>> 
>>> --
>>> С Уважением, ÐŸÐ¾Ð»Ñ ÐºÐ¾Ð² Рртем Юрьевич
>>> Best regards, Artem Y. Polyakov
>>> _______________________________________________
>>> devel mailing list
>>> de...@open-mpi.org <mailto:de...@open-mpi.org>
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>>> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>>> Link to this post: 
>>> http://www.open-mpi.org/community/lists/devel/2015/04/17210.php 
>>> <http://www.open-mpi.org/community/lists/devel/2015/04/17210.php>
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org <mailto:de...@open-mpi.org>
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/devel/2015/04/17211.php 
>> <http://www.open-mpi.org/community/lists/devel/2015/04/17211.php>
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org <mailto:de...@open-mpi.org>
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/devel/2015/04/17215.php 
>> <http://www.open-mpi.org/community/lists/devel/2015/04/17215.php>
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org <mailto:de...@open-mpi.org>
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2015/04/17216.php 
> <http://www.open-mpi.org/community/lists/devel/2015/04/17216.php>
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2015/06/17491.php

Reply via email to