Hi,
i am building a small 16 nodes cluster gentoo based.
I succesfully installed openmpi and i succesfully tried some simple small
test parallel program on a single host but...
i can't run parallel program on more than one nodes
The nodes are cloned (so they are equals).
The mpiuser (and their ss
using openmpi 1.4.2
On Fri, Dec 24, 2010 at 11:17 AM, Advanced Computing Group University of
Padova wrote:
> Hi,
> i am building a small 16 nodes cluster gentoo based.
> I succesfully installed openmpi and i succesfully tried some simple small
> test parallel program on a single hos
ang wrote:
> have you tested your ssh key setup, fire wall, and switch settings to
> ensure all nodes are talking to each other?
>
> On Mon, Dec 27, 2010 at 1:07 AM, Advanced Computing Group University of
> Padova wrote:
>
>> using openmpi 1.4.2
>>
>>
>>
param, so you can set it in your
> environ or put it in your default MCA param file.
>
>
> On Dec 28, 2010, at 3:31 AM, Advanced Computing Group University of Padova
> wrote:
>
> yes i've tested 'em
> In fact using the --debug-daemons switch everything works fine!
On Wed, Dec 29, 2010 at 10:10 AM, Advanced Computing Group University of
Padova wrote:
> Thank you Ralph,
> Your suspects seems to be quite interesting :)
> I try to run the same program from node 192.168.1/2.11 using also
> 192.168.2.12 "tracing" .12 activities.
> I at
n the ssh
> session is terminated, but I have no clue why.
>
> Given the small cluster size, I would just add this to your default param
> file and not worry about it:
>
> orte_leave_session_attached = 1
>
>
> On Dec 29, 2010, at 2:10 AM, Advanced Computing Group Universit