A Julia cluster is just a cluster with Julia installed on all the nodes. 
One way of achieving this is to create a cluster using PelicanHPC, and then 
do one of:
1) install julia in the /home/user directory, for example, by compiling 
from source. This directory is NFS shared by all nodes, when the cluster is 
set up.
or
2) run apt-get install julia on all the nodes.

A PHPC cluster is a reasonable solution for a single user. I used to 
develop it, and used it for a number of years on a 4 node cluster. It's 
Debian-based.

On Friday, September 25, 2015 at 11:42:59 PM UTC+2, Ismael VC wrote:
>
> Hello everyone!
>
> I am trying to set up a Julia cluster with 20 nodes, this is the very 
> first time I've tried something like this. I have looked around for 
> examples, but documentation is not very helpful for me:
>
> *Julia can be started in parallel mode with either the -p or 
> the --machinefile options. -p n will launch an additional n worker 
> processes, while --machinefile file will launch a worker for each line in 
> file file. The machines defined in file must be accessible via a 
> passwordless ssh login, with Julia installed at the same location as the 
> current host. Each machine definition takes the 
> form [count*][user@]host[:port] [bind_addr[:port]] . user defaults to 
> current user, port to the standard ssh port. count is the number of workers 
> to spawn on the node, and defaults to 1. The 
> optional bind-to bind_addr[:port] specifies the ip-address and port that 
> other workers should use to connect to this worker.*
>
> This is what I think I have understood so far:
>
> Ok I list the machines on a machine file, that's easy, I have a file like 
> this:
>
> n user@555.555.555.555
> n user@555.555.555.556
> n user@555.555.555.555
>
>
> *The machines defined in file must be accessible via a 
> passwordless ssh login,*
>
> This is the part that is difficult for me the most, it says that machines 
> must be accesible via paswordless ssh
>
> * with Julia installed at the same location as the current host.*
>
> I understand this as I need to install Julia en every node in the same 
> location, so I have 20 nodes, same software and hardware stacks. Does this 
> means that the nodes must be of the same operating system? the same bits 
> (32/64) only?
>
> Right now I have *20 CentOS 6.7 (64 bits)* nodes with* julia-0.3.11* 
> installed from the *generic linux binaries (64bits)*, all of them 
> installed at */opt/julia-0.3.11/bin* (added to the PATH and already 
> exported in /etc/profile)
>
> Now the plan in my mind is to use my laptop *(windows 7 64 bits, 
> julia-0.3.11 64 bits)* as master node and control the cluster with that, 
> so according to what I understand, I'll need to do (leaving password blank):
>
> ssh-keygen -t rsa
>
>
> From my Windows laptop (I plan to install Arch Linux soon), in order to 
> create my ssh key and then:
>
>
> cat ~/.ssh/id_rsa.pub | ssh user@hostname 'cat >> .ssh/authorized_keys'
>
>
>
> To every node? So I have to be running the ssh server at every one of them? 
> (I understand I'll need it at the master node) This is where I simply don't 
> understand anymore, I haven't seen any tutorial, or article, or something 
> like that, just that paragraph in the manual, I know there is 
> ClusterManagers.jl but that sounds even more complicated for me right now.
>
>
> I also want to help David Sanders to set up another cluster (once I got this 
> figured out) in his lab at Science Faculty, UNAM. I promise to enhance the 
> documentation around this topic once I understand this.
>
>
> What do you guys think, do I have it all wrong?
>
>
> If anyone can help me, I'll be very grateful, thank's in advance!
>
>

Reply via email to