A Julia cluster is just a cluster with Julia installed on all the nodes. One way of achieving this is to create a cluster using PelicanHPC, and then do one of: 1) install julia in the /home/user directory, for example, by compiling from source. This directory is NFS shared by all nodes, when the cluster is set up. or 2) run apt-get install julia on all the nodes.
A PHPC cluster is a reasonable solution for a single user. I used to develop it, and used it for a number of years on a 4 node cluster. It's Debian-based. On Friday, September 25, 2015 at 11:42:59 PM UTC+2, Ismael VC wrote: > > Hello everyone! > > I am trying to set up a Julia cluster with 20 nodes, this is the very > first time I've tried something like this. I have looked around for > examples, but documentation is not very helpful for me: > > *Julia can be started in parallel mode with either the -p or > the --machinefile options. -p n will launch an additional n worker > processes, while --machinefile file will launch a worker for each line in > file file. The machines defined in file must be accessible via a > passwordless ssh login, with Julia installed at the same location as the > current host. Each machine definition takes the > form [count*][user@]host[:port] [bind_addr[:port]] . user defaults to > current user, port to the standard ssh port. count is the number of workers > to spawn on the node, and defaults to 1. The > optional bind-to bind_addr[:port] specifies the ip-address and port that > other workers should use to connect to this worker.* > > This is what I think I have understood so far: > > Ok I list the machines on a machine file, that's easy, I have a file like > this: > > n user@555.555.555.555 > n user@555.555.555.556 > n user@555.555.555.555 > > > *The machines defined in file must be accessible via a > passwordless ssh login,* > > This is the part that is difficult for me the most, it says that machines > must be accesible via paswordless ssh > > * with Julia installed at the same location as the current host.* > > I understand this as I need to install Julia en every node in the same > location, so I have 20 nodes, same software and hardware stacks. Does this > means that the nodes must be of the same operating system? the same bits > (32/64) only? > > Right now I have *20 CentOS 6.7 (64 bits)* nodes with* julia-0.3.11* > installed from the *generic linux binaries (64bits)*, all of them > installed at */opt/julia-0.3.11/bin* (added to the PATH and already > exported in /etc/profile) > > Now the plan in my mind is to use my laptop *(windows 7 64 bits, > julia-0.3.11 64 bits)* as master node and control the cluster with that, > so according to what I understand, I'll need to do (leaving password blank): > > ssh-keygen -t rsa > > > From my Windows laptop (I plan to install Arch Linux soon), in order to > create my ssh key and then: > > > cat ~/.ssh/id_rsa.pub | ssh user@hostname 'cat >> .ssh/authorized_keys' > > > > To every node? So I have to be running the ssh server at every one of them? > (I understand I'll need it at the master node) This is where I simply don't > understand anymore, I haven't seen any tutorial, or article, or something > like that, just that paragraph in the manual, I know there is > ClusterManagers.jl but that sounds even more complicated for me right now. > > > I also want to help David Sanders to set up another cluster (once I got this > figured out) in his lab at Science Faculty, UNAM. I promise to enhance the > documentation around this topic once I understand this. > > > What do you guys think, do I have it all wrong? > > > If anyone can help me, I'll be very grateful, thank's in advance! > >