Which OS for Spark cluster nodes?

2015-04-03 Thread Horsmann, Tobias
Hi, Are there any recommendations for operating systems that one should use for setting up Spark/Hadoop nodes in general? I am not familiar with the differences between the various linux distributions or how well they are (not) suited for cluster set-ups, so I wondered if there is some

Re: Which OS for Spark cluster nodes?

2015-04-03 Thread Akhil Das
There isn't any specific Linux distro, but i would prefer Ubuntu for a beginner as its very easy to apt-get install stuffs on it. Thanks Best Regards On Fri, Apr 3, 2015 at 4:58 PM, Horsmann, Tobias tobias.horsm...@uni-due.de wrote: Hi, Are there any recommendations for operating systems

Re: Which OS for Spark cluster nodes?

2015-04-03 Thread Charles Feduke
As Akhil says Ubuntu is a good choice if you're starting from near scratch. Cloudera CDH virtual machine images[1] include Hadoop, HDFS, Spark, and other big data tools so you can get a cluster running with very little effort. Keep in mind Cloudera is a for-profit corporation so they are also