Hi,
Are there any recommendations for operating systems that one should use for
setting up Spark/Hadoop nodes in general?
I am not familiar with the differences between the various Linux distributions
or how well they are (not) suited for cluster set-ups, so I wondered if there
is some
There isn't any specific Linux distro that is required, but I would suggest
Ubuntu for a beginner, as it's very easy to install packages on it with apt-get.
Thanks
Best Regards
On Fri, Apr 3, 2015 at 4:58 PM, Horsmann, Tobias tobias.horsm...@uni-due.de
wrote:
Hi,
Are there any recommendations for operating systems
As Akhil says, Ubuntu is a good choice if you're starting from near scratch.
Cloudera CDH virtual machine images[1] include Hadoop, HDFS, Spark, and
other big data tools so you can get a cluster running with very little
effort. Keep in mind Cloudera is a for-profit corporation so they are also