The next release of EMR will include Mahout 0.11.1 so that Mahout on Spark
works with Spark 1.6. Sorry for any inconvenience until then.
By the way, I'm interested to know what your use case is for running Mahout
on Spark, so please feel free to PM me if you are able to share any details.
Thanks
Please update to Mahout 0.11.1 for Spark versions newer than 1.3.
Original message
From: Zhun Shen
Date: 02/23/2016 8:57 PM (GMT-05:00)
To: user@mahout.apache.org
Subject: mahout spark-itemsimilarity does not work on EMR 4.3
Hi,
mahout version: 0.11.0
EMR version: 4.3
spark version: 1.6.0
I tried to run mahout spark-itemsimilarity on AWS EMR, but it reported:
“MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Cannot find Spark classpath. Is 'SPARK_HOME' set?” Is this a bug in EMR, or am I
using mahout spark
Guys, one more question: are there any incremental methods for doing this?
I don't want to run the whole job again each time a new document is added. In
the case of LDA, I guess the best way is to compute the topics for the new
document using the topics from the previous LDA run, and then every once