Re: mahout spark-itemsimilarity does not work on EMR 4.3

2016-02-23 Thread Jonathan Kelly
The next release of EMR will include Mahout 0.11.1 so that Mahout on Spark works with Spark 1.6. Sorry for any inconvenience until then. By the way, I'm interested to know what your use case is for running Mahout on Spark, so please feel free to PM me if you are able to share any details. Thank

RE: mahout spark-itemsimilarity does not work on EMR 4.3

2016-02-23 Thread Andrew Palumbo
Please update to Mahout 0.11.1 for Spark versions > 1.3.

Original message
From: Zhun Shen
Date: 02/23/2016 8:57 PM (GMT-05:00)
To: user@mahout.apache.org
Subject: mahout spark-itemsimilarity does not work on EMR 4.3

Hi, mahout version: 0.11.0 EMR

mahout spark-itemsimilarity does not work on EMR 4.3

2016-02-23 Thread Zhun Shen
Hi,

mahout version: 0.11.0
EMR version: 4.3
spark version: 1.6.0

I tried to run mahout spark-itemsimilarity on AWS EMR, but it reported: “MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. Cannot find Spark classpath. Is 'SPARK_HOME' set?” Is it a bug for EMR or I use mahout spark
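
For anyone hitting the same message, a quick way to rule out the simplest cause is to check whether SPARK_HOME is set at all and points to a real directory. The snippet below is only a sketch: the helper name `diagnose_spark_home` is made up for illustration and is not part of Mahout or Spark. Note that the replies in this thread point to a version mismatch (Mahout 0.11.0 with Spark 1.6.0), so a correctly set SPARK_HOME may not be enough on its own.

```python
import os


def diagnose_spark_home(env=None):
    """Describe the state of SPARK_HOME in the given environment mapping.

    Hypothetical helper for illustration only; it inspects the variable
    and the filesystem and does not call Mahout or Spark.
    """
    env = os.environ if env is None else env
    spark_home = env.get("SPARK_HOME")
    if not spark_home:
        return "SPARK_HOME is not set"
    if not os.path.isdir(spark_home):
        return "SPARK_HOME points to a missing directory: " + spark_home
    return "SPARK_HOME looks usable: " + spark_home


if __name__ == "__main__":
    print(diagnose_spark_home())
```

If this reports a usable directory and the error persists, the Mahout and Spark versions are the next thing to check.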

Re: Document similarity

2016-02-23 Thread David Starina
Guys, one more question ... Are there some incremental methods to do this? I don't want to run the whole job again once a new document is added. In case of LDA ... I guess the best way is to calculate the topics on the new document using the topics from the previous LDA run ... And then every once
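
The idea of scoring a new document against topics from a previous run can be sketched as a "fold-in" step: keep the topic-word probabilities fixed and estimate only the new document's topic mixture. The code below is a minimal illustration under stated assumptions, not Mahout's implementation. It uses plain maximum-likelihood EM over the mixture weights (PLSA-style fold-in) rather than full Bayesian LDA inference, and the function name, the input layout (one word-to-probability dict per topic), and the small floor for unseen words are all choices made for this example.

```python
def infer_topic_mixture(doc_word_counts, topic_word_probs, iterations=50):
    """Estimate a topic mixture for one new document with topics held fixed.

    doc_word_counts:  dict mapping word -> count in the new document.
    topic_word_probs: list with one dict per topic, mapping word -> P(word | topic),
                      taken from an earlier topic-model run (assumed input format).
    Returns a list of topic weights that sums to 1.
    """
    num_topics = len(topic_word_probs)
    theta = [1.0 / num_topics] * num_topics
    if not doc_word_counts:
        return theta  # nothing to learn from an empty document

    floor = 1e-12  # assumed tiny probability for words a topic has never seen
    for _ in range(iterations):
        new_theta = [0.0] * num_topics
        for word, count in doc_word_counts.items():
            # E-step: how much each topic is responsible for this word.
            weights = [theta[k] * topic_word_probs[k].get(word, floor)
                       for k in range(num_topics)]
            total = sum(weights)
            for k in range(num_topics):
                new_theta[k] += count * weights[k] / total
        # M-step: renormalise the expected counts into a mixture.
        norm = sum(new_theta)
        theta = [value / norm for value in new_theta]
    return theta


if __name__ == "__main__":
    topics = [
        {"cat": 0.5, "dog": 0.5},
        {"stock": 0.5, "bond": 0.5},
    ]
    print(infer_topic_mixture({"cat": 3, "dog": 1}, topics))
```

Because the topics themselves are never updated here, they drift out of date as new documents accumulate, so the full job would still need to be re-run from time to time.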