Re: Spark MLlib ALS algorithm

2016-09-25 Thread Roshani Nagmote
> f ratings in the dataset for performance testing purposes, to illustrate the scalability of Spark ALS.
>
> It is not something you would normally do on your training dataset.
>
> On Fri, 23 Sep 2016 at 20:07, Roshani Nagmote <roshaninagmo...@gmail.com> wrote:
>>

Spark MLlib ALS algorithm

2016-09-23 Thread Roshani Nagmote
Hello, I was working on the Spark MLlib ALS matrix factorization algorithm and came across the following blog post: https://databricks.com/blog/2014/07/23/scalable-collaborative-filtering-with-spark-mllib.html Can anyone help me understand what the "s" scaling factor does, and does it really give

Re: Issues while running MLlib matrix factorization ALS algorithm

2016-09-19 Thread Roshani Nagmote
> intInterval:Int):ALS.this.type)
>
> On Mon, 19 Sep 2016 at 20:01 Roshani Nagmote <roshaninagmo...@gmail.com> wrote:
>> Hello Sean,
>>
>> Can you please tell me how to set the checkpoint interval? I did set checkpointDir("hdfs:/"), but if I want
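For the checkpoint question in this thread, here is a minimal sketch, assuming the RDD-based `org.apache.spark.mllib.recommendation.ALS` API. The checkpoint path, the tiny in-memory ratings, and the parameter values are hypothetical placeholders, not taken from the thread. The checkpoint directory is set on the `SparkContext`, and the interval (in ALS iterations) is set on the `ALS` instance.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.mllib.recommendation.{ALS, Rating}

object AlsCheckpointSketch {
  def main(args: Array[String]): Unit = {
    // Master is expected to come from spark-submit (e.g. --master yarn).
    val sc = new SparkContext(new SparkConf().setAppName("AlsCheckpointSketch"))

    // Hypothetical directory; checkpointing only happens if a directory is set.
    sc.setCheckpointDir("hdfs:///tmp/als-checkpoints")

    // Tiny illustrative ratings (user, product, rating); replace with the real dataset.
    val ratings = sc.parallelize(Seq(
      Rating(1, 10, 4.0), Rating(1, 20, 2.0),
      Rating(2, 10, 5.0), Rating(2, 30, 1.0)
    ))

    // Illustrative values: checkpoint the factor RDDs every 5 iterations
    // so the lineage does not keep growing across the whole run.
    val model = new ALS()
      .setRank(10)
      .setIterations(20)
      .setLambda(0.01)
      .setCheckpointInterval(5)
      .run(ratings)

    println(s"Trained model with rank ${model.rank}")
    sc.stop()
  }
}
```

Whether a given interval helps with disk usage depends on the cluster and dataset, so the value above is only a starting point to experiment with.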

Re: Issues while running MLlib matrix factorization ALS algorithm

2016-09-19 Thread Roshani Nagmote
ri, Sep 16, 2016 at 11:14 AM, Roshani Nagmote <roshaninagmo...@gmail.com> wrote:
> Hello,
>
> Thanks for your reply.
>
> Yes, it's the Netflix dataset. And when I get the "no space on device" error, my ‘/mnt’ directory fills up. I checked.
>
> /usr/lib/spark/bin/spark-su

Re: Issues while running MLlib matrix factorization ALS algorithm

2016-09-16 Thread Roshani Nagmote
Hello, Thanks for your reply. Yes, it's the Netflix dataset. And when I get the "no space on device" error, my ‘/mnt’ directory fills up. I checked.
/usr/lib/spark/bin/spark-submit --deploy-mode cluster --master yarn --class org.apache.spark.examples.mllib.MovieLensALS --jars

Re: Issues while running MLlib matrix factorization ALS algorithm

2016-09-16 Thread Roshani Nagmote
> something is fairly wrong in your setup or maybe in other parts of your user code.
>
> On Thu, Sep 15, 2016 at 10:00 PM, Roshani Nagmote <roshaninagmo...@gmail.com> wrote:
>> Hi,
>>
>> I need help to run matrix factorization ALS algorithm in

Issues while running MLlib matrix factorization ALS algorithm

2016-09-15 Thread Roshani Nagmote
Hi, I need help running the matrix factorization ALS algorithm in Spark MLlib. I am using a dataset (1.5 GB) with 480189 users and 17770 items, formatted the same way as the MovieLens dataset. I am trying to run the MovieLensALS example jar on this dataset on an AWS EMR Spark cluster with 14 M4.2xlarge