f ratings in the dataset
> for performance testing purposes, to illustrate the scalability of Spark
> ALS.
>
> It is not something you would normally do on your training dataset.
>
> On Fri, 23 Sep 2016 at 20:07, Roshani Nagmote <roshaninagmo...@gmail.com>
> wrote:
>
>>
Hello,
I was working with the Spark MLlib ALS matrix factorization algorithm and came
across the following blog post:
https://databricks.com/blog/2014/07/23/scalable-collaborative-filtering-with-spark-mllib.html
Can anyone help me understand what the "s" scaling factor does, and does it
really give
intInterval:Int):ALS.this.type)
>
>
> On Mon, 19 Sep 2016 at 20:01 Roshani Nagmote <roshaninagmo...@gmail.com>
> wrote:
>
>> Hello Sean,
>>
>> Can you please tell me how to set the checkpoint interval? I did set
>> checkpointDir("hdfs:/"), but if I want
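The checkpoint interval asked about above can be set on the `spark.mllib` ALS builder itself. The following is only a minimal sketch, not the configuration used in this thread: the object name, the checkpoint path, the toy ratings, and the rank, iteration, and interval values are all hypothetical placeholders. It assumes the job is launched through `spark-submit`, which supplies the master URL.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.mllib.recommendation.{ALS, Rating}

// Hypothetical object name, used only for illustration.
object AlsCheckpointSketch {
  def main(args: Array[String]): Unit = {
    // The master URL is expected to come from spark-submit.
    val sc = new SparkContext(new SparkConf().setAppName("AlsCheckpointSketch"))

    // Assumption: placeholder path. The checkpoint directory is set on the
    // SparkContext; without it, the ALS checkpoint interval is ignored.
    sc.setCheckpointDir("hdfs:///tmp/als-checkpoints")

    // Toy ratings (user, product, rating) standing in for the real dataset.
    val ratings = sc.parallelize(Seq(
      Rating(1, 10, 4.0),
      Rating(1, 20, 2.0),
      Rating(2, 10, 5.0),
      Rating(2, 30, 3.0)
    ))

    // Placeholder hyperparameters. With an interval of 5, the factor RDDs
    // are checkpointed every 5 iterations, which truncates their lineage.
    val model = new ALS()
      .setRank(10)
      .setIterations(20)
      .setCheckpointInterval(5)
      .run(ratings)

    // Predict a single (user, product) pair from the toy data.
    println(model.predict(1, 30))

    sc.stop()
  }
}
```

Periodic checkpointing keeps the RDD lineage short over many iterations, which may help when intermediate data accumulates on local disk. Whether it resolves the disk-space problem reported in this thread is not established here.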
ri, Sep 16, 2016 at 11:14 AM, Roshani Nagmote <roshaninagmo...@gmail.com
> wrote:
> Hello,
>
> Thanks for your reply.
>
> Yes, it's the Netflix dataset. And when I get the "no space on device" error,
> my ‘/mnt’ directory fills up. I checked.
>
> /usr/lib/spark/bin/spark-su
Hello,
Thanks for your reply.
Yes, it's the Netflix dataset. And when I get the "no space on device" error,
my ‘/mnt’ directory fills up. I checked.
/usr/lib/spark/bin/spark-submit --deploy-mode cluster --master yarn --class
org.apache.spark.examples.mllib.MovieLensALS --jars
something is
> fairly wrong in your setup or maybe in other parts of your user code.
>
> On Thu, Sep 15, 2016 at 10:00 PM, Roshani Nagmote
> <roshaninagmo...@gmail.com> wrote:
> > Hi,
> >
> > I need help running the matrix factorization ALS algorithm in
Hi,
I need help running the matrix factorization ALS algorithm in Spark MLlib.
I am using a dataset (1.5 GB) with 480189 users and 17770 items, formatted in
the same way as the MovieLens dataset.
I am trying to run the MovieLensALS example jar on this dataset on an AWS EMR
Spark cluster with 14 m4.2xlarge