Thanks a lot for your reply, but I had already tried the built-in
RDD.cartesian() method, and it didn't make it any faster.
qinwei
From: Alex Boisvert
Date: 2014-04-26 00:32
To: user
Subject: Re: what is the best way to do cartesian
You might want to try the built-in RDD.cartesian() method.
On Thu, Apr 24, 2014 at 9:05 PM, Qin Wei wei@dewmobile.net wrote:
Hi All,
I have a problem with the Item-Based Collaborative Filtering Recommendation
Algorithms in Spark.
The basic flow is as below:
Depending on the size of the RDD, you could also collect it, broadcast the
result, and then compute the product in a map function over the other RDD.
If both sides are the same RDD, you might also want to cache it. This pattern
has worked quite well for me.
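
For illustration, here is a rough PySpark-style sketch of that collect, broadcast, and map pattern. It is only a sketch under assumptions: the names pair_partition and broadcast_cartesian are made up for this example, sc is assumed to be an existing SparkContext, and small_rdd is assumed to be small enough to fit in memory on the driver and on every executor.

```python
from itertools import product


def pair_partition(items, small):
    """Pair every item of one partition with every element of the small list."""
    return product(items, small)


def broadcast_cartesian(sc, big_rdd, small_rdd):
    """Hypothetical helper: cartesian product via collect + broadcast + map.

    Assumption: small_rdd fits in memory on the driver and on each executor.
    """
    # Pull the small side to the driver, then ship it to the executors once.
    small = sc.broadcast(small_rdd.collect())
    # Each task pairs its own partition with the locally available broadcast value.
    return big_rdd.mapPartitions(lambda items: pair_partition(items, small.value))
```

When both sides are the same RDD, calling cache() on it before the collect should avoid computing it twice. Whether this beats RDD.cartesian() depends on how small the collected side really is.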
On 25 Apr 2014 at 18:33, Alex Boisvert alex.boisv...@gmail.com wrote: