Re: Duplicates in Collaborative Filtering Output

2023-01-23 Thread Kartik Ohri
Hi again! Ironically, soon after sending the previous email, I found the bug in our setup that was causing the duplicates, and it wasn't MLlib ALS after all. Sorry for the confusion. Regards. On Mon, Jan 23, 2023 at 1:09 PM Kartik Ohri wrote: > Hi! > > We are using Spark MLlib (on

Duplicates in Collaborative Filtering Output

2023-01-22 Thread Kartik Ohri
Hi! We are using the Spark MLlib (on Spark 3.2.0) ALS model for an implicit-feedback-based collaborative filtering recommendation job. While looking at the output of recommendForUserSubset