Re: Data distribution guidance for recommendation engines

2013-07-31 Thread Sean Owen
On Thu, Aug 1, 2013 at 3:15 AM, Chloe Guszo wrote:
> If I split my data into train and test sets, I can show good performance of
Good performance according to what metric? It makes a lot of difference whether you are talking about precision/recall or RMSE.
> the model on the train set. What migh

Data distribution guidance for recommendation engines

2013-07-31 Thread Chloe Guszo
Hi all, This question stems from my use of the alternating least squares method in Mahout, but errs on the theoretical side. If this is the wrong place for such a question, I apologize up front and would gladly direct my question to a more appropriate forum, as per your suggestions. I have been