On Sun, Apr 4, 2010 at 4:10 PM, Sean Owen <sro...@gmail.com> wrote: > I think you want to write this to accept "generic" data, and not > necessarily assume the Netflix input format. I suggest you accept CSV > data, in the form "userID,itemID,value", since that is what all the > recommenders do. > > Sure, I'll write it for "userID, movieID, rating". Netflix also provides dates but we can ignore it for the time being.
> You may need a quick utility program to convert Netflix data format to > this. this wouldn't be part of the project, or else, we can put it in > utils later. > > I have done this already. I have a 1.2GB CSV file containing all the 100 million records in the Netflix dataset as "userID, movieID, rating, date". -- SK