On Sun, Apr 4, 2010 at 4:10 PM, Sean Owen <sro...@gmail.com> wrote:

> I think you want to write this to accept "generic" data, and not
> necessarily assume the Netflix input format. I suggest you accept CSV
> data, in the form "userID,itemID,value", since that is what all the
> recommenders do.
>
> Sure, I'll write it for "userID, movieID, rating". Netflix also provides
dates but we can ignore it for the time being.


> You may need a quick utility program to convert Netflix data format to
> this. this wouldn't be part of the project, or else, we can put it in
> utils later.
>
> I have done this already. I have a 1.2GB CSV file containing all the 100
million records in the Netflix dataset as "userID, movieID, rating, date".


-- 
SK

Reply via email to