Sure, that sounds like something I can find a good home for. Create a new issue with the patch here: https://issues.apache.org/jira/secure/CreateIssue!default.jspa?pid=12310751
I'll do the rest.

best,
Sean

On Sun, Apr 4, 2010 at 11:03 AM, Sebastian Schelter <sebast...@alombra.de> wrote:
> Hi there,
>
> My name is Sebastian. I'm a student currently writing my diploma thesis
> on the comparison of several recommendation algorithms for a large German
> e-commerce site.
>
> The algorithms I evaluate include item-based collaborative filtering, which
> makes me a Taste and Mahout user. One major task in my recent work on
> item-based collaborative filtering was the precomputation of the item
> similarities with Map/Reduce. I decided to use a slightly modified version of
> the algorithm in [1] to compute the pairwise cosine similarity between all the
> item vectors.
>
> I'd be happy to donate this code to Mahout if you find it useful for your
> project. Just tell me and I'll provide a patch!
>
> Regards,
> Sebastian
>
>
> [1] Elsayed et al.: Pairwise Document Similarity in Large Collections with
> MapReduce,
> http://www.umiacs.umd.edu/~jimmylin/publications/Elsayed_etal_ACL2008_short.pdf
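
For readers following the thread, here is a rough, single-machine sketch of the general idea behind computing pairwise cosine similarity between item vectors in two passes, loosely in the spirit of the approach in [1]. It is not Sebastian's patch and not Mahout's API: the function name `pairwise_cosine` and the `(user, item, value)` input format are assumptions made for illustration, and the two loops only imitate what the map and reduce phases would do on a cluster.

```python
from collections import defaultdict
from itertools import combinations
from math import sqrt


def pairwise_cosine(ratings):
    """Cosine similarity for every pair of items that share at least one user.

    ratings: iterable of (user, item, value) triples (hypothetical input format).
    Returns a dict mapping (item_a, item_b), with item_a < item_b, to a float.
    """
    # Pass 1: group the values by user, the way a first map/reduce job
    # would build one record per user listing the items that user rated.
    by_user = defaultdict(dict)
    for user, item, value in ratings:
        by_user[user][item] = value

    # Squared length of each item vector, needed for the normalization.
    sq_norm = defaultdict(float)
    for items in by_user.values():
        for item, value in items.items():
            sq_norm[item] += value * value

    # Pass 2: for each user, emit the partial product of every pair of
    # co-rated items, then sum the partial products per pair.
    dot = defaultdict(float)
    for items in by_user.values():
        for a, b in combinations(sorted(items), 2):
            dot[(a, b)] += items[a] * items[b]

    # Normalize the dot products by the vector lengths.
    return {
        (a, b): d / (sqrt(sq_norm[a]) * sqrt(sq_norm[b]))
        for (a, b), d in dot.items()
        if sq_norm[a] > 0 and sq_norm[b] > 0
    }


if __name__ == "__main__":
    sample = [
        ("u1", "A", 1.0), ("u1", "B", 1.0),
        ("u2", "A", 1.0), ("u2", "C", 1.0),
    ]
    for pair, sim in sorted(pairwise_cosine(sample).items()):
        print(pair, round(sim, 4))
```

Only item pairs that co-occur for at least one user ever produce a partial product, which is why this formulation avoids comparing every item against every other item when the data is sparse.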