Re: Pairwise similarity using map reduce

2016-08-10 Thread zhiyuan yang
. Also, you can control the speed of processing by controlling number of reducers. Thanks! Zhiyuan > From: Madhav Sharan <msha...@usc.edu> > Subject: Pairwise similarity using map reduce > Date: August 10, 2016 at 12:25:46 PM PDT > To: user <user@hadoop.apache.org> > C

Pairwise similarity using map reduce

2016-08-10 Thread Madhav Sharan
Hi hadoop users, I have a set of vectors stored in .txt files on HDFS. Goal is to take every pair of vector and compute similarity between them. 1. We generate pairs of vectors by a python script and give it as a input to MR jobs. Input file has comma separated path to vector files. "