. Also, you can control the processing speed
by adjusting the number of reducers.
Thanks!
Zhiyuan
> From: Madhav Sharan <msha...@usc.edu>
> Subject: Pairwise similarity using map reduce
> Date: August 10, 2016 at 12:25:46 PM PDT
> To: user <user@hadoop.apache.org>
> C
Hi hadoop users,
I have a set of vectors stored in .txt files on HDFS. The goal is to take every
pair of vectors and compute the similarity between them.
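To make the goal concrete, here is a minimal single-machine sketch of the all-pairs computation. It is only an illustration: cosine similarity is an assumed measure (the similarity function is not specified above), the vectors are assumed to be equal-length lists of floats already loaded in memory, and the names `cosine_similarity` and `pairwise_similarities` are hypothetical.

```python
from itertools import combinations
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length numeric vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Define similarity with a zero vector as 0.0 to avoid dividing by zero.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def pairwise_similarities(vectors):
    """Map each unordered pair of vector names to its similarity.

    `vectors` is a dict of name -> list of floats. Each pair appears once,
    so n vectors yield n * (n - 1) / 2 entries.
    """
    return {
        (name_a, name_b): cosine_similarity(vectors[name_a], vectors[name_b])
        for name_a, name_b in combinations(sorted(vectors), 2)
    }
```

The number of pairs grows quadratically with the number of vectors, which is why the work is handed to MR jobs rather than run on one machine.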
1. We generate the pairs of vectors with a Python script and give them as
input to the MR jobs. The input file has comma-separated paths to the vector files. "