Edward J. Yoon wrote:
Hi communities,

Do you have any idea how to get the pairs of all row key combinations
w/o repetition on Map/Reduce as describe below?

Input : (MapFile or Hbase Table)

<Key1, Value or RowResult>
<Key2, Value or RowResult>
<Key3, Value or RowResult>
<Key4, Value or RowResult>

Output :

<Key1, Key2>
<Key1, Key3>
<Key1, Key4>
<Key2, Key3>
<Key2, Key4>
<Key3, Key4>
One way to do it would be as follows
For every key with index i,
for (k=0; k < i; k++) {
emit(i,key_i)
}
So the above input becomes
1,key1
1,key1
1,key1
It would be nice if someone can review my pseudo code of traditional
CF using cosine similarity.
http://wiki.apache.org/hama/TraditionalCollaborativeFiltering

Thanks.

Reply via email to