date:20081205

Re: implementing join on two Hbase tables

2008-12-05 Thread abhinit

Thanks, I will have a look at what you mentioned. I have another question. In Pig Latin data analysis tasks are expressed as queries. Pig Latin has join and cogroup operators which does the task using Map/Reduce on hadoop. Can anyone share how does Pig Latin implementation do it? Thanks On Fri, D

RE: implementing join on two Hbase tables

2008-12-05 Thread Jonathan Gray

I'm not aware of anything that is completely equipped for the task, however this could be done more simply with one of the Hadoop MapReduce tools. My personal favorite is Cascading (http://www.cascading.org) by Chris Wensel. This can help you with doing something like reading in two different tab

implementing join on two Hbase tables

2008-12-05 Thread abhinit

I am trying to implement hash-join and nested join on two Hbase tables. However, I am stuck. I came across the package *org.apache.hadoop.mapred.join* which joins two sorted datasets before map. However, I want to implement joins using map/reduce methods so that I have more control on how to join