Thanks, I will have a look at what you mentioned.
I have another question. In Pig Latin data analysis tasks are expressed as
queries.
Pig Latin has join and cogroup operators which does the task using
Map/Reduce
on hadoop. Can anyone share how does Pig Latin implementation do it?
Thanks
On Fri, D
I'm not aware of anything that is completely equipped for the task, however
this could be done more simply with one of the Hadoop MapReduce tools.
My personal favorite is Cascading (http://www.cascading.org) by Chris
Wensel. This can help you with doing something like reading in two
different tab
I am trying to implement hash-join and nested join on two Hbase tables.
However, I am stuck.
I came across the package *org.apache.hadoop.mapred.join* which joins
two sorted datasets before map. However, I want to implement joins using
map/reduce methods so that I have more control on how to join