Hello everyone, I want to write a MapReduce (MR) job over an HBase table. The table holds a million records, and I want the job to scan through it, read the data, and find the 90th percentile of the result. It would be helpful to hear from anyone who has tried this before.
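To be concrete about what I mean by the 90th percentile, here is a minimal sketch of the final calculation only, assuming the values have already been collected into an array of longs (for example in a single reducer). It uses the nearest-rank definition, which is my assumption; the class and method names are made up for illustration, and it does not touch HBase at all.

```java
import java.util.Arrays;

public class PercentileSketch {

    // Nearest-rank percentile: the smallest value such that at least
    // `percent` percent of the values are less than or equal to it.
    // `percent` must be in the range 1..100.
    static long nearestRank(long[] values, int percent) {
        if (values.length == 0) {
            throw new IllegalArgumentException("no values");
        }
        if (percent < 1 || percent > 100) {
            throw new IllegalArgumentException("percent must be in 1..100");
        }
        long[] sorted = values.clone();   // leave the caller's array untouched
        Arrays.sort(sorted);
        // Integer ceiling of (percent / 100) * n, avoiding floating point.
        long rank = ((long) percent * sorted.length + 99) / 100;
        return sorted[(int) rank - 1];    // rank is 1-based
    }

    public static void main(String[] args) {
        long[] sample = {70, 15, 100, 20, 90, 35, 80, 40, 60, 50};
        System.out.println(nearestRank(sample, 90));
    }
}
```

Sorting a million longs in memory is cheap, so this part is not my problem; the part I am stuck on is getting the rows from the table into the job in the first place.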
The part I am finding hard to understand, and my actual question, is: how does an MR job get its input? I mean, how do I get the scanner output into the MR job? I don't want to fetch a million rows first and then hand them to the MR job; I need the region servers' data to go directly to the MR job. I saw the example at http://www.nabble.com/Re:-Map-Reduce-over-HBase---sample-code-p18253120.html but still couldn't follow it... :(. Thanks in advance! Regards, </VJ>
