Awesome Himanshu,
I was also trying to test using CPs to see where the sweet spot is
between the number of threads processing in parallel and overloading
the servers, since you potentially send a heavy, resource-bound task to
already taxed servers and therefore take a huge hit everywhere. I
was think
I did some experiments using coprocessors and compared the results with a
vanilla scan and, in one case, with MapReduce. I wrote up a blog post about
these experiments, as it was getting a bit difficult for me to explain them
by mail without figures etc. Please refer to
http://hbase-coprocessor-experiments.blo