I don't think there's a CDH that includes Hadoop 1.2.1 So either your code is doing something slow or it's the reading itself. For the latter, make sure you go through http://hbase.apache.org/book.html#perf.reading and we also recently had this thread on the list were you can see some "live" performance debugging http://www.mail-archive.com/user@hbase.apache.org/msg27174.html. For example, make sure you're not running on the local job tracker.
J-D On Fri, Sep 27, 2013 at 11:07 AM, Pavan Sudheendra <pavan0...@gmail.com>wrote: > Hi Jean, > HBase 0.94.6 and Hadoop 1.2.1 Cloudera Distributions.. > > I infact tried that out, in place of doing the get operations , i created > stub data and returned that instead.. It was practically at the same speed. > > Nothing changed.. After 20 mins or so when i check the job status.. It > hardly reached 1,000,000 rows.. > > > On Fri, Sep 27, 2013 at 11:12 PM, Jean-Daniel Cryans <jdcry...@apache.org > >wrote: > > > Your details are missing important bits like you configurations, > > Hadoop/HBase versions, etc. > > > > Doing those random reads inside your MR job, especially if they are > reading > > cold data, will indeed make it slower. Just to get an idea, if you skip > > doing the Gets, how fast does it became? > > > > J-D > > > > > > On Fri, Sep 27, 2013 at 10:33 AM, Pavan Sudheendra <pavan0...@gmail.com > > >wrote: > > > > > Hi everyone, > > > I posted this question many time before and i've given full details on > > > stackoverflow.. > > > > > > http://stackoverflow.com/q/19056712/938959 > > > > > > Please i need someone to guide me in the right direction here. > > > > > > Help much appreciated! > > > > > > -- > > > Regards- > > > Pavan > > > > > > > > > -- > Regards- > Pavan >