I have patches for HDFS-347 and HDFS-941 (and HDFS-918) for CDH3U0. 

   - Andy

> From: Doug Meil <doug.m...@explorysmedical.com>
> Subject: RE: HDFS-1599 status? (HDFS tickets to improve HBase)
> To: "dev@hbase.apache.org" <dev@hbase.apache.org>
> Date: Friday, June 3, 2011, 12:50 PM
> Thanks everybody for commenting on
> this thread.  
> 
> We'd certainly like to lobby for movement on these two
> tickets, and although we don't have anybody that is familiar
> with the source code we'd be happy to perform some tests get
> some performance numbers.
> 
> Per Kihwal's comments, it sounds like HDFS-941 needs to get
> re-worked because the patch is stale.
> 
> The patch for HDFS-347 sounds like it's still usable.
> 
> So what else is needed to push this effort forward? 
> Is it beneficial to get more numbers on HDFS-347 and keep
> lobbying on the ticket, and/or is there another path that
> should be taken (plying with beer, free Cleveland Indians
> tickets, harassing phone calls, etc.)? 
> 
> 
> 
> -----Original Message-----
> From: Dhruba Borthakur [mailto:dhr...@gmail.com]
> 
> Sent: Friday, June 03, 2011 3:00 PM
> To: dev@hbase.apache.org
> Subject: Re: HDFS-1599 status? (HDFS tickets to improve
> HBase)
> 
> I completely agree with Ryan. Most of the measurements in
> HDFS-347 are point comparisions.... data rate over socket,
> single-threaded sequential read from datanode,
> single-threaded random read form datanode, etc. These
> measurements are good, but when you run the entire Hbase
> system at load, you definitely see a 3X performance
> improvement when reading data locally (instead of going
> through the datanode).
> 
> -dhruba
> 
> On Fri, Jun 3, 2011 at 11:08 AM, Ryan Rawson <ryano...@gmail.com>
> wrote:
> 
> > Could you explain your HDFS-347 comment more?  I
> dont think people 
> > suggested that the socket itself was the primary
> issue, but dealing 
> > with the datanode and the socket and everything was
> really slow.  It's 
> > hard to separate concerns and test only 1 thing at a
> time - for 
> > example you said 'local socket comm isnt the problem',
> but there is no 
> > way to build a test that uses a local socket but not
> the datanode.
> >
> > The basic fact is that datanode adds a lot of
> overhead, and under high 
> > concurrency that overhead grows.
> >
> >
> >
> > On Fri, Jun 3, 2011 at 7:07 AM, Kihwal Lee <kih...@yahoo-inc.com>
> wrote:
> > > HDFS-941
> > > The trunk has moved on so the patch won't
> apply.  There has been
> > significant changes in HDFS lately, so it will require
> more than 
> > simple rebase/merge.  If the original assignee is
> busy, I am willing to help.
> > >
> > > HDFS-347
> > > The analysis is pointing out that local socket
> communication is 
> > > actually
> > not the problem. The initial assumption of local
> socket being slow 
> > should be ignored and the design should be revisited.
> > >
> > > I agree that improving local pread performance is
> critical.  Based 
> > > on my
> > experiments, HDFS-941 helps a lot and the
> communication channel became 
> > no longer the bottleneck.
> > >
> > > Kihwal
> > >
> > >
> > > On 6/2/11 4:00 PM, "Doug Meil" <doug.m...@explorysmedical.com>
> wrote:
> > >
> > > Hi folks, I was wondering if there was any
> movement on any of these 
> > > HDFS
> > tickets for HBase.  The umbrella ticket is
> HDFS-1599, but the last 
> > comment from stack back in Feb highlighted interest in
> several tickets:
> > >
> > >
> > > 1)      HDFS-918 (use single
> selector)
> > >
> > > a.       Last comment
> Jan 2011
> > >
> > >
> > >
> > > 2)      HDFS-941 (reuse of
> connection)
> > >
> > > a.       Patch available
> as of April 2011
> > >
> > > b.      But ticket still
> unresolved.
> > >
> > >
> > >
> > > 3)      HDFS-347 (local reads)
> > >
> > > a.       Discussion
> seemed to end in March 2011 with a huge comment
> > saying that there was no performance benefit.
> > >
> > > b.      I'm working my way through
> this comment/report, but intuitively
> > it seems like it would be a good idea since as the
> other comments in 
> > the ticket stated the RS reads locally just about
> every time.
> > >
> > >
> > > Doug Meil
> > > Chief Software Architect, Explorys
> > > doug.m...@explorys.com
> > >
> > >
> > >
> >
> 
> 
> 
> --
> Connect to me at http://www.facebook.com/dhruba
>

Reply via email to