High Availability Framework for HDFS Namenode in 2.0.0

2012-05-03 Thread Shi Yu
It sounds like an exciting feature. Does anyone have tried this in practice? 
How does the hot standby namenode perform and how reliable is the HDFS 
recovery? Is it now a good chance to migrate to 2.0.0, in your opinions?

Best,

Shi



Re: High Availability Framework for HDFS Namenode in 2.0.0

2012-05-03 Thread Harsh J
Hey Shi Yu,

Some questions of yours are answered at this comment:
https://issues.apache.org/jira/browse/HDFS-1623?focusedCommentId=13215309page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13215309
(and below) and at was tracked at
https://issues.apache.org/jira/browse/HDFS-3010

The version of HA available already does work reasonably well,
although my own personal tests with DFS+HBase were local and not
long-running (as they may call it, non-prod).

Unsure on what you mean by recovery exactly. However, 2.0.0 is a
release yet to be prepared, but should come out soon per recent
activity on the general lists (branch-2 is your best shot for now for
the latest usable HDFS).

Docs for available HA in 0.23.1 is available in the tar, or
https://issues.apache.org/jira/secure/attachment/12513484/HDFSHighAvailability.html.
Doesn't look like it was compiled/linked onto the 0.23.1 live docs,
though it was present. Do give it a whirl and let us know!

Also checkout https://issues.apache.org/jira/browse/HDFS-3042 for the
remaining part of failover work.

On Thu, May 3, 2012 at 7:58 PM, Shi Yu sh...@uchicago.edu wrote:
 It sounds like an exciting feature. Does anyone have tried this in practice? 
 How does the hot standby namenode perform and how reliable is the HDFS 
 recovery? Is it now a good chance to migrate to 2.0.0, in your opinions?

 Best,

 Shi




-- 
Harsh J


Re: High Availability Framework for HDFS Namenode in 2.0.0

2012-05-03 Thread Shi Yu
Hi Harsh J,

It seems that the 20% performance lost is not that bad, at least some smart 
people are still working to improve it. I will keep an eye on this interesting 
trend. 

Shi


Re: High Availability Framework for HDFS Namenode in 2.0.0

2012-05-03 Thread Todd Lipcon
Hi Shi,

The 20% regression was prior to implementing a few optimizations on the
branch. Here's the later comment:
https://issues.apache.org/jira/browse/HDFS-1623?focusedCommentId=13218813page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13218813

Also, the 20% measurement was on a particular stress test designed to
target exactly the metadata code path we figured there might be performance
impact.

So on typical clusters with typical workloads, you shouldn't find any
measurable difference.

-Todd

On Thu, May 3, 2012 at 8:38 AM, Shi Yu sh...@uchicago.edu wrote:

 Hi Harsh J,

 It seems that the 20% performance lost is not that bad, at least some
 smart people are still working to improve it. I will keep an eye on this
 interesting trend.

 Shi




-- 
Todd Lipcon
Software Engineer, Cloudera


Re: High Availability Framework for HDFS Namenode in 2.0.0

2012-05-03 Thread Shi Yu
Hi Todd,

Okay, that sounds really good (sorry didn't grab all the 
information in that long page). 

Shi