Re: How to rebuild the shared edits directory

2012-07-24 Thread Jeff Whiting
7;t exit upon loss of shared edits, that would also be a bug which would hit the quorum-based solution. Thanks -Todd On Tue, May 8, 2012 at 4:20 PM, Jeff Whiting wrote: Thanks for being patient and listening to my rants. I'm excited to see hdfs continue to move forward. If the organi

Re: How to rebuild the shared edits directory

2012-05-08 Thread Jeff Whiting
issues. Most of what we've done with the hadoop eco system has been zookeeper and hbase related. Thanks, ~Jeff On 5/8/2012 2:46 PM, Todd Lipcon wrote: On Tue, May 8, 2012 at 12:38 PM, Jeff Whiting wrote: It seems the NN was originally written with the assumption that disks fail and s

Re: How to rebuild the shared edits directory

2012-05-08 Thread Jeff Whiting
he facility that's supposed to make the active NN crash if shared edits go away. The logs will help. To answer your question, though, you can run the "initializeSharedEdits" process again to re-initialize that edits dir. Thanks -Todd -- Todd Lipcon Software Engineer, Cloudera -- -

Re: dfs.data.dir and "hadoop namenode -format"

2010-06-24 Thread Jeff Whiting
.sh", then do 1) and then "start-dfs.sh"? 3) My last question here is what "hadoop namenode -format" does. If I run it on my Namenode, does it clean up the data.dir? and do I need to manually clean up the data.dir on Datanode? Thanks, Sean -- Jeff Whiting Qualtrics Senior Software Engineer je...@qualtrics.com

Re: Lots of Different Kind of Datanode Errors

2010-06-07 Thread Jeff Whiting
cated xceivers was set to 2048 instead of 4096. Also, in my experience SocketTimeoutExceptions are usually due to swapping. Verify that your machines aren't swapping when you're under load. Or doing any other heavy disk IO. -- Jeff Whiting Qualtrics Senior Softw

Re: Lots of Different Kind of Datanode Errors

2010-06-04 Thread Jeff Whiting
he "exceeds the limit" message. The EOFs and Connection Reset messages are when DFS clients are disconnecting prematurely from a client stream (probably due to xceiver errors on other streams) -Todd On Fri, Jun 4, 2010 at 8:56 AM, jeff whiting <mailto:je...@qualtrics

Lots of Different Kind of Datanode Errors

2010-06-04 Thread jeff whiting
ection reset by peer when I'm connecting locally. Why is the file prematurely ending? Any idea of what is going on? Thanks, ~Jeff -- Jeff Whiting Qualtrics Senior Software Engineer je...@qualtrics.com

Re: Unbalanced Datanode and Lots of Blocks Waiting for Deletion

2010-06-03 Thread jeff whiting
beta release of > CDH3. > > Thanks > -Todd > > On Wed, Jun 2, 2010 at 2:27 PM, jeff whiting wrote: > I'm running a 3 node hdfs cluster and am having major data distribution > issues. Looking at "live nodes" in the web interface I'm seeing the >

Unbalanced Datanode and Lots of Blocks Waiting for Deletion

2010-06-02 Thread jeff whiting
I need to do or check to solve the problem? Thanks, ~Jeff -- Jeff Whiting Qualtrics Senior Software Engineer je...@qualtrics.com