Hung in DFSClient$DFSOutputStream.writeChunk

2008-11-20 Thread stack
Over in hbase-space, we trigger a hang in DFSOutputStream.writeChunk. Input appreciated. Here are the two pertinent extracts from the hbase regionserver thread dump:

"IPC Server handler 9 on 60020" daemon prio=10 tid=0x7fef1c3f0400 nid=0x7470 waiting for monitor entry [0x42d18000

Re: Hung in DFSClient$DFSOutputStream.writeChunk

2008-11-20 Thread stack
stack wrote:
Over in hbase-space, we trigger a hang in DFSOutputStream.writeChunk. Input appreciated.

Pardon me. The above should have read, "...we sometimes trigger". The stack traces below are from hadoop-0.18.2. Other ill-documented instances of the hang can be found over in HBASE-66

Re: Hung in DFSClient$DFSOutputStream.writeChunk

2008-11-21 Thread stack
Trying to get more data on the issue reported below, I tripped over the following, where a datanode died and dfsclient is trying to switch to fetching the wanted block from another datanode. The transition attempt is unsuccessful, but what is odd is that we do not proceed to the datanode carrying the 3rd replica. Her