Lin Yiqun created HDFS-10181:
--------------------------------

             Summary: TestHFlush frequently fails due to theadInterrupt
                 Key: HDFS-10181
                 URL: https://issues.apache.org/jira/browse/HDFS-10181
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: test
            Reporter: Lin Yiqun
            Assignee: Lin Yiqun

The test {{TestHFlush}} has been failing frequently in recent patches. I looked through the failure logs and found that {{TestHFlush#testHFlushInterrupted}} fails for two different reasons, and this method is the main reason {{TestHFlush}} does not pass. The two failures are shown below:

{code}
Tests run: 14, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 30.325 sec <<< FAILURE! - in org.apache.hadoop.hdfs.TestHFlush
testHFlushInterrupted(org.apache.hadoop.hdfs.TestHFlush)  Time elapsed: 0.864 sec  <<< ERROR!
java.io.IOException: The stream is closed
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:118)
	at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140)
	at java.io.DataOutputStream.flush(DataOutputStream.java:123)
	at java.io.FilterOutputStream.close(FilterOutputStream.java:158)
	at org.apache.hadoop.hdfs.DataStreamer.closeStream(DataStreamer.java:877)
	at org.apache.hadoop.hdfs.DataStreamer.closeInternal(DataStreamer.java:726)
	at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:721)
{code}

{code}
testHFlushInterrupted(org.apache.hadoop.hdfs.TestHFlush)  Time elapsed: 0.862 sec  <<< ERROR!
java.nio.channels.ClosedByInterruptException: null
	at java.nio.channels.spi.AbstractInterruptibleChannel.end(AbstractInterruptibleChannel.java:202)
	at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:501)
	at org.apache.hadoop.net.SocketOutputStream$Writer.performIO(SocketOutputStream.java:63)
	at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:142)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:159)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:117)
	at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140)
	at java.io.DataOutputStream.flush(DataOutputStream.java:123)
	at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:653)
{code}

After analysing them, the two failures can be summarised as follows:
* The IOException happens when the stream has already been closed but a write operation on it continues.
* The ClosedByInterruptException happens when the thread is interrupted while the stream is performing an hflush operation.

So we should catch these exceptions in the stream's {{hflush}} and {{write}} operations.
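The interrupt behaviour behind the second trace can be reproduced with plain JDK classes. The sketch below is only an illustration and uses no Hadoop code: the class {{InterruptedWriteDemo}} and its methods are hypothetical names, and a {{java.nio.channels.Pipe}} stands in for the socket channel. It relies on the documented {{InterruptibleChannel}} contract: a blocking write by a thread whose interrupt status is set closes the channel and throws {{ClosedByInterruptException}}, and later writes on the closed channel fail with {{ClosedChannelException}}. That later failure is loosely analogous to the "The stream is closed" error in the first trace, though the Hadoop code path itself is not modelled here.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.ClosedByInterruptException;
import java.nio.channels.ClosedChannelException;
import java.nio.channels.Pipe;

public class InterruptedWriteDemo {

    /** Writes one byte and reports which exception type, if any, was thrown. */
    static String tryWrite(Pipe.SinkChannel sink) {
        try {
            sink.write(ByteBuffer.wrap(new byte[] {1}));
            return "ok";
        } catch (ClosedByInterruptException e) {
            // Most specific type first: it is a subclass of ClosedChannelException.
            return "ClosedByInterruptException";
        } catch (ClosedChannelException e) {
            return "ClosedChannelException";
        } catch (IOException e) {
            return "IOException";
        }
    }

    /**
     * Returns { result of write with interrupt set,
     *           whether the interrupt status was still set afterwards,
     *           result of a second write,
     *           whether the channel is still open }.
     */
    static String[] run() throws IOException {
        Pipe pipe = Pipe.open();
        Pipe.SinkChannel sink = pipe.sink();
        try {
            // Simulate an interrupt arriving before the write.
            Thread.currentThread().interrupt();
            String first = tryWrite(sink);

            // Thread.interrupted() reads and clears the interrupt status.
            boolean stillInterrupted = Thread.interrupted();

            // The channel was closed by the interrupted write above.
            String second = tryWrite(sink);

            return new String[] {
                first,
                String.valueOf(stillInterrupted),
                second,
                String.valueOf(sink.isOpen())
            };
        } finally {
            Thread.interrupted(); // never leak the interrupt status to the caller
            sink.close();
            pipe.source().close();
        }
    }

    public static void main(String[] args) throws IOException {
        for (String s : run()) {
            System.out.println(s);
        }
    }
}
```

A test that deliberately interrupts the writing thread, as {{testHFlushInterrupted}} does, therefore has to expect either exception type, depending on whether the interrupt lands before or after the channel is closed.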