[ https://issues.apache.org/jira/browse/HDFS-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14905863#comment-14905863 ]
Hadoop QA commented on HDFS-8696: --------------------------------- \\ \\ | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | pre-patch | 18m 9s | Pre-patch trunk compilation is healthy. | | {color:green}+1{color} | @author | 0m 0s | The patch does not contain any @author tags. | | {color:red}-1{color} | tests included | 0m 0s | The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. | | {color:green}+1{color} | javac | 8m 5s | There were no new javac warning messages. | | {color:green}+1{color} | javadoc | 10m 24s | There were no new javadoc warning messages. | | {color:green}+1{color} | release audit | 0m 22s | The applied patch does not increase the total number of release audit warnings. | | {color:green}+1{color} | checkstyle | 1m 23s | There were no new checkstyle issues. | | {color:green}+1{color} | whitespace | 0m 0s | The patch has no lines that end in whitespace. | | {color:green}+1{color} | install | 1m 29s | mvn install still works. | | {color:green}+1{color} | eclipse:eclipse | 0m 35s | The patch built with eclipse:eclipse. | | {color:green}+1{color} | findbugs | 2m 31s | The patch does not introduce any new Findbugs (version 3.0.0) warnings. | | {color:green}+1{color} | native | 3m 11s | Pre-build of native portion | | {color:red}-1{color} | hdfs tests | 0m 21s | Tests failed in hadoop-hdfs. | | | | 46m 34s | | \\ \\ || Reason || Tests || | Failed build | hadoop-hdfs | \\ \\ || Subsystem || Report/Notes || | Patch URL | http://issues.apache.org/jira/secure/attachment/12762052/HDFS-8696.007.patch | | Optional Tests | javadoc javac unit findbugs checkstyle | | git revision | trunk / 06d1c90 | | hadoop-hdfs test log | https://builds.apache.org/job/PreCommit-HDFS-Build/12653/artifact/patchprocess/testrun_hadoop-hdfs.txt | | Test Results | https://builds.apache.org/job/PreCommit-HDFS-Build/12653/testReport/ | | Java | 1.7.0_55 | | uname | Linux asf904.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux | | Console output | https://builds.apache.org/job/PreCommit-HDFS-Build/12653/console | This message was automatically generated. > Reduce the variances of latency of WebHDFS > ------------------------------------------ > > Key: HDFS-8696 > URL: https://issues.apache.org/jira/browse/HDFS-8696 > Project: Hadoop HDFS > Issue Type: Improvement > Components: webhdfs > Affects Versions: 2.7.0 > Reporter: Xiaobing Zhou > Assignee: Xiaobing Zhou > Attachments: HDFS-8696.004.patch, HDFS-8696.005.patch, > HDFS-8696.006.patch, HDFS-8696.007.patch, HDFS-8696.1.patch, > HDFS-8696.2.patch, HDFS-8696.3.patch > > > There is an issue that appears related to the webhdfs server. When making two > concurrent requests, the DN will sometimes pause for extended periods (I've > seen 1-300 seconds), killing performance and dropping connections. > To reproduce: > 1. set up a HDFS cluster > 2. Upload a large file (I was using 10GB). Perform 1-byte reads, writing > the time out to /tmp/times.txt > {noformat} > i=1 > while (true); do > echo $i > let i++ > /usr/bin/time -f %e -o /tmp/times.txt -a curl -s -L -o /dev/null > "http://<namenode>:50070/webhdfs/v1/tmp/bigfile?op=OPEN&user.name=root&length=1"; > done > {noformat} > 3. Watch for 1-byte requests that take more than one second: > tail -F /tmp/times.txt | grep -E "^[^0]" > 4. After it has had a chance to warm up, start doing large transfers from > another shell: > {noformat} > i=1 > while (true); do > echo $i > let i++ > /usr/bin/time -f %e curl -s -L -o /dev/null > "http://<namenode>:50070/webhdfs/v1/tmp/bigfile?op=OPEN&user.name=root"; > done > {noformat} > It's easy to find after a minute or two that small reads will sometimes > pause for 1-300 seconds. In some extreme cases, it appears that the > transfers timeout and the DN drops the connection. -- This message was sent by Atlassian JIRA (v6.3.4#6332)