[ https://issues.apache.org/jira/browse/MAPREDUCE-7329?focusedWorklogId=579129&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-579129 ]
ASF GitHub Bot logged work on MAPREDUCE-7329: --------------------------------------------- Author: ASF GitHub Bot Created on: 08/Apr/21 12:06 Start Date: 08/Apr/21 12:06 Worklog Time Spent: 10m Work Description: hadoop-yetus commented on pull request #2775: URL: https://github.com/apache/hadoop/pull/2775#issuecomment-815727225 :broken_heart: **-1 overall** | Vote | Subsystem | Runtime | Logfile | Comment | |:----:|----------:|--------:|:--------:|:-------:| | +0 :ok: | reexec | 0m 34s | | Docker mode activated. | |||| _ Prechecks _ | | +1 :green_heart: | dupname | 0m 0s | | No case conflicting files found. | | +0 :ok: | codespell | 0m 0s | | codespell was not available. | | +1 :green_heart: | @author | 0m 0s | | The patch does not contain any @author tags. | | +1 :green_heart: | test4tests | 0m 0s | | The patch appears to include 1 new or modified test files. | |||| _ trunk Compile Tests _ | | +0 :ok: | mvndep | 14m 15s | | Maven dependency ordering for branch | | -1 :x: | mvninstall | 6m 4s | [/branch-mvninstall-root.txt](https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2775/11/artifact/out/branch-mvninstall-root.txt) | root in trunk failed. | | +1 :green_heart: | compile | 2m 24s | | trunk passed with JDK Ubuntu-11.0.10+9-Ubuntu-0ubuntu1.20.04 | | +1 :green_heart: | compile | 2m 0s | | trunk passed with JDK Private Build-1.8.0_282-8u282-b08-0ubuntu1~20.04-b08 | | +1 :green_heart: | checkstyle | 0m 55s | | trunk passed | | +1 :green_heart: | mvnsite | 1m 15s | | trunk passed | | +1 :green_heart: | javadoc | 0m 50s | | trunk passed with JDK Ubuntu-11.0.10+9-Ubuntu-0ubuntu1.20.04 | | +1 :green_heart: | javadoc | 0m 42s | | trunk passed with JDK Private Build-1.8.0_282-8u282-b08-0ubuntu1~20.04-b08 | | +1 :green_heart: | spotbugs | 1m 54s | | trunk passed | | +1 :green_heart: | shadedclient | 13m 49s | | branch has no errors when building and testing our client artifacts. | |||| _ Patch Compile Tests _ | | +0 :ok: | mvndep | 0m 25s | | Maven dependency ordering for patch | | +1 :green_heart: | mvninstall | 1m 1s | | the patch passed | | +1 :green_heart: | compile | 2m 17s | | the patch passed with JDK Ubuntu-11.0.10+9-Ubuntu-0ubuntu1.20.04 | | +1 :green_heart: | javac | 2m 17s | | the patch passed | | +1 :green_heart: | compile | 1m 57s | | the patch passed with JDK Private Build-1.8.0_282-8u282-b08-0ubuntu1~20.04-b08 | | +1 :green_heart: | javac | 1m 57s | | the patch passed | | +1 :green_heart: | blanks | 0m 0s | | The patch has no blanks issues. | | +1 :green_heart: | checkstyle | 0m 49s | | the patch passed | | +1 :green_heart: | mvnsite | 1m 1s | | the patch passed | | +1 :green_heart: | javadoc | 0m 37s | | the patch passed with JDK Ubuntu-11.0.10+9-Ubuntu-0ubuntu1.20.04 | | +1 :green_heart: | javadoc | 0m 34s | | the patch passed with JDK Private Build-1.8.0_282-8u282-b08-0ubuntu1~20.04-b08 | | +1 :green_heart: | spotbugs | 2m 7s | | the patch passed | | +1 :green_heart: | shadedclient | 13m 38s | | patch has no errors when building and testing our client artifacts. | |||| _ Other Tests _ | | -1 :x: | unit | 7m 27s | [/patch-unit-hadoop-mapreduce-project_hadoop-mapreduce-client_hadoop-mapreduce-client-core.txt](https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2775/11/artifact/out/patch-unit-hadoop-mapreduce-project_hadoop-mapreduce-client_hadoop-mapreduce-client-core.txt) | hadoop-mapreduce-client-core in the patch passed. | | +1 :green_heart: | unit | 131m 12s | | hadoop-mapreduce-client-jobclient in the patch passed. | | +1 :green_heart: | asflicense | 0m 39s | | The patch does not generate ASF License warnings. | | | | 210m 9s | | | | Reason | Tests | |-------:|:------| | Failed junit tests | hadoop.mapred.TestJobEndNotifier | | Subsystem | Report/Notes | |----------:|:-------------| | Docker | ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2775/11/artifact/out/Dockerfile | | GITHUB PR | https://github.com/apache/hadoop/pull/2775 | | JIRA Issue | MAPREDUCE-7329 | | Optional Tests | dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient spotbugs checkstyle codespell | | uname | Linux 9228899b8f35 4.15.0-136-generic #140-Ubuntu SMP Thu Jan 28 05:20:47 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | dev-support/bin/hadoop.sh | | git revision | trunk / 08790b1668caaf825fd79a7aff29c5e5e169ec45 | | Default Java | Private Build-1.8.0_282-8u282-b08-0ubuntu1~20.04-b08 | | Multi-JDK versions | /usr/lib/jvm/java-11-openjdk-amd64:Ubuntu-11.0.10+9-Ubuntu-0ubuntu1.20.04 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_282-8u282-b08-0ubuntu1~20.04-b08 | | Test Results | https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2775/11/testReport/ | | Max. process+thread count | 1639 (vs. ulimit of 5500) | | modules | C: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient U: hadoop-mapreduce-project/hadoop-mapreduce-client | | Console output | https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2775/11/console | | versions | git=2.25.1 maven=3.6.3 spotbugs=4.2.2 | | Powered by | Apache Yetus 0.14.0-SNAPSHOT https://yetus.apache.org | This message was automatically generated. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 579129) Time Spent: 6h 40m (was: 6.5h) > HadoopPipes task failed as a result of ping timeout exception > ------------------------------------------------------------- > > Key: MAPREDUCE-7329 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7329 > Project: Hadoop Map/Reduce > Issue Type: Bug > Affects Versions: 2.6.0 > Reporter: chaoli > Priority: Major > Labels: patch, pull-request-available > Fix For: 2.6.0, 3.0.0 > > Attachments: > 0001-MAPREDUCE-7329-HadoopPipes-task-may-fail-when-linux-.patch, > image-2021-03-15-14-29-49-475.png, image-2021-03-15-14-37-32-184.png > > Time Spent: 6h 40m > Remaining Estimate: 0h > > {color:#FF0000}*Hadoop Pipes Ping implement has a bug*{color}. Recently, we > upgrade linux kernel version from 3.x to 4.x. And we find hadoop pipe task > exit with connect timeout which is implemented by PingThread in > HadoopPipes.cc. > !image-2021-03-15-14-37-32-184.png! > After a deep research, we finally find that current ping server won't accept > ping client created socket, which may cause critical problem: > * it will cause tcp accept queue full(default 50) > * when client close socket, server socket won't call close method, which > will leave too many CLOSE_WAIT socket fd existed(default 2h), and accept > queue never cleared. > * Even worse, in 4.x linux kernel version, it will cause tcp drop packet > directly which makes ping client connect time out. While In 3.x linux kernel > version, when accept queue full, client can also make half connection till > sync queue full (default 2048), so from client side, ping will aslo work till > sync queue full. And after 3 hours, task will also exit with connect timeout > exception. > To fix this bug, we introduced a PingSocketCleaner thread, which will > continuously accept ping socket connect from ping client. When socket close > from client, cleaner thread will detecte closed inputStream reading, then > finally close socket from sever side. > Refrenced by linux kernel patch: > [https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5ea8ea2cb7] > -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org