[ https://issues.apache.org/jira/browse/SOLR-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15533900#comment-15533900 ]
Steve Rowe commented on SOLR-8456: ---------------------------------- I see still see non-reproducing failures on my Jenkins about once a day, mostly like these two, with closed sockets (neither of which is still present on the other Jenkins servers) - these don't seem to be related to timeouts, so maybe a separate issue is warranted?: >From July 10th: {noformat} Build: https://builds.apache.org/job/Lucene-Solr-NightlyTests-6.x/116/ 6 tests failed. FAILED: org.apache.solr.cloud.HttpPartitionTest.test Error Message: Captured an uncaught exception in thread: Thread[id=25190, name=SocketProxy-Response-37204:54204, state=RUNNABLE, group=TGRP-HttpPartitionTest] Stack Trace: com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught exception in thread: Thread[id=25190, name=SocketProxy-Response-37204:54204, state=RUNNABLE, group=TGRP-HttpPartitionTest] at __randomizedtesting.SeedInfo.seed([56EEEACA37CCC526:DEBAD5109930A8DE]:0) Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is closed at __randomizedtesting.SeedInfo.seed([56EEEACA37CCC526]:0) at org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347) Caused by: java.net.SocketException: Socket is closed at java.net.Socket.setSoTimeout(Socket.java:1137) at org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344) {noformat} >From August 26th: {noformat} Build: http://jenkins.thetaphi.de/job/Lucene-Solr-master-Windows/6078/ Java: 32bit/jdk1.8.0_102 -server -XX:+UseSerialGC 6 tests failed. FAILED: org.apache.solr.cloud.HttpPartitionTest.test Error Message: Captured an uncaught exception in thread: Thread[id=6241, name=SocketProxy-Request-60378:60020, state=RUNNABLE, group=TGRP-HttpPartitionTest] Stack Trace: com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught exception in thread: Thread[id=6241, name=SocketProxy-Request-60378:60020, state=RUNNABLE, group=TGRP-HttpPartitionTest] Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is closed at __randomizedtesting.SeedInfo.seed([25A5A0B6B8584DC6]:0) at org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347) Caused by: java.net.SocketException: Socket is closed at java.net.Socket.setSoTimeout(Socket.java:1137) at org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344) {noformat} Here's one from yesterday from my Jenkins [http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/2919]: {noformat} [junit4] 2> 284268 ERROR (zkCallback-176-thread-1-processing-n:127.0.0.1:46806_ug_) [n:127.0.0.1:46806_ug_ c:collMinRf_1x3 s:shard1 r:core_node2 x:collMinRf_1x3_shard1_replica2] o.a.s.c.ShardLeaderElectionContext There was a problem trying to register as the leader:java.lang.NullPointerException [junit4] 2> at org.apache.solr.cloud.ShardLeaderElectionContext.runLeaderProcess(ElectionContext.java:422) [junit4] 2> at org.apache.solr.cloud.LeaderElector.runIamLeaderProcess(LeaderElector.java:170) [junit4] 2> at org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:135) [junit4] 2> at org.apache.solr.cloud.LeaderElector.access$200(LeaderElector.java:56) [junit4] 2> at org.apache.solr.cloud.LeaderElector$ElectionWatcher.process(LeaderElector.java:348) [junit4] 2> at org.apache.solr.common.cloud.SolrZkClient$3.lambda$process$0(SolrZkClient.java:268) [junit4] 2> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [junit4] 2> at java.util.concurrent.FutureTask.run(FutureTask.java:266) [junit4] 2> at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229) [junit4] 2> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [junit4] 2> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [junit4] 2> at java.lang.Thread.run(Thread.java:745) [...] [junit4] 2> NOTE: reproduce with: ant test -Dtestcase=HttpPartitionTest -Dtests.method=test -Dtests.seed=BD012B32E5FEC12B -Dtests.slow=true -Dtests.locale=de-AT -Dtests.timezone=Asia/Bishkek -Dtests.asserts=true -Dtests.file.encoding=US-ASCII [junit4] ERROR 167s J3 | HttpPartitionTest.test <<< [junit4] > Throwable #1: com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught exception in thread: Thread[id=1344, name=SocketProxy-Response-60387:59366, state=RUNNABLE, group=TGRP-HttpPartitionTest] [junit4] > at __randomizedtesting.SeedInfo.seed([BD012B32E5FEC12B:355514E84B02ACD3]:0) [junit4] > Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is closed [junit4] > at __randomizedtesting.SeedInfo.seed([BD012B32E5FEC12B]:0) [junit4] > at org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347) [junit4] > Caused by: java.net.SocketException: Socket is closed [junit4] > at java.net.Socket.setSoTimeout(Socket.java:1137) [junit4] > at org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344) [junit4] 2> 284534 INFO (SUITE-HttpPartitionTest-seed#[BD012B32E5FEC12B]-worker) [ ] o.a.s.SolrTestCaseJ4 ###deleteCore [junit4] 2> NOTE: test params are: codec=Asserting(Lucene62): {range_facet_l_dv=Lucene50(blocksize=128), _version_=FSTOrd50, multiDefault=PostingsFormat(name=Memory doPackFST= true), a_t=FSTOrd50, intDefault=FSTOrd50, id=Lucene50(blocksize=128), range_facet_i_dv=FSTOrd50, text=PostingsFormat(name=Memory doPackFST= false), range_facet_l=FSTOrd50, timestamp=FSTOrd50}, docValues:{range_facet_l_dv=DocValuesFormat(name=Lucene54), range_facet_i_dv=DocValuesFormat(name=Lucene54), timestamp=DocValuesFormat(name=Lucene54)}, maxPointsInLeafNode=256, maxMBSortInHeap=5.671099811904124, sim=RandomSimilarity(queryNorm=false,coord=yes): {}, locale=de-AT, timezone=Asia/Bishkek [junit4] 2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation 1.8.0_77 (64-bit)/cpus=16,threads=1,free=351228904,total=524812288 [junit4] 2> NOTE: All tests run in this JVM: [AutoCommitTest, BlockJoinFacetDistribTest, HdfsDirectoryFactoryTest, CopyFieldTest, DocExpirationUpdateProcessorFactoryTest, ZkStateReaderTest, PathHierarchyTokenizerFactoryTest, DateFieldTest, TestTestInjection, TestSolrQueryParser, TestConfigSetsAPI, HttpPartitionTest] [junit4] Completed [184/635 (1!)] on J3 in 167.39s, 1 test, 1 error <<< FAILURES! {noformat} > Investigate org.apache.solr.cloud.HttpPartitionTest.test failing more > commonly. > ------------------------------------------------------------------------------- > > Key: SOLR-8456 > URL: https://issues.apache.org/jira/browse/SOLR-8456 > Project: Solr > Issue Type: Test > Reporter: Mark Miller > Assignee: Timothy Potter > -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org