[ 
https://issues.apache.org/jira/browse/SOLR-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15533900#comment-15533900
 ] 

Steve Rowe commented on SOLR-8456:
----------------------------------

I see still see non-reproducing failures on my Jenkins about once a day, mostly 
like these two, with closed sockets (neither of which is still present on the 
other Jenkins servers) - these don't seem to be related to timeouts, so maybe a 
separate issue is warranted?:

>From July 10th:
{noformat}
Build: https://builds.apache.org/job/Lucene-Solr-NightlyTests-6.x/116/

6 tests failed.
FAILED:  org.apache.solr.cloud.HttpPartitionTest.test

Error Message:
Captured an uncaught exception in thread: Thread[id=25190, 
name=SocketProxy-Response-37204:54204, state=RUNNABLE, 
group=TGRP-HttpPartitionTest]

Stack Trace:
com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught 
exception in thread: Thread[id=25190, name=SocketProxy-Response-37204:54204, 
state=RUNNABLE, group=TGRP-HttpPartitionTest]
        at 
__randomizedtesting.SeedInfo.seed([56EEEACA37CCC526:DEBAD5109930A8DE]:0)
Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is 
closed
        at __randomizedtesting.SeedInfo.seed([56EEEACA37CCC526]:0)
        at 
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347)
Caused by: java.net.SocketException: Socket is closed
        at java.net.Socket.setSoTimeout(Socket.java:1137)
        at 
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344)
{noformat}

>From August 26th:

{noformat}
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-master-Windows/6078/
Java: 32bit/jdk1.8.0_102 -server -XX:+UseSerialGC

6 tests failed.
FAILED:  org.apache.solr.cloud.HttpPartitionTest.test

Error Message:
Captured an uncaught exception in thread: Thread[id=6241, 
name=SocketProxy-Request-60378:60020, state=RUNNABLE, 
group=TGRP-HttpPartitionTest]

Stack Trace:
com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught 
exception in thread: Thread[id=6241, name=SocketProxy-Request-60378:60020, 
state=RUNNABLE, group=TGRP-HttpPartitionTest]
Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is 
closed
        at __randomizedtesting.SeedInfo.seed([25A5A0B6B8584DC6]:0)
        at 
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347)
Caused by: java.net.SocketException: Socket is closed
        at java.net.Socket.setSoTimeout(Socket.java:1137)
        at 
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344)
{noformat}

Here's one from yesterday from my Jenkins 
[http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/2919]:

{noformat}
   [junit4]   2> 284268 ERROR 
(zkCallback-176-thread-1-processing-n:127.0.0.1:46806_ug_) 
[n:127.0.0.1:46806_ug_ c:collMinRf_1x3 s:shard1 r:core_node2 
x:collMinRf_1x3_shard1_replica2] o.a.s.c.ShardLeaderElectionContext There was a 
problem trying to register as the leader:java.lang.NullPointerException
   [junit4]   2>        at 
org.apache.solr.cloud.ShardLeaderElectionContext.runLeaderProcess(ElectionContext.java:422)
   [junit4]   2>        at 
org.apache.solr.cloud.LeaderElector.runIamLeaderProcess(LeaderElector.java:170)
   [junit4]   2>        at 
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:135)
   [junit4]   2>        at 
org.apache.solr.cloud.LeaderElector.access$200(LeaderElector.java:56)
   [junit4]   2>        at 
org.apache.solr.cloud.LeaderElector$ElectionWatcher.process(LeaderElector.java:348)
   [junit4]   2>        at 
org.apache.solr.common.cloud.SolrZkClient$3.lambda$process$0(SolrZkClient.java:268)
   [junit4]   2>        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
   [junit4]   2>        at 
java.util.concurrent.FutureTask.run(FutureTask.java:266)
   [junit4]   2>        at 
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229)
   [junit4]   2>        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
   [junit4]   2>        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
   [junit4]   2>        at java.lang.Thread.run(Thread.java:745)
[...]
   [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=HttpPartitionTest 
-Dtests.method=test -Dtests.seed=BD012B32E5FEC12B -Dtests.slow=true 
-Dtests.locale=de-AT -Dtests.timezone=Asia/Bishkek -Dtests.asserts=true 
-Dtests.file.encoding=US-ASCII
   [junit4] ERROR    167s J3  | HttpPartitionTest.test <<<
   [junit4]    > Throwable #1: 
com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught 
exception in thread: Thread[id=1344, name=SocketProxy-Response-60387:59366, 
state=RUNNABLE, group=TGRP-HttpPartitionTest]
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([BD012B32E5FEC12B:355514E84B02ACD3]:0)
   [junit4]    > Caused by: java.lang.RuntimeException: 
java.net.SocketException: Socket is closed
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([BD012B32E5FEC12B]:0)
   [junit4]    >        at 
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347)
   [junit4]    > Caused by: java.net.SocketException: Socket is closed
   [junit4]    >        at java.net.Socket.setSoTimeout(Socket.java:1137)
   [junit4]    >        at 
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344)
   [junit4]   2> 284534 INFO  
(SUITE-HttpPartitionTest-seed#[BD012B32E5FEC12B]-worker) [    ] 
o.a.s.SolrTestCaseJ4 ###deleteCore
   [junit4]   2> NOTE: test params are: codec=Asserting(Lucene62): 
{range_facet_l_dv=Lucene50(blocksize=128), _version_=FSTOrd50, 
multiDefault=PostingsFormat(name=Memory doPackFST= true), a_t=FSTOrd50, 
intDefault=FSTOrd50, id=Lucene50(blocksize=128), range_facet_i_dv=FSTOrd50, 
text=PostingsFormat(name=Memory doPackFST= false), range_facet_l=FSTOrd50, 
timestamp=FSTOrd50}, 
docValues:{range_facet_l_dv=DocValuesFormat(name=Lucene54), 
range_facet_i_dv=DocValuesFormat(name=Lucene54), 
timestamp=DocValuesFormat(name=Lucene54)}, maxPointsInLeafNode=256, 
maxMBSortInHeap=5.671099811904124, 
sim=RandomSimilarity(queryNorm=false,coord=yes): {}, locale=de-AT, 
timezone=Asia/Bishkek
   [junit4]   2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation 
1.8.0_77 (64-bit)/cpus=16,threads=1,free=351228904,total=524812288
   [junit4]   2> NOTE: All tests run in this JVM: [AutoCommitTest, 
BlockJoinFacetDistribTest, HdfsDirectoryFactoryTest, CopyFieldTest, 
DocExpirationUpdateProcessorFactoryTest, ZkStateReaderTest, 
PathHierarchyTokenizerFactoryTest, DateFieldTest, TestTestInjection, 
TestSolrQueryParser, TestConfigSetsAPI, HttpPartitionTest]
   [junit4] Completed [184/635 (1!)] on J3 in 167.39s, 1 test, 1 error <<< 
FAILURES!
{noformat}

> Investigate org.apache.solr.cloud.HttpPartitionTest.test failing more 
> commonly.
> -------------------------------------------------------------------------------
>
>                 Key: SOLR-8456
>                 URL: https://issues.apache.org/jira/browse/SOLR-8456
>             Project: Solr
>          Issue Type: Test
>            Reporter: Mark Miller
>            Assignee: Timothy Potter
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to