[ 
https://issues.apache.org/jira/browse/SOLR-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-9290:
---------------------------
    Attachment: index.sh
                setup-solr.sh

FWIW, I'm attaching a beefed up setup-solr.sh and an index.sh i've been testing 
with...

* setup-solr.sh
** you must edit 2 variables: the path to your lucene checkout & an absolute 
path to the SSL keystore for jetty to use (with a password "secret)
** spins up a 3 node cluster, then creates a collection with has 5 shards an 
rep factor of 3
* index.sh
** you must edit one variable to point at the SSL pem file for curl to use
** loops forever doing a bunch of curl connections indexing the same 9 docs 
over and over, periodically commiting & sleeping, reporting the # of CLOSE_WAIT 
java connections at each step

On master, index.sh never reports _any_ CLOSE_WAIT connections for me.

On branch_6x, I'll see the CLOSE_WAITS spike up to 40 - even with this 
(essentially) single threaded indexing, and stay at stead state even after 
killing the index.sh process

On branch_6x, with shalin's patch, CLOSE_WAITS start at 15 (which is 
suspiciously 5x3) as soon as the collection is created -- even w/o indexing -- 
and stay steady state at 15 forever.

Which begs the question: why are there 15 CLOSE_WAIT connections that last 
forever on branch_6x even with this patch?

> TCP-connections in CLOSE_WAIT spikes during heavy indexing when SSL is enabled
> ------------------------------------------------------------------------------
>
>                 Key: SOLR-9290
>                 URL: https://issues.apache.org/jira/browse/SOLR-9290
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>    Affects Versions: 5.5.1, 5.5.2
>            Reporter: Anshum Gupta
>            Priority: Critical
>         Attachments: SOLR-9290-debug.patch, SOLR-9290-debug.patch, index.sh, 
> setup-solr.sh, setup-solr.sh
>
>
> Heavy indexing on Solr with SSL leads to a lot of connections in CLOSE_WAIT 
> state. 
> At my workplace, we have seen this issue only with 5.5.1 and could not 
> reproduce it with 5.4.1 but from my conversation with Shalin, he knows of 
> users with 5.3.1 running into this issue too. 
> Here's an excerpt from the email [~shaie] sent to the mailing list  (about 
> what we see:
> {quote}
> 1) It consistently reproduces on 5.5.1, but *does not* reproduce on 5.4.1
> 2) It does not reproduce when SSL is disabled
> 3) Restarting the Solr process (sometimes both need to be restarted), the
> count drops to 0, but if indexing continues, they climb up again
> When it does happen, Solr seems stuck. The leader cannot talk to the
> replica, or vice versa, the replica is usually put in DOWN state and
> there's no way to fix it besides restarting the JVM.
> {quote}
> Here's the mail thread: 
> http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201607.mbox/%3c46cc66220a8143dc903fa34e79205...@vp-exc01.dips.local%3E
> Creating this issue so we could track this and have more people comment on 
> what they see. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to