[ 
https://issues.apache.org/jira/browse/SOLR-7121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14529427#comment-14529427
 ] 

Sachin Goyal commented on SOLR-7121:
------------------------------------

I guess the threshold settings are a little different from the other settings 
because they factor in the underlying machine's capabilities.
So cores that belong to the same collection but run on different kinds of 
machines can each use this setting with values suited to their own machine.

For the other settings, the configuration is tied more closely to the type of 
collection.
Hence it's OK for them to differ across collections while staying the same 
across all cores in a collection.

If we remove the above configuration, it might be somewhat difficult to test 
this feature, because currently we enable the thresholds on only a single core 
by means of the regular expression.
Maybe we can keep the regular expression as a testing-only option, like the 
`solrHealthMonitorClass` option?
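
To make the testing-only regular expression idea concrete, here is a rough 
sketch in Python. It is not taken from the patch: `ThresholdConfig`, 
`thresholds_apply`, `is_unhealthy`, and every field name are made up for 
illustration, and the two thresholds shown (thread count and GC pause) are just 
examples drawn from the issue description.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ThresholdConfig:
    """Hypothetical per-node settings; none of these names come from the patch."""
    max_threads: int
    max_gc_pause_ms: int
    # Assumed testing-only option: when set, thresholds are enforced only for
    # cores whose full name matches this regular expression.
    core_name_regex: Optional[str] = None


def thresholds_apply(config: ThresholdConfig, core_name: str) -> bool:
    """Return True if the thresholds should be enforced for this core."""
    if config.core_name_regex is None:
        # No regex configured: every core on the node is covered.
        return True
    return re.fullmatch(config.core_name_regex, core_name) is not None


def is_unhealthy(config: ThresholdConfig, core_name: str,
                 threads: int, gc_pause_ms: int) -> bool:
    """Return True if a covered core has crossed any configured threshold."""
    if not thresholds_apply(config, core_name):
        return False
    return threads > config.max_threads or gc_pause_ms > config.max_gc_pause_ms
```

With a config such as `ThresholdConfig(200, 500, r"collection1_shard1_replica1")`, 
only that one core would be checked against the thresholds, which is roughly 
what the tests rely on today. Leaving the regex unset would cover every core on 
the node.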

> Solr nodes should go down based on configurable thresholds and not rely on 
> resource exhaustion
> ----------------------------------------------------------------------------------------------
>
>                 Key: SOLR-7121
>                 URL: https://issues.apache.org/jira/browse/SOLR-7121
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Sachin Goyal
>            Assignee: Mark Miller
>         Attachments: SOLR-7121.patch, SOLR-7121.patch, SOLR-7121.patch, 
> SOLR-7121.patch, SOLR-7121.patch, SOLR-7121.patch, SOLR-7121.patch
>
>
> Currently, there is no way to control when a Solr node goes down.
> If the server has long GC pauses or too many threads, or is just getting 
> too many queries due to a bad load balancer, the cores on the machine keep 
> serving until they exhaust the machine's resources and everything comes 
> to a stall.
> Such a slowly dying core can affect other cores as well by taking a long 
> time to serve their distributed queries.
> There should be a way to specify some threshold values beyond which the 
> targeted core can detect its ill-health and proactively go down to recover.
> When the load improves, the core should come up automatically.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

