[ https://issues.apache.org/jira/browse/CASSANDRA-17142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456703#comment-17456703 ]
Yifan Cai commented on CASSANDRA-17142: --------------------------------------- PR: https://github.com/apache/cassandra/pull/1357 CI: https://app.circleci.com/pipelines/github/yifan-c/cassandra/288/workflows/566bde90-9b69-4743-9e12-a97d4a4c137a (mostly green) As mentioned in the description, the PR add the configuration, {{max_hints_size_per_host_in_mb}}, to limit the total size of the hints per host. It is off by default. Thanks [~Gerrrr] for bringing up the Guardrails framework. Read though the CEP and the first merged prototype. I think ultimately it is a good fit as you mentioned. But as of now, the implementation is client facing since its foundation classes depends on ClientState and ClientWarn. It does not fit very well with those system-internal limits. I can see Guardrails is approaching it iteratively. With respect to this ticket, we do not have to block it. We can merge it first and address the required refactoring along with the other system-internal limits when Guardrails evolves. > Limit the maximum hints size per host > ------------------------------------- > > Key: CASSANDRA-17142 > URL: https://issues.apache.org/jira/browse/CASSANDRA-17142 > Project: Cassandra > Issue Type: Improvement > Components: Consistency/Hints > Reporter: Yifan Cai > Assignee: Yifan Cai > Priority: Normal > > The hints system defines a time window, i.e. max_hint_window_in_ms, to store > the hints. > It defines no limit on how much data can be kept during the time window. The > hints can grow excessively and make the node running out of disk. In such > scenario, the operators have to truncate the hints manually. > I'd propose that in addition to the conventional hints window, operators > should be able to define the maximum hints size per host, i.e. > max_hints_size_per_host_in_mb, to provide an another layer of protection. A > node stops to store hints for the down node whenever it reaches to the time > cap or the size cap. In order to not surprise the users, the config should be > disabled by default. It should also be configurable via JMX. -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org