[ 
https://issues.apache.org/jira/browse/KUDU-1514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-1514:
------------------------------
    Target Version/s: 1.8.0  (was: 1.5.0)

> A tablet that ends up under replicated will spam logs
> -----------------------------------------------------
>
>                 Key: KUDU-1514
>                 URL: https://issues.apache.org/jira/browse/KUDU-1514
>             Project: Kudu
>          Issue Type: Bug
>          Components: consensus
>            Reporter: Jean-Daniel Cryans
>            Priority: Major
>
> Trying to debug a tablet that got down to 1 replica is hard when these lines 
> are printed multiple times per second:
> {noformat}
> W0704 23:05:30.999037   312 transaction_tracker.cc:112] Transaction failed, 
> tablet 807ff8e42640482d8d947b693d56ce03 transaction memory consumption 
> (67107918) has exceeded its limit (67108864) or the limit of an ancestral 
> tracker [suppressed 140 similar messages]
> I0704 23:05:31.000737 24321 consensus_peers.cc:181] T 
> 807ff8e42640482d8d947b693d56ce03 P 9e59a4c24de44e3f9de219df865b4f3b -> Peer 
> 94051c9253f94dadbc1af38098b41077 (e1105.halxg.cloudera.com:7050): Could not 
> obtain request from queue for peer: 94051c9253f94dadbc1af38098b41077. Status: 
> Not found: Failed to read ops 2302557..2325361: Segment 1118 which contained 
> index 2302557 has been GCed
> I0704 23:05:31.000780 24452 raft_consensus.cc:629] T 
> 807ff8e42640482d8d947b693d56ce03 P 9e59a4c24de44e3f9de219df865b4f3b [term 29 
> LEADER]: Processing failure of peer 94051c9253f94dadbc1af38098b41077 in term 
> 29 (The logs necessary to catch up peer 94051c9253f94dadbc1af38098b41077 have 
> been garbage collected. The follower will never be able to catch up (Not 
> found: Failed to read ops 2302557..2325361: Segment 1118 which contained 
> index 2302557 has been GCed)): There is already a config change operation in 
> progress. Unable to evict follower until it completes. Doing nothing.
> I0704 23:05:31.138310   378 raft_consensus.cc:1603] T 
> 807ff8e42640482d8d947b693d56ce03 P 9e59a4c24de44e3f9de219df865b4f3b [term 29 
> LEADER]: Leader election vote request: Denying vote to candidate 
> 94051c9253f94dadbc1af38098b41077 for term 5380 because replica is either 
> leader or believes a valid leader to be alive.
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to