keith-turner commented on PR #5726: URL: https://github.com/apache/accumulo/pull/5726#issuecomment-3075668809
Ran the two tests again, tracking average files per tablet. With `compactor.failure.backoff.interval=3s`, seeing much better numbers for average files per tablet and max files per tablet. Looking at the logs, the coordinator scans the tservers every 60s by default. If a bad compactor takes a tablet's job after that scan, then that tablet will not be found again until the next scan by the coordinator. Some tablets kept getting unlucky and were picked up by bad compactors after each scan, so they did not compact for 3 or 4 minutes even as new bulk import files kept rolling in. I know it would be a change in behavior, but wondering if the default settings should make compactors back off. It could be slight, like `compactor.failure.backoff.interval=100ms`.
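
For reference, the two values mentioned in this comment could be written as a properties fragment roughly like the one below. This is only a sketch: it assumes the property is set in `accumulo.properties` and keeps the name `compactor.failure.backoff.interval` as used in this PR, which may change before merge.

```properties
# Value used in the test runs described in this comment.
compactor.failure.backoff.interval=3s

# Smaller value floated in this comment as a possible default (assumption: not an adopted default).
# compactor.failure.backoff.interval=100ms
```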
