Ethan Rose created HDDS-13091:
---------------------------------

             Summary: Improve default configurations for container and volume 
scan windows and intervals
                 Key: HDDS-13091
                 URL: https://issues.apache.org/jira/browse/HDDS-13091
             Project: Apache Ozone
          Issue Type: Sub-task
            Reporter: Ethan Rose


If using a time-based sliding window to determine volume failure, a few configs 
need to be set accordingly:
* Minimum scan gap between repeated volume scans.
** This is currently set to 15 minutes, which is probably too high given that 
volume scans are cheap.
* The interval at which the background volume scanner runs.
** This is currently set to 1 hour.
* The bandwidth of the container data scanner
** This affects the rate at which it can scan containers.
** Each unhealthy container will trigger an on-demand volume scan.
** This is currently set to 5 MB/s, which, assuming a full 5 GB container, 
means we can expect one container scan result roughly every 17 minutes.
* The number of failed checks within the sliding window required to fail a 
volume.
** This number must be set such that volume scans triggered either by the 
background volume scanner or unhealthy containers from the container scanner 
have a chance to mark the volume as failed.
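
To make the interaction between these settings concrete, here is a rough Python sketch. It is not Ozone code: the SlidingWindowFailureCheck class, the one-hour window, and the threshold of 3 are hypothetical, and the 5 GiB container size is an assumption used to reproduce the 17-minute figure above.

```python
from collections import deque

# Assumed for illustration: a full container of 5 GiB.
CONTAINER_SIZE_BYTES = 5 * 1024**3
# From the issue: the data scanner is throttled to 5 MB/s.
SCAN_BANDWIDTH_BPS = 5 * 1024**2


def container_scan_minutes(size_bytes, bandwidth_bps):
    """Minutes needed to read one container end to end at the throttled rate."""
    return size_bytes / bandwidth_bps / 60


class SlidingWindowFailureCheck:
    """Hypothetical check: a volume fails once `threshold` failed scans
    fall within the last `window_s` seconds."""

    def __init__(self, window_s, threshold):
        self.window_s = window_s
        self.threshold = threshold
        self._failures = deque()  # timestamps (seconds) of failed scans

    def record_failure(self, now_s):
        self._failures.append(now_s)

    def is_failed(self, now_s):
        # Drop failures that have aged out of the window.
        while self._failures and now_s - self._failures[0] > self.window_s:
            self._failures.popleft()
        return len(self._failures) >= self.threshold


if __name__ == "__main__":
    gap_s = CONTAINER_SIZE_BYTES / SCAN_BANDWIDTH_BPS
    print(round(container_scan_minutes(CONTAINER_SIZE_BYTES, SCAN_BANDWIDTH_BPS), 1))

    # Hourly background scans alone: at most two failures land in a 1h window.
    hourly = SlidingWindowFailureCheck(window_s=3600, threshold=3)
    for t in (0, 3600):
        hourly.record_failure(t)
    print(hourly.is_failed(3600))

    # On-demand scans triggered by back-to-back unhealthy containers.
    on_demand = SlidingWindowFailureCheck(window_s=3600, threshold=3)
    for i in range(3):
        on_demand.record_failure(i * gap_s)
    print(on_demand.is_failed(2 * gap_s))
```

Under these assumed values, hourly background scans alone never accumulate three failures inside a one-hour window, while on-demand scans spaced one container scan apart do. This is the kind of mismatch the last bullet is meant to avoid.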


