cxzl25 opened a new pull request, #3488: URL: https://github.com/apache/celeborn/pull/3488
### What changes were proposed in this pull request?

Introduce the configuration `celeborn.worker.graceful.shutdown.dbDeleteFailurePolicy`. When it is set to `EXIT`, the worker exits gracefully if a RocksDB delete fails.

### Why are the changes needed?

`StorageManager#cleanupExpiredShuffleKey` can leave some shuffle files uncleaned when a RocksDB delete throws an exception, which eventually fills the worker data directory. Currently these leftover files can only be cleaned up by restarting the worker.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

GA
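To illustrate the idea behind the `EXIT` policy, here is a minimal, self-contained sketch. It is not the code from this PR: the class `DbDeleteFailurePolicySketch`, the method `cleanupExpiredKeys`, the `IGNORE` value, and the callback parameters are all hypothetical names chosen for illustration. Only the `EXIT` value comes from the description above. The sketch assumes that on a delete failure under `EXIT` the worker stops cleaning up and requests a graceful exit, so that a restart can reclaim the leftover files.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

class DbDeleteFailurePolicySketch {

    // EXIT is the value named in the PR description; IGNORE is a
    // hypothetical alternative included only for contrast.
    enum DeleteFailurePolicy { EXIT, IGNORE }

    /**
     * Tries to delete every expired key from the DB (hypothetical helper).
     * Returns the keys whose delete threw an exception. Under EXIT, the
     * first failure triggers gracefulExit and stops further deletes.
     */
    static List<String> cleanupExpiredKeys(
            List<String> expiredKeys,
            Consumer<String> dbDelete,
            DeleteFailurePolicy policy,
            Runnable gracefulExit) {
        List<String> failedKeys = new ArrayList<>();
        for (String key : expiredKeys) {
            try {
                dbDelete.accept(key);
            } catch (RuntimeException e) {
                failedKeys.add(key);
                if (policy == DeleteFailurePolicy.EXIT) {
                    // Assumption: a restart is what finally cleans up the files.
                    gracefulExit.run();
                    break;
                }
                // IGNORE: record the failure and continue with the next key.
            }
        }
        return failedKeys;
    }

    public static void main(String[] args) {
        // Simulated DB delete that fails for one specific key.
        Consumer<String> flakyDelete = key -> {
            if (key.equals("app-1-shuffle-2")) {
                throw new IllegalStateException("simulated DB delete failure");
            }
        };
        List<String> keys =
                List.of("app-1-shuffle-1", "app-1-shuffle-2", "app-1-shuffle-3");
        List<String> failed = cleanupExpiredKeys(
                keys, flakyDelete, DeleteFailurePolicy.EXIT,
                () -> System.out.println("requesting graceful worker exit"));
        System.out.println("keys with failed deletes: " + failed);
    }
}
```

Passing the exit action as a `Runnable` keeps the sketch testable without terminating the JVM; a real worker would presumably hook this into its own shutdown path.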
