Steve Loughran created HADOOP-14810:
---------------------------------------

             Summary: S3Guard: handle provisioning failure through backoff & retry (& metrics)
                 Key: HADOOP-14810
                 URL: https://issues.apache.org/jira/browse/HADOOP-14810
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: fs/s3
    Affects Versions: HADOOP-13345
            Reporter: Steve Loughran


S3Guard can't handle overloaded tables. I think we all thought the API did: it doesn't; exceptions get raised and the caller is expected to handle them.

This relates very much to the s3a-lambda invocation code in HADOOP-13786 to handle failures during commit, and to the need for all the S3AFileSystem calls of the S3 APIs to handle transient failures like throttling. Again, it needs some fault injection to verify the handling, and metrics to count the failure rate so it can be monitored and used to understand why work is underperforming.
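
For illustration only, here is a minimal sketch of the three pieces asked for above: retry with exponential backoff, a counter for throttle events, and a fault injector to exercise the handling. It is not the S3Guard or S3A code. BackoffRetry, ThrottledException, THROTTLE_EVENTS, retry() and failingFirst() are all hypothetical names invented for this sketch, and the randomized ("full jitter") delay is one assumed policy among several.

```java
import java.util.concurrent.ThreadLocalRandom;
import java.util.concurrent.atomic.AtomicLong;
import java.util.function.Supplier;

/** Sketch only: every name here is hypothetical, not a Hadoop or AWS API. */
final class BackoffRetry {

  /** Hypothetical stand-in for a store's "throughput exceeded" error. */
  static final class ThrottledException extends RuntimeException {
    ThrottledException(String message) {
      super(message);
    }
  }

  /** Metric: number of throttle events seen, so the rate can be monitored. */
  static final AtomicLong THROTTLE_EVENTS = new AtomicLong();

  private BackoffRetry() {
  }

  /**
   * Invoke the operation, retrying on ThrottledException with a randomized,
   * exponentially growing delay. Any other exception propagates at once.
   */
  static <T> T retry(Supplier<T> operation, int maxAttempts,
      long baseDelayMillis, long maxDelayMillis) {
    if (maxAttempts < 1 || baseDelayMillis < 0 || maxDelayMillis < 0) {
      throw new IllegalArgumentException("invalid retry parameters");
    }
    // Upper bound of the next sleep; doubles after each throttled attempt.
    long ceiling = Math.min(baseDelayMillis, maxDelayMillis);
    for (int attempt = 1; ; attempt++) {
      try {
        return operation.get();
      } catch (ThrottledException e) {
        THROTTLE_EVENTS.incrementAndGet();
        if (attempt >= maxAttempts) {
          throw e; // out of attempts: surface the failure to the caller
        }
        try {
          // Full jitter: sleep a random time in [0, ceiling] milliseconds.
          Thread.sleep(ThreadLocalRandom.current().nextLong(ceiling + 1));
        } catch (InterruptedException ie) {
          Thread.currentThread().interrupt();
          throw new IllegalStateException("interrupted during backoff", ie);
        }
        ceiling = Math.min(maxDelayMillis, ceiling * 2);
      }
    }
  }

  /** Fault injection: an operation that is throttled on its first N calls. */
  static <T> Supplier<T> failingFirst(int failures, T result) {
    AtomicLong calls = new AtomicLong();
    return () -> {
      if (calls.incrementAndGet() <= failures) {
        throw new ThrottledException("injected throttle");
      }
      return result;
    };
  }
}
```

Wrapping a call as BackoffRetry.retry(BackoffRetry.failingFirst(2, "ok"), 5, 10, 1000) would be throttled twice, then return the result and leave two events on the counter. With more injected failures than attempts, the last ThrottledException reaches the caller. A real implementation would publish the counter through the filesystem's existing metrics rather than a static field.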