wrmxs227 commented on code in PR #27759:
URL: https://github.com/apache/flink/pull/27759#discussion_r2930285675


##########
flink-models/flink-model-triton/src/main/java/org/apache/flink/model/triton/TritonOptions.java:
##########
@@ -181,4 +181,36 @@ private TritonOptions() {
                                                     + "Example: %s",
                                             
code("'X-Custom-Header:value,X-Another:value2'"))
                                     .build());
+
+    @Documentation.Section({Documentation.Sections.MODEL_TRITON_ADVANCED})
+    public static final ConfigOption<Integer> MAX_RETRIES =
+            ConfigOptions.key("max-retries")
+                    .intType()
+                    .defaultValue(0)
+                    .withDescription(
+                            "Maximum number of retry attempts for failed 
inference requests. "
+                                    + "Retries are triggered by network errors 
and retryable server "
+                                    + "errors (HTTP 503, 504). Client errors 
(HTTP 4xx) are not retried. "
+                                    + "Defaults to 0 (no retries).");

Review Comment:
   Good point👍. Added HTTP 408 (Request Timeout) and HTTP 429 (Too Many 
Requests) to both isRetryable() and the config description.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to