tillrohrmann commented on a change in pull request #8484: [FLINK-12547] Add connection and socket timeouts for the blob client URL: https://github.com/apache/flink/pull/8484#discussion_r288022559
########## File path: flink-core/src/main/java/org/apache/flink/configuration/BlobServerOptions.java ########## @@ -102,4 +102,20 @@ public static final ConfigOption<Integer> OFFLOAD_MINSIZE = key("blob.offload.minsize") .defaultValue(1_024 * 1_024) // 1MiB by default .withDescription("The minimum size for messages to be offloaded to the BlobServer."); + + /** + * The socket timeout in milliseconds for the blob client. + */ + public static final ConfigOption<Integer> SO_TIMEOUT = + key("blob.client.socket.timeout") + .defaultValue(120_000) Review comment: The reason I'm asking is because we might break existing setups with this default value. Imagine you have a really large setup (let's say 100 TMs) and all of them need to download a large set of blobs. Since the `BlobServer` only serves 50 connections concurrently, it might be the case that 50 others will time out if the transmission takes longer than 2 minutes. I think the Netty case is a bit different because we don't limit the number of concurrent transactions. So concretely, I'm wondering whether we should set the default value to `0` to maintain the existing behaviour or to something a bit larger (e.g. 10 minutes). Or alternatively, we might add a bit more meaningful error message allowing users to pin point the problem and what configuration value to increase. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services