Hossein Falaki created SPARK-17919:
--------------------------------------

             Summary: Make timeout to RBackend configurable in SparkR
                 Key: SPARK-17919
                 URL: https://issues.apache.org/jira/browse/SPARK-17919
             Project: Spark
          Issue Type: Story
          Components: SparkR
    Affects Versions: 2.0.1
            Reporter: Hossein Falaki

I am working on a project where {{gapply()}} is used with a large dataset that happens to be extremely skewed. On the skewed partition, the user function takes more than 2 hours to return, which exceeds the timeout that we hardcode in SparkR for the backend connection.

{code}
connectBackend <- function(hostname, port, timeout = 6000)
{code}

Ideally, users should be able to reconfigure Spark and increase the timeout. It should be a small fix.
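
One possible shape for the fix, as a minimal sketch rather than the actual SparkR change: resolve the timeout from an environment variable and fall back to the current default of 6000 when it is unset or invalid. The helper {{resolveBackendTimeout()}}, the wrapper {{connectBackendSketch()}}, and the variable name {{EXAMPLE_SPARKR_BACKEND_TIMEOUT}} are all hypothetical names chosen for illustration. The sketch also assumes the timeout is handed to base R's {{socketConnection()}}, whose {{timeout}} argument is in seconds, so 6000 would correspond to 100 minutes.

```r
# Hypothetical helper (not part of SparkR): read the backend connection
# timeout, in seconds, from an environment variable. Falls back to the
# currently hardcoded default when the variable is unset or invalid.
resolveBackendTimeout <- function(envVar = "EXAMPLE_SPARKR_BACKEND_TIMEOUT",
                                  default = 6000) {
  raw <- Sys.getenv(envVar, unset = NA)
  if (is.na(raw) || !nzchar(raw)) {
    return(default)
  }
  value <- suppressWarnings(as.integer(raw))
  if (is.na(value) || value <= 0) {
    warning("Ignoring invalid timeout '", raw, "'; using ", default)
    return(default)
  }
  value
}

# Sketch of a connect function that picks up the configured value.
# Assumption: the timeout is passed straight to socketConnection().
connectBackendSketch <- function(hostname, port,
                                 timeout = resolveBackendTimeout()) {
  socketConnection(host = hostname, port = port, server = FALSE,
                   blocking = TRUE, open = "wb", timeout = timeout)
}
```

With this in place, a user hitting the 2 hour case could set the variable to, say, 10800 (3 hours) before starting SparkR. Because the default argument is evaluated lazily, callers that pass {{timeout}} explicitly are unaffected.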