[ https://issues.apache.org/jira/browse/SPARK-5851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nicholas Chammas updated SPARK-5851:
------------------------------------
    Description:
The following function doesn't distinguish between the ssh failing (e.g. presumably a connection issue) and the remote command that it executes failing (e.g. setup.sh). The latter should probably not result in a retry.

Perhaps tries could be an argument that is set to 1 for certain usages.

# Run a command on a host through ssh, retrying up to five times
# and then throwing an exception if ssh continues to fail.
[{{def ssh(host, opts, command)}}|https://github.com/apache/spark/blob/d8f69cf78862d13a48392a0b94388b8d403523da/ec2/spark_ec2.py#L953-L975]

  was:
The following function doesn't distinguish between the ssh failing (e.g. presumably a connection issue) and the remote command that it executes failing (e.g. setup.sh). The latter should probably not result in a retry.

Perhaps tries could be an argument that is set to 1 for certain usages.

# Run a command on a host through ssh, retrying up to five times
# and then throwing an exception if ssh continues to fail.
{code}
def ssh(host, opts, command):
{code}

> spark_ec2.py ssh failure retry handling not always appropriate
> --------------------------------------------------------------
>
>                 Key: SPARK-5851
>                 URL: https://issues.apache.org/jira/browse/SPARK-5851
>             Project: Spark
>          Issue Type: Bug
>          Components: EC2
>            Reporter: Florian Verhein
>            Priority: Minor
>
> The following function doesn't distinguish between the ssh failing (e.g.
> presumably a connection issue) and the remote command that it executes
> failing (e.g. setup.sh). The latter should probably not result in a retry.
> Perhaps tries could be an argument that is set to 1 for certain usages.
> # Run a command on a host through ssh, retrying up to five times
> # and then throwing an exception if ssh continues to fail.
> [{{def ssh(host, opts, command)}}|https://github.com/apache/spark/blob/d8f69cf78862d13a48392a0b94388b8d403523da/ec2/spark_ec2.py#L953-L975]
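
For illustration only, the suggestion in the description (make tries an argument, and do not retry when the remote command itself fails) might be sketched roughly as follows. This is not the code in spark_ec2.py. The names ssh_with_retry, RemoteCommandError and SshConnectionError, and the run/sleep parameters, are hypothetical and exist only to make the sketch testable. It relies on ssh exiting with status 255 when ssh itself hits an error, so a remote command that happens to exit with 255 would still be treated as a connection failure.

```python
import subprocess
import time

# ssh exits with the remote command's status, or 255 if ssh itself failed.
SSH_ERROR_EXIT_CODE = 255


class RemoteCommandError(Exception):
    """Hypothetical: ssh connected, but the remote command exited non-zero."""


class SshConnectionError(Exception):
    """Hypothetical: ssh itself failed on every attempt."""


def ssh_with_retry(argv, tries=5, delay=30,
                   run=subprocess.call, sleep=time.sleep):
    """Run an ssh command line, retrying only when ssh itself fails.

    argv  -- full command line, e.g. ['ssh', 'user@host', './setup.sh']
    tries -- maximum number of attempts; pass 1 to disable retries
    delay -- seconds to wait between attempts
    run   -- callable that runs argv and returns its exit status
    sleep -- callable used to wait (both are injectable for testing)

    Returns the number of attempts that were needed.
    """
    if tries < 1:
        raise ValueError("tries must be at least 1")
    for attempt in range(1, tries + 1):
        status = run(argv)
        if status == 0:
            return attempt
        if status != SSH_ERROR_EXIT_CODE:
            # The connection worked and the remote command failed:
            # retrying is unlikely to help, so fail immediately.
            raise RemoteCommandError(
                "remote command exited with status %d" % status)
        if attempt < tries:
            # ssh-level failure (e.g. host not reachable yet): wait, retry.
            sleep(delay)
    raise SshConnectionError("ssh failed after %d attempt(s)" % tries)
```

A caller that runs something like setup.sh could pass tries=1 to switch retries off entirely, while a caller that is waiting for a freshly launched instance to accept connections could keep the default.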