We’ve been experiencing this as well, and our simple solution is to actually
keep trying the ssh connection instead of just waiting. Something like this:
def wait_for_ssh_connection(opts, host):
    u.message("Waiting for ssh connection to host {}".format(host))
    connected = False
    while
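Since the snippet above is cut off, here is a minimal, self-contained sketch of the same retry idea. It is not the poster's code: the function names, the `max_attempts` and `delay` parameters, and the injectable `probe` and `sleep` arguments are all assumptions made for illustration. It also only checks that TCP port 22 accepts a connection, which is a weaker test than actually running a command over ssh.

```python
import socket
import time


def port_is_open(host, port=22, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and DNS failures.
        return False


def wait_for_ssh_connection(host, port=22, max_attempts=20, delay=5.0,
                            probe=port_is_open, sleep=time.sleep):
    """Poll the host until the probe succeeds instead of sleeping blindly.

    Returns the number of attempts it took. Raises TimeoutError if the
    host never becomes reachable within max_attempts tries.
    (Hypothetical helper; the names and defaults are illustrative.)
    """
    for attempt in range(1, max_attempts + 1):
        if probe(host, port):
            return attempt
        if attempt < max_attempts:
            sleep(delay)  # Back off before the next try.
    raise TimeoutError(
        "ssh port on {} not reachable after {} attempts".format(
            host, max_attempts))
```

The point of polling like this is that the launch proceeds as soon as the instance is reachable, so a slow instance type costs extra retries rather than a longer fixed wait for everyone.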
Hi,
Since 0.9.0 spark-ec2 has gone unstable. During launch it throws many
errors like:
ssh: connect to host ec-xx-xx-xx-xx.compute-1.amazonaws.com port 22: Connection refused
Error 255 while executing remote command, retrying after 30 seconds
... and recently, it prompts for passwords!:
Aureliano,
I've been noticing this error recently as well:
ssh: connect to host ec-xx-xx-xx-xx.compute-1.amazonaws.com port 22: Connection refused
Error 255 while executing remote command, retrying after 30 seconds
However, this isn't an issue with the spark-ec2 scripts. After the scripts
fail,
Frank,
Thanks for the prompt reply. Unfortunately, I've been experiencing this for
the past few weeks on the N. Virginia farm. Note that the latency might also
depend on the instance type.
I'll try to amend the ec2 script as you suggested, but that will mean
waiting even longer for the cluster to come