On 23 November 2015 at 01:36, Major Hayden <ma...@mhtx.net> wrote:

> Hey folks,
>
> Some of my recent reviews have been frequent fliers in the land of CI gate
> jobs and I've spent a fair amount of time diagnosing random ssh failures to
> containers in AIO builds.  The error I get most often is this:
>
>     SSH Error: data could not be sent to the remote host. Make sure this
> host can be reached over ssh
>
> After digging in Ansible code for a bit, I found the error within the ssh
> connection plugin[1].  It looks like an issue where the ssh connection is
> actually open but data cannot be sent to the subprocess.
>
> I messed around heavily with multiplexing, keys, GSSAPI, and more, but the
> errors randomly appear.  I've proposed a review[2] for a switch to paramiko
> transport mode for gate jobs only and it has run four times without ssh
> errors (although two builds had timeouts due to the repo build taking too
> long).
>
> The fifth build is running now and it seems to be moving along fairly
> quickly.
>
> [1]
> https://github.com/ansible/ansible/blob/devel/lib/ansible/plugins/connection/ssh.py#L245-L260
> [2] https://review.openstack.org/#/c/248361/


Thanks for digging into this Major. It is a royal pain and will likely be
resolved with the release of Ansible 2, but for now we're stuck with having
to work around the issue with what we have.

I wonder, is there a difference in results or performance between using
paramiko or turning ssh pipelining off?
__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Reply via email to