Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-22 Thread Bob Belnap
After some further fiddling, I should redefine the problem. It's not that I'm having problems when connecting simultaneously to a large number of hosts, but rather I'm having problems when doing so over a forwarded ssh connection. so I'm doing this: ssh-agent (localmachine) -> jump server -> 130

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-22 Thread Andy Bradford
Thus said Bob Belnap on Wed, 22 Apr 2009 09:21:38 MDT: > Another clue to the puzzle. I have 1300 or so machines in a DC in Hong > Kong, only available through a jump server in the same DC. If I'm > running my agent on my local machine, through the jump server, and > connect to all the m

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-22 Thread Frank Sorenson
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Bob Belnap wrote: > r...@chub:~# ls /proc/29019/fd/ Suppose I probably should have asked for "ls -l", since the list of fd's itself isn't too especially helpful. Perhaps "lsof -p " would have been even more helpful, but it looks like you've determi

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-22 Thread Bob Belnap
Thanks for your help Frank. On Tue, Apr 21, 2009 at 7:57 PM, Frank Sorenson wrote: > > The manpage for read(2) shows: > EAGAIN Non-blocking I/O has been selected using O_NONBLOCK and no > data was immediately available for reading. > > Can you show us the output of: readlink /proc/`pidof

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-21 Thread Frank Sorenson
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Bob Belnap wrote: > Hi, > > I'm having problems with ssh-agent when I am connecting to a large (several > hundred) hosts at once. I'm using a kanif ( > http://taktuk.gforge.inria.fr/kanif/) which is a very nice package that > distributes ssh connecti

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-20 Thread Bob Belnap
On Mon, Apr 20, 2009 at 9:27 AM, Bryan Sant wrote: > Bob, what state are your sockets in when you run netstat? Do you have > a bunch of sockets in the TIME_WAIT or CLOSE_WAIT state? > > -Bryan > No, they are CONNECTED or CONNECTING, nothing in TIME_WAIT or CLOSE_WAIT. A representative sample:

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-20 Thread Bryan Sant
On Thu, Apr 16, 2009 at 11:15 AM, Bob Belnap wrote: > while pegging the cpu.  Tracking the number of connections to the agent once > every second (while true; do netstat -an | grep -c ; > sleep 1) looks like: > > 5 > 5 > 5 > 35 > 98 > 154 > 155 > 200 > 287 > 287 Bob, what state are your sockets i

Re: Issues with ssh-agent connecting to a large number of hosts at once

2009-04-19 Thread Andy Bradford
Thus said Bob Belnap on Thu, 16 Apr 2009 11:15:41 MDT: > running strace on the ssh-agent, things start out ok, then go sour and it > starts spitting out: > > read(160, 0xbf8f300a, 1024) = -1 EAGAIN (Resource temporarily > unavailable) > read(160, 0xbf8f300a, 1024) = -1 EAGA

Issues with ssh-agent connecting to a large number of hosts at once

2009-04-16 Thread Bob Belnap
Hi, I'm having problems with ssh-agent when I am connecting to a large (several hundred) hosts at once. I'm using a kanif ( http://taktuk.gforge.inria.fr/kanif/) which is a very nice package that distributes ssh connections across the hosts you are connecting to (a fan-out sort of approach, so al