After some further fiddling, I should redefine the problem.
It's not that I'm having problems when connecting simultaneously to a large
number of hosts, but rather I'm having problems when doing so over a
forwarded ssh connection.
So I'm doing this:
ssh-agent (localmachine) -> jump server -> 130
Thus said Bob Belnap on Wed, 22 Apr 2009 09:21:38 MDT:
> Another clue to the puzzle. I have 1300 or so machines in a DC in Hong
> Kong, only available through a jump server in the same DC. If I'm
> running my agent on my local machine, through the jump server, and
> connect to all the m
Bob Belnap wrote:
> r...@chub:~# ls /proc/29019/fd/
I suppose I should have asked for "ls -l", since the list of fds by
itself isn't especially helpful. Perhaps "lsof -p " would have
been even more helpful, but it looks like you've determi
Thanks for your help Frank.
On Tue, Apr 21, 2009 at 7:57 PM, Frank Sorenson wrote:
>
> The manpage for read(2) shows:
> EAGAIN Non-blocking I/O has been selected using O_NONBLOCK and no
> data was immediately available for reading.
>
> Can you show us the output of: readlink /proc/`pidof
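
To make the EAGAIN behaviour quoted from read(2) concrete, here is a minimal Python sketch. It is not taken from the thread and does not involve ssh-agent; it uses an ordinary pipe, and the helper name read_nonblocking is made up for illustration. It only shows that a read on a descriptor in O_NONBLOCK mode fails with EAGAIN when no data is waiting, rather than blocking.

```python
import errno
import os


def read_nonblocking(fd, size=1024):
    """Return the bytes read, or None if the read would block (EAGAIN).

    Hypothetical helper for illustration only.
    """
    try:
        return os.read(fd, size)
    except BlockingIOError as exc:
        # Python raises BlockingIOError when read(2) fails with
        # EAGAIN or EWOULDBLOCK.
        if exc.errno in (errno.EAGAIN, errno.EWOULDBLOCK):
            return None
        raise


# A plain pipe stands in for the socket; the read end is set to O_NONBLOCK.
r, w = os.pipe()
os.set_blocking(r, False)

first = read_nonblocking(r)    # nothing written yet, so the read would block
os.write(w, b"ping")
second = read_nonblocking(r)   # data is now available

os.close(r)
os.close(w)
```

A process that keeps retrying such a read without waiting for readiness (for example with select or poll) will spin on the CPU, which would be consistent with the repeated EAGAIN lines in the strace output, though the thread does not confirm that this is what ssh-agent is doing.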
Bob Belnap wrote:
> Hi,
>
> I'm having problems with ssh-agent when I am connecting to a large number
> (several hundred) of hosts at once. I'm using kanif (
> http://taktuk.gforge.inria.fr/kanif/) which is a very nice package that
> distributes ssh connecti
On Mon, Apr 20, 2009 at 9:27 AM, Bryan Sant wrote:
> Bob, what state are your sockets in when you run netstat? Do you have
> a bunch of sockets in the TIME_WAIT or CLOSE_WAIT state?
>
> -Bryan
>
No, they are CONNECTED or CONNECTING, nothing in TIME_WAIT or CLOSE_WAIT. A
representative sample:
On Thu, Apr 16, 2009 at 11:15 AM, Bob Belnap wrote:
> while pegging the cpu. Tracking the number of connections to the agent once
> every second (while true; do netstat -an | grep -c ;
> sleep 1) looks like:
>
> 5
> 5
> 5
> 35
> 98
> 154
> 155
> 200
> 287
> 287
Bob, what state are your sockets i
Thus said Bob Belnap on Thu, 16 Apr 2009 11:15:41 MDT:
> running strace on the ssh-agent, things start out ok, then go sour and it
> starts spitting out:
>
> read(160, 0xbf8f300a, 1024) = -1 EAGAIN (Resource temporarily unavailable)
> read(160, 0xbf8f300a, 1024) = -1 EAGA
Hi,
I'm having problems with ssh-agent when I am connecting to a large number
(several hundred) of hosts at once. I'm using kanif (
http://taktuk.gforge.inria.fr/kanif/) which is a very nice package that
distributes ssh connections across the hosts you are connecting to (a
fan-out sort of approach, so al