Re: Problems with stability of networking / pfinet

2008-12-18 Thread olafBuddenhagen
Hi,

On Tue, Dec 16, 2008 at 11:13:04PM +0100, Thomas Schwinge wrote:
> On Sat, Jul 12, 2008 at 10:06:54AM +0200, I wrote:

> > For a few weeks now, I seem to notice more and more problems w.r.t.
> > networking.  Olaf confirmed this.

Well, not exactly: While I can confirm that disabling syslog fixed the
ssh problems on some of Barry's boxes, the networking problems I'm
experiencing on *my* machine seem to have a different nature: When
apt-get downloads many packages during a larger upgrade, pfinet
regularily hangs after a while. The current ssh connection consequently
hangs as well. New connections fail, though IIRC in an unusual manner.
(Don't remember the details.)

After restarting pfinet and sshd, everything works again. (Well, I also
need to create a new screen session -- the old session is not able to
create new screens anymore, though the existing ones go on working
perfectly fine...)

I never tried disabling syslog on my box I think -- but I have a hard
time believing this problem is related to syslog as well...

-antrik-




Re: Problems with stability of networking / pfinet

2008-12-16 Thread Thomas Schwinge
Hello!

Nothing fundamentally new; just noting down what we know so far.


On Sat, Jul 12, 2008 at 10:06:54AM +0200, I wrote:
> For a few weeks now, I seem to notice more and more problems w.r.t.
> networking.  Olaf confirmed this.
> 
> $ ssh flubber
> Warning: Permanently added '[flubber.bddebian.com]:2250' (RSA) to the 
> list of known hosts.
> Connection closed by 71.224.175.179
> 
> The connection is first established, but then closed.
> 
> $ ssh r...@flubber
> Warning: Permanently added '[flubber.bddebian.com]:2250' (RSA) to the 
> list of known hosts.
> r...@flubber.bddebian.com's password: 
> Connection closed by 71.224.175.179
> 
> The remote site is even asking for the password before then closing the
> connection again.

This difference is because the non-root case is using public key
authentification.

The underlying problem is that sshd hangs when it tries to syslog the new
successful login attempt.  (Samuel told about this already some time
ago.)  At this time everything using syslog will hang: be it a ``logger
foo'' command manually run, or be it using sudo which roughly does the
same logging sshd does.

This situation can easily be induced like this:

$ while logger .; do echo -n .; done

This will usually hang after a few hundred invokations.

Killing the system's pflocal server will ``fix'' this situation.  If you,
however, after that restart the system's syslogd, the looping command
will again hang after another few hundred invokations.


> A problem with the TCP/IP stack?  But nothing has been changed
> there, recently.  Michael also assured me that he's still using GCC 4.2
> for building pfinet (using GCC 4.3 indeed yields a dysfunctional one).
> Or a problem with the SSH packages?

Killing pfinet and restarting sshd indeed also does ``fix'' this
situation (why?), but neither pfinet nor sshd seem to be originally
involved in the underlying problem.


Regards,
 Thomas


signature.asc
Description: Digital signature


Re: Problems with stability of networking / pfinet

2008-07-18 Thread Thomas Schwinge
Hello!

On Wed, Jul 16, 2008 at 10:45:38AM +0200, I wrote:
> On Sat, Jul 12, 2008 at 10:06:54AM +0200, I wrote:
> > $ ssh flubber
> > Warning: Permanently added '[flubber.bddebian.com]:2250' (RSA) to the 
> > list of known hosts.
> > Connection closed by 71.224.175.179

I logged in as `root' on gnubber, created an account `tschwinge', logged
out, and tried to log in as `tschwinge' again, but:

[EMAIL PROTECTED]'s password:
Connection closed by 71.224.175.179

Logging in as `root' again fails the same way now.  We really need to
track this down.


Regards,
 Thomas


signature.asc
Description: Digital signature


Re: Problems with stability of networking / pfinet

2008-07-16 Thread Thomas Schwinge
Hello!

Barry rebooted flubber and I was able to do some (tiny amount of) work on
her, but now...

On Sat, Jul 12, 2008 at 10:06:54AM +0200, I wrote:
> $ ssh flubber
> Warning: Permanently added '[flubber.bddebian.com]:2250' (RSA) to the 
> list of known hosts.
> Connection closed by 71.224.175.179

... we're already again back to this one.  I did not really stress her
pfinet server instance, etc.  No idea what's going on there.  Goddamn.


Regards,
 Thomas


signature.asc
Description: Digital signature


Re: Problems with stability of networking / pfinet

2008-07-13 Thread olafBuddenhagen
Hi,

On Sat, Jul 12, 2008 at 10:06:54AM +0200, Thomas Schwinge wrote:

> For a few weeks now, I seem to notice more and more problems w.r.t.
> networking.

> Any idea?  A problem with the TCP/IP stack?  But nothing has been
> changed there, recently.

Well, I started seeing it some two or three months ago, and it could
have been in the archive much longer -- I don't upgrade very
regularily...

> Or a problem with the SSH packages?

Seems unlikely: Even forcibly killing and restarting sshd doesn't
restore it to sanity -- pfinet needs to be restarted fist, and only
afterwards restarting sshd helps.

In my case, the screen session also becomes partially disfunctional
(when trying to create a new window it says that it can't find the
shell...) -- I guess that's related to the fact that my home directory
is on NFS, and thus also affected by pfinet going bonkers.

The problem seems related to network load somehow: When doing a larger
upgrade, it's sure to break while downloading the packages. (I seldom
saw the problem occuring in other situations than apt downloading
packages...)

-antrik-




Problems with stability of networking / pfinet

2008-07-12 Thread Thomas Schwinge
Hello!

For a few weeks now, I seem to notice more and more problems w.r.t.
networking.  Olaf confirmed this.

$ ssh flubber
Warning: Permanently added '[flubber.bddebian.com]:2250' (RSA) to the list 
of known hosts.
Connection closed by 71.224.175.179

The connection is first established, but then closed.

$ ssh [EMAIL PROTECTED]
Warning: Permanently added '[flubber.bddebian.com]:2250' (RSA) to the list 
of known hosts.
[EMAIL PROTECTED]'s password: 
Connection closed by 71.224.175.179

The remote site is even asking for the password before then closing the
connection again.

Barry did reboot the machine just yesterday, and it has been working just
fine in the weeks/months before.


Any idea?  A problem with the TCP/IP stack?  But nothing has been changed
there, recently.  Michael also assured me that he's still using GCC 4.2
for building pfinet (using GCC 4.3 indeed yields a dysfunctional one).
Or a problem with the SSH packages?


Regards,
 Thomas


signature.asc
Description: Digital signature