Re: FreeBSD 7.0: sockets stuck in CLOSED state...

Ali Niknam Wed, 25 Jun 2008 14:48:16 -0700

Hi Robert,

Sounds like there's a bug somewhere. Before we start trying to track it

[...]

So, with that introduction, we're interested in resolving:

Quite comprehensive indeed; thank you for all that information. I wasnot aware that there was a decoupling between the various parts of theabstractions, but now that I think of it, it's more or less logical I guess.

The first is the easiest to resolve, as all we need to do is see whether

[...]

the file descriptor numbers being returned to see whether, perhaps, thatnumber only goes up over time, and gets really big.

My personal feeling is that it's a race condition; no idea why, but itfeels that way. Maybe because it's such a small number as compared tothe big amount of connections that takes place.

I do not leak file descriptors as far as I can see, I can send you theinformation you ask for (netstat, sockstat, fstat, etc.) offlist if youlike, or if you prefer, I can give you access to the machine, please letme know whichever you like.

I'd like to reiterate that at this moment i'm not sure at all if it's mycode, or kernel code. However I've seen, for my feeling, sufficientinformation to reasonably suspect that it _might_ be something outsidemy code :).

wedged-up state. It would be most helpful if you could actually shutdown to single-user mode, killing all user processes, then waiting tenminutes, and capturing the output of those above commands to files thatyou can then e-mail to me.

Because it's a live machine that would be very difficult. Maybe, if youreally really need it that way and we can't find another way I canannounce maintainance and do it in the middle of the night :).

Without accusing you of having buggy code, I should say that I thinkthere's a reasonable chance that what you're seeing is an interactionbetween an existing leak of resources in the application and the way thekernel state management has changed. The output from netstat pretty

Yes that was the first thing I though of as well, however, especiallyone of the two applications is so simple that I would be ashamed todeath if I still had a bug in there :). If it turns out that way:sssstttt ;).

precisely matches that what you'd expect: lots of TCP connections in theCLOSED state reflecting a series of connections built by an applicationbut then not properly discarded. Likewise, when the application iskilled, all of the connections go away -- most likely because the filedescriptors are all closed, allowing them to be garbage collected andconnection state freed. If it is this sort of bug, then most likelyyou're missing a call to close() in a work loop somewhere, and in someexceptional case, you fall out of the loop without calling close().


I will double check this once more, but honestly, i strongly doubt it...

Also one other thing that I've noticed, is that it's always the inputbuffer that has bytes left; never the output buffer...

Moreover, i've seen that close() reports EBADF, but due to the insaneamount of connections I can not say for certain that that's when theconnection goes into CLOSED state. The ip's do match, but it's verycommon for the same ip's to make numerous connections too.


Kind Regards,

Ali


--
  Transip BV | http://www.transip.nl/
  We never let you down.
_______________________________________________
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: FreeBSD 7.0: sockets stuck in CLOSED state...

Reply via email to