The status page I was looking at for these numbers had the labels and values mismatched. Now that they are lined up, there do not appear to be any malformed or invalid messages. Radius has been restarted, so the numbers are all fairly low right now. I will reply again later today when I have more numbers to share, but over the last 40 minutes I am seeing 4077 duplicates and 14566 drops for accounting, which still seems high to me.
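To put those two counters in perspective, here is a rough sketch that converts them to average rates. It assumes the counters started near zero at the restart and that the 40-minute window is accurate; the helper name `rate` is only illustrative and is not part of FreeRADIUS.

```python
def rate(count, window_minutes):
    """Return (per-minute, per-second) averages for a counter over a window."""
    if window_minutes <= 0:
        raise ValueError("window_minutes must be positive")
    per_minute = count / window_minutes
    return per_minute, per_minute / 60.0


# Figures quoted above: accounting counters read about 40 minutes after the restart.
WINDOW_MINUTES = 40
counters = {"duplicates": 4077, "drops": 14566}

for name, count in counters.items():
    per_min, per_sec = rate(count, WINDOW_MINUTES)
    print(f"{name}: {per_min:.1f}/min ({per_sec:.2f}/s)")
```

That works out to roughly 100 duplicates and 360 drops per minute on average, so the server is dropping several accounting requests every second even right after a restart.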
So that eliminates any malformed/invalid/zero response issues. As for the errors I see in the logs, I do not believe the cause is a slow database. The database stays responsive to other queries against the radius database while we are experiencing the timeouts and crashes. Do you have any suggestions on how we might troubleshoot that end of it?

Chris
________________________________________
From: freeradius-users-bounces+cmanigan=towerstream....@lists.freeradius.org [freeradius-users-bounces+cmanigan=towerstream....@lists.freeradius.org] on behalf of Alan DeKok [al...@deployingradius.com]
Sent: Monday, August 06, 2012 9:19 AM
To: FreeRadius users mailing list
Subject: Re: Duplicate Radius Accounting

Christopher Manigan wrote:
> In my logs I see many entries like the following:
>
> Info: WARNING: Child is hung for request 51651 in component <core> module <queue>.3
> Error: Dropping request (2049 is too many): from client myhost.mysite port 32869 - ID: 239

Something is blocking the server. This is usually a slow database.

> In the last ~10 hours, the status server reports the following for accounting:
>
> Responses 0
> Duplicate 954442
> Malformed 115045
> Invalid 564029

That is *terrible*. Zero responses? It indicates a catastrophic failure in the system.

And *malformed* packets? Something is sending NON RADIUS packets to the RADIUS port. Go fix that.

And "invalid" packets? Something is sending non-accounting packets to the accounting port.

> Dropped 0
> Unknown 0
>
> Radius will hang and start to time out and eventually die. It looks like the
> duplicate count gets extremely high very quickly. Could it be the NAS that
> are pointing to it? Or could it be my radius configs somehow causing this?
> I am not really sure how to prove it out or troubleshoot. I can increase the
> max requests but I don't think that is the right solution.

Your RADIUS system is horribly slow, and isn't finishing any requests. Go fix that. The default configuration *works*.

And your NAS is broken.
Something is very, very, wrong in your network. Find out what it is. Ensure that only RADIUS accounting packets go to the RADIUS accounting port.

Alan DeKok.
-
List info/subscribe/unsubscribe? See http://www.freeradius.org/list/users.html