Re: over 30 radiusd processes - more information

2009-10-18 Thread Alan Buxey
Hi,

> 1) Could bad accounting packets cause the radiusd process to EXIT?
> 2) Could bad accounting packets result in hung child processes (as seen 
> in the gdb output after the radius log file)?

I'd say yes. We make sure that bad packets don't hit our accounting servers, e.g.:

accounting {
        # call the SQL accounting function only if session-time is not zero
        if (Acct-Session-Time != 0) {
                sql
        }
        else {
                ok
        }
}

alan
-
List info/subscribe/unsubscribe? See http://www.freeradius.org/list/users.html


Re: over 30 radiusd processes - more information

2009-10-18 Thread Ivan Kalik
> I think you may be 'jumping the gun' a wee bit.
>
> The system currently has over 13,000 active sessions.
>
> There were some odd accounting packets, but the vast majority were valid.
> These could be configuration errors or hack attempts (investigating).

Something broke at 2am. Before that you had a single malformed packet but
after 2am there were dozens.

> Questions:
>
> 1) Could bad accounting packets cause the radiusd process to EXIT?

I don't think so, but I can't say why that happened.

> 2) Could bad accounting packets result in hung child processes (as seen in
> the gdb output after the radius log file)?

You should look into the detail file and see what is wrong with the
packets that were stuck (or did the queries fail for some reason), like:

WARNING: Unresponsive child for request 165616, in module sql component accounting
etc.

Ivan Kalik
Kalik Informatika ISP
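
A small script can make the "something broke at 2am" pattern easier to see by counting the relevant log messages per minute. The following is a minimal sketch, assuming each radiusd.log entry sits on one unwrapped line in the layout quoted in this thread; the function name and the sample user names are hypothetical.

```python
import re
from collections import Counter

# Assumed entry layout, taken from the radiusd.log excerpts in this thread:
#   "Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [...]"
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3} \w{3} +\d+ \d{2}:\d{2}):\d{2} \d{4} : "
    r"(?P<level>\w+): (?P<msg>.*)$"
)


def summarize(lines):
    """Count zero-length stop packets and unresponsive-child warnings per minute."""
    zero_stops = Counter()
    unresponsive = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # wrapped continuation lines and anything unparseable
        minute, msg = m.group("stamp"), m.group("msg")
        if "stop packet with zero session length" in msg:
            zero_stops[minute] += 1
        elif "Unresponsive child" in msg:
            unresponsive[minute] += 1
    return zero_stops, unresponsive


# Hypothetical sample entries in the same layout (user names are made up).
sample = [
    "Fri Oct 16 17:29:12 2009 : Error: [sql] stop packet with zero session length. [user 'u1', nas '192.168.1.101']",
    "Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [user 'u2', nas '192.168.1.101']",
    "Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'u3', nas '192.168.1.101']",
    "Sat Oct 17 02:01:25 2009 : Error: WARNING: Unresponsive child for request 165616, in module sql component accounting",
]
zero_stops, unresponsive = summarize(sample)
```

Comparing the two counters minute by minute shows whether the unresponsive-child warnings start right after a burst of malformed stop packets or independently of it.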



Re: over 30 radiusd processes - more information

2009-10-18 Thread Craig Campbell

I think you may be 'jumping the gun' a wee bit.

The system currently has over 13,000 active sessions.

There were some odd accounting packets, but the vast majority were valid. 
These could be configuration errors or hack attempts (investigating).


Questions:

1) Could bad accounting packets cause the radiusd process to EXIT?
2) Could bad accounting packets result in hung child processes (as seen in 
the gdb output after the radius log file)?


Thanks,
-craig



Re: over 30 radiusd processes - more information

2009-10-18 Thread Alan DeKok
Craig Campbell wrote:
> I have attached the radius.log file below, as well as gdb sessions for
> the hung processes showing the results of the gdb 'bt' and 'list' commands.

  The log is interesting.

> Sat Oct 17 02:01:25 2009 : Error: WARNING: Unresponsive child for
> request 165616, in module sql component accounting

  It looks like the SQL module is being blocked somehow.  Why?

> (gdb) bt
> #0  0x003acf4dee6e in __lll_lock_wait_private () from /lib64/libc.so.6
> #1  0x003acf48c75d in _L_lock_1685 () from /lib64/libc.so.6
> #2  0x003acf48c4a7 in __tz_convert () from /lib64/libc.so.6
> #3  0x2b794fa6e39f in vp_prints_value (out=0x41ced6c0 "",
> outlen=1008, vp=,
>delimitst=) at print.c:267

  Hmm...  OK.  It's a pthread lock / fork issue.

  I think the solution is to print the "env" variables *before* forking.
 I'll see what I can do.

  Alan DeKok.
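
The backtrace shows a child blocked in __tz_convert, waiting on a libc lock. In a multi-threaded process, fork() copies only the calling thread, so a lock that some other thread held at that instant stays locked forever in the child. The ordering proposed above (format everything first, then fork) can be sketched as follows. This is only an illustration in Python, not the FreeRADIUS code; the function name and the attribute-to-variable naming are assumptions.

```python
import os


def run_with_prepared_env(argv, attributes):
    """Fork and exec argv, passing attributes as environment variables.

    Every string is formatted in the parent *before* fork(). The child only
    calls execve() (or _exit() on failure), so it never has to take a lock
    that another thread of the parent might have held at fork time.
    """
    # Hypothetical naming rule for illustration: upper-case, '-' becomes '_'.
    env = {name.upper().replace("-", "_"): str(value)
           for name, value in attributes.items()}
    env.setdefault("PATH", "/usr/bin:/bin")

    pid = os.fork()
    if pid == 0:
        try:
            os.execve(argv[0], argv, env)
        finally:
            os._exit(127)  # reached only if execve() failed
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status) if os.WIFEXITED(status) else -1
```

For example, `run_with_prepared_env(["/bin/sh", "-c", 'test "$ACCT_SESSION_TIME" = 0'], {"Acct-Session-Time": 0})` returns the exit status of the shell test. The point of the sketch is the ordering only: nothing that might need a lock happens between fork() and exec.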


Re: over 30 radiusd processes - more information

2009-10-18 Thread Ivan Kalik
> I've continued to try and investigate the root cause of this, and the last
> run behaved slightly differently - the parent process seems to have
> terminated, and there are more messages in the radius log.
>
> There were four (4) hung processes left over.
>
> I have attached the radius.log file below, as well as gdb sessions for the
> hung processes showing the results of the gdb 'bt' and 'list' commands.
>
> It looks like Alan's initial idea that the hung processes are a result of
> running the acctstop.sh process is correct.  I've tried looking at the code
> to see if anything 'leapt out' at me, but the logic is quite clever, and
> dissecting it from the middle is quite a challenge.
>
> I am hoping that the gdb output might prove helpful to someone already
> familiar with the logic flow.
>
> It seems I can reproduce this issue within 24 hours, so if there is any
> other information I could gather, please let me know.
>
> Thanks,
> -craig
>
> radiusd.log
>
> Fri Oct 16 11:15:56 2009 : Info: Exiting normally.
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql (sql): Driver rlm_sql_mysql (module rlm_sql_mysql) loaded and linked
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql (sql): Attempting to connect to radi...@localhost:/radius
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #0
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #1
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #2
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #3
> Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #4
> Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server inner-tunnel
> Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server copy-acct-to-home-server
> Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server copy-acct-to-radius-c
> Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server
> Fri Oct 16 11:16:22 2009 : Info: Ready to process requests.
> Fri Oct 16 17:29:12 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm4tl', nas '192.168.1.101']
> Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
> Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
etc.

/*
 * If stop but zero session length AND no previous
 * session found, drop it as in invalid packet
 * This is to fix CISCO's aaa from filling our
 * table with bogus crap
 */

Your NAS is broken. Fix it so it sends proper accounting packets.

Ivan Kalik
Kalik Informatika ISP



Re: over 30 radiusd processes - more information

2009-10-18 Thread Craig Campbell
I've continued to try and investigate the root cause of this, and the last
run behaved slightly differently - the parent process seems to have
terminated, and there are more messages in the radius log.

There were four (4) hung processes left over.

I have attached the radius.log file below, as well as gdb sessions for the
hung processes showing the results of the gdb 'bt' and 'list' commands.

It looks like Alan's initial idea that the hung processes are a result of
running the acctstop.sh process is correct.  I've tried looking at the code
to see if anything 'leapt out' at me, but the logic is quite clever, and
dissecting it from the middle is quite a challenge.

I am hoping that the gdb output might prove helpful to someone already
familiar with the logic flow.

It seems I can reproduce this issue within 24 hours, so if there is any
other information I could gather, please let me know.


Thanks,
-craig

radiusd.log

Fri Oct 16 11:15:56 2009 : Info: Exiting normally.
Fri Oct 16 11:16:22 2009 : Info: rlm_sql (sql): Driver rlm_sql_mysql (module rlm_sql_mysql) loaded and linked
Fri Oct 16 11:16:22 2009 : Info: rlm_sql (sql): Attempting to connect to radi...@localhost:/radius
Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #0
Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #1
Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #2
Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #3
Fri Oct 16 11:16:22 2009 : Info: rlm_sql_mysql: Starting connect to MySQL server for #4
Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server inner-tunnel
Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server copy-acct-to-home-server
Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server copy-acct-to-radius-c
Fri Oct 16 11:16:22 2009 : Info: Loaded virtual server
Fri Oct 16 11:16:22 2009 : Info: Ready to process requests.
Fri Oct 16 17:29:12 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm4tl', nas '192.168.1.101']
Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:18 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm3', nas '192.168.1.101']
Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'us...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:20 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'us...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm3', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm4', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'us...@realm3', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'username', nas '192.168.1.101']
Sat Oct 17 02:00:21 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm2', nas '192.168.1.101']
Sat Oct 17 02:00:22 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:22 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:22 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:22 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm3', nas '192.168.1.101']
Sat Oct 17 02:00:22 2009 : Error: [sql] stop packet with zero session length. [user 'use...@realm1', nas '192.168.1.101']
Sat Oct 17 02:00:22 2009 : Error: [sql] stop packet with

Re: over 30 radiusd processes

2009-10-14 Thread Alan DeKok
Craig Campbell wrote:
> While the ps command doesn't show the time of these extra processes
> (over 24 hours old), in a previous event, I determined they seemed to
> coincide with a significant increase in radius traffic (from ~100/min to
> over 1000/min).  I believe the NAS forces all users to log off (and they
> automatically log back in) in the middle of the night, and I suspect
> this is related.

  Well, the server is *supposed* to clean up child processes.

> Alan suggested this might be related to shell scripts being run - as
> happens when acct STOP records are received.

  Yes.  If you can replace the shell script with a Perl script and use
rlm_perl, the issue will go away.

> I'm trying to come up with a strategy to narrow down what might be
> happening.

  The server isn't cleaning up child processes.  For some reason, it's
lost track of them.

> Under what circumstances does radiusd fork?  Also, I THOUGHT I'd heard
> somewhere that threads and fork did NOT interact well.

  It forks when you exec a program.

  Alan DeKok.
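
For readers unfamiliar with what "cleaning up child processes" involves: after a fork and exec, the parent has to collect each exited child with waitpid(), otherwise the child remains as a zombie. The following is a minimal sketch of that pattern in Python, not the radiusd implementation; the helper names `spawn` and `reap` are hypothetical.

```python
import os
import time


def spawn(argv):
    """Fork and exec argv without waiting; returns the child's PID."""
    pid = os.fork()
    if pid == 0:
        try:
            os.execv(argv[0], argv)
        finally:
            os._exit(127)  # reached only if execv() failed
    return pid


def reap(pending):
    """Collect every child in `pending` that has already exited.

    Uses waitpid(pid, WNOHANG) so the caller never blocks. Returns a
    {pid: wait_status} dict and removes the reaped PIDs from `pending`.
    """
    finished = {}
    for pid in list(pending):
        done, status = os.waitpid(pid, os.WNOHANG)
        if done == pid:
            finished[pid] = status
            pending.discard(pid)
    return finished


# Start a few short-lived children, then poll until all have been reaped.
pending = {spawn(["/bin/sh", "-c", "exit 0"]) for _ in range(5)}
deadline = time.monotonic() + 10
while pending and time.monotonic() < deadline:
    reap(pending)
    time.sleep(0.01)
```

Note that reaping only helps with children that have actually exited. A child that is still alive but blocked (for instance on a lock) is not a zombie, and waitpid() with WNOHANG will simply keep reporting it as running.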


Re: over 30 radiusd processes

2009-10-14 Thread Craig Campbell

Nothing in the log except the normal startup.

This server is only receiving accounting records currently.

While the ps command doesn't show the time of these extra processes (over 24
hours old), in a previous event, I determined they seemed to coincide with a
significant increase in radius traffic (from ~100/min to over 1000/min).  I
believe the NAS forces all users to log off (and they automatically log back
in) in the middle of the night, and I suspect this is related.

Also, the system provided is a Virtual Machine, which normally seems to have
plenty of resources available.  The mysql database is local to the same
system, and is only used to keep accounting records of currently logged in
users (radutmp replacement - a cron job flushes out completed records to
prevent database growth).

Alan suggested this might be related to shell scripts being run - as happens
when acct STOP records are received.

I'm trying to come up with a strategy to narrow down what might be
happening.

Under what circumstances does radiusd fork?  Also, I THOUGHT I'd heard
somewhere that threads and fork did NOT interact well.

I am also considering upgrading to 2.1.7 (but I just finished configuring
2.1.6 :(  2.1.7 wasn't released when I started this..)


Thoughts?

Thanks (everyone),
-craig




Re: over 30 radiusd processes

2009-10-14 Thread Marinko Tarlac

I had the same problem when one of our databases was terribly slow...

Is there anything in radius.log?



Re: over 30 radiusd processes

2009-10-14 Thread Alan DeKok
Craig Campbell wrote:
> Freeradius 2.1.6
> 
> Running on Redhat AS5 Update 3
> with mysql-devel rpms added to enable mysql support.
> 
> Compiled with no options specified. (./configure ; make clean ; make ;
> make install)

  I don't know.. all I know is it cleans up processes when I run them,
and no one else seems to be running into this.

  Alan DeKok.


Re: over 30 radiusd processes

2009-10-13 Thread Craig Campbell

Freeradius 2.1.6

Running on Redhat AS5 Update 3
with mysql-devel rpms added to enable mysql support.

Compiled with no options specified. (./configure ; make clean ; make ; make 
install)


Thanks,
-craig

Re: over 30 radiusd processes

2009-10-12 Thread Alan DeKok
Craig Campbell wrote:
> Up to 65 processes now
> 
> Any ideas how to stop this from happening?

  Which version are you running?

  Alan DeKok.


Re: over 30 radiusd processes

2009-10-12 Thread Craig Campbell

Up to 65 processes now

Any ideas how to stop this from happening?

Anyone?

Thanks,
-craig


Re: over 30 radiusd processes

2009-10-09 Thread Alan DeKok
Craig Campbell wrote:
> Yes, two(2) binaries and one (1) shell script are called via exec as
> follows from the file,

 Could you NOT CC me on messages to the list?  I subscribe, and I read
the messages.

  And fix your mailer.  I saw a *large* number of duplicates.

>- user (an authentication binary program - Exec-Program-Wait
> = "/usr/local/sbin/auth -- %{User-Name} %{User-Password}
> %{%{Called-Station-Id}:-Missing} %{%{NAS-IP-Address}:-Missing}
> %{%{Calling-Station-Id}:-Missing} %{%{NAS-Port-Type}:-Missing}
> %{Vendor-Specific}" ,)
> 
>-acct_user (shell script - Exec-Program =
> "%{exec:/usr/local/sbin/acctstop.sh}", )
> and
>-attr_rewrite module (a hex translation binary -  replacewith =
> "%{exec:/usr/local/sbin/hexconvert -lX %{User-Name} }")
> 
> Is this bad?
> Is there a better alternative?
> 
> 
> Thanks so much!
> -craig


Re: over 30 radiusd processes

2009-10-09 Thread Craig Campbell
Yes, two (2) binaries and one (1) shell script are called via exec as follows
from the file,

   - user (an authentication binary program):
     Exec-Program-Wait = "/usr/local/sbin/auth -- %{User-Name} %{User-Password} %{%{Called-Station-Id}:-Missing} %{%{NAS-IP-Address}:-Missing} %{%{Calling-Station-Id}:-Missing} %{%{NAS-Port-Type}:-Missing} %{Vendor-Specific}",

   - acct_user (a shell script):
     Exec-Program = "%{exec:/usr/local/sbin/acctstop.sh}",

   - attr_rewrite module (a hex translation binary):
     replacewith = "%{exec:/usr/local/sbin/hexconvert -lX %{User-Name} }"


Is this bad?
Is there a better alternative?


Thanks so much!
-craig





Re: over 30 radiusd processes

2009-10-09 Thread Alan DeKok
Craig Campbell wrote:
> radius-a seems to be getting the bulk of the radius records.  Normally,
> it has a single process.
> Last night it spawned a bunch of children that seem to be loitering... 

  Are you forking shell scripts via "exec"?

> radius-b and radius-c don't have more than a single radiusd process.
>  
> Any idea what is going on?  Why all the children?  Do I need to be
> concerned?  Is this normal?

  It's not normal.  They're likely zombies that need to go away.  The
server normally cleans up any zombie children, but...

  Alan DeKok.


Re: over 30 radiusd processes

2009-10-08 Thread Craig Campbell
Update:

strace suggests all the child processes are doing the same thing:

[r...@radius-a raddb]# strace -p 30893
Process 30893 attached - interrupt to quit
futex(0x3acf752554, FUTEX_WAIT_PRIVATE, 2, NULL 

Hope this helps,
-craig
  - Original Message - 
  From: Craig Campbell 
  To: FreeRadius users mailing list 
  Sent: Thursday, October 08, 2009 12:07 PM
  Subject: over 30 radiusd processes


  I am running 2.1.6 on Red Hat Linux (Red Hat Enterprise Linux Server release
5.3 (Tikanga)).

  This server relays all records to the identical servers radius-b and radius-c.
  Similarly, radius-b relays its records back to radius-a (except those from
radius-a).
  radius-c is just for testing.

  All relaying appears to be working correctly.

  These servers are currently receiving accounting records only.

  radius-a seems to be getting the bulk of the radius records.  Normally, it 
has a single process.
  Last night it spawned a bunch of children that seem to be loitering...  

  radius-b and radius-c don't have more than a single radiusd process.

  Any idea what is going on?  Why all the children?  Do I need to be concerned? 
 Is this normal?

  From the detail log, it seems that the message rate increased to about
2000-2400/minute at that time for about 3 minutes, then dropped to <1000/min.
[r...@radius-a radius]# ps -aef | grep radiusd
radiusd   5426 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   5738 21400  0 03:29 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8239 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8240 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8241 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8242 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8243 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8244 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   9029 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   9104 21400  0 03:29 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  14154 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  14426 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  15039 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  15040 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  16082 21400  0 03:36 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  17295 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
root     19242 20229  0 11:42 pts/0    00:00:00 grep radiusd
radiusd  19974 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  20670 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  20673 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  20674 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  20675 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  20679 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  20680 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  21207 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  21208 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  21209 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  21300 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  21301 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  21400     1  0 Oct07 ?        00:02:40 /usr/local/sbin/radiusd
radiusd  26543 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  26683 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  28411 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  29065 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  30648 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  30649 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  30893 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd

  Thanks,
  -craig





  __ Information from ESET Smart Security, version of virus signature 
database 4490 (20091008) __

  The message was checked by ESET Smart Security.

  http://www.eset.com


--


  -
  List info/subscribe/unsubscribe? See http://www.freeradius.org/list/users.html



over 30 radiusd processes

2009-10-08 Thread Craig Campbell
I am running 2.1.6 on Red Hat Linux (Red Hat Enterprise Linux Server release 
5.3 (Tikanga)).

This server (radius-a) relays all records to two identical servers, radius-b 
and radius-c.
Similarly, radius-b relays its records back to radius-a (except those that 
came from radius-a).
radius-c is just for testing.

All relaying appears to be working correctly.

These servers are currently receiving accounting records only.

radius-a seems to be getting the bulk of the radius records.  Normally, it has 
a single process.
Last night it spawned a bunch of children that seem to be loitering...  

radius-b and radius-c don't have more than a single radiusd process.

Any idea what is going on?  Why all the children?  Do I need to be concerned?  
Is this normal?

From the detail log, it seems that the message rate increased to about 
2000-2400/minute at that time for about 3 minutes, then dropped to <1000/min.
  [r...@radius-a radius]# ps -aef | grep radiusd
  radiusd   5426 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   5738 21400  0 03:29 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   8239 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   8240 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   8241 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   8242 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   8243 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   8244 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   9029 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd   9104 21400  0 03:29 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  14154 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  14426 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  15039 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  15040 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  16082 21400  0 03:36 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  17295 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  root     19242 20229  0 11:42 pts/0    00:00:00 grep radiusd
  radiusd  19974 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  20670 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  20673 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  20674 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  20675 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  20679 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  20680 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  21207 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  21208 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  21209 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  21300 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  21301 21400  0 03:33 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  21400     1  0 Oct07 ?        00:02:40 /usr/local/sbin/radiusd
  radiusd  26543 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  26683 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  28411 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  29065 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  30648 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  30649 21400  0 03:34 ?        00:00:00 /usr/local/sbin/radiusd
  radiusd  30893 21400  0 03:31 ?        00:00:00 /usr/local/sbin/radiusd
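
For anyone wanting to summarize a listing like this, here is a minimal Python sketch that groups the children of one parent PID by start time. It assumes standard whitespace-separated `ps -ef` columns (UID PID PPID C STIME TTY TIME CMD). The function name `summarize_ps` and the shortened `SAMPLE_PS` text are illustrative only and not part of FreeRADIUS or ps.

```python
from collections import Counter

# A few lines in the shape of the listing above (shortened sample, for illustration).
SAMPLE_PS = """\
radiusd   5426 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   5738 21400  0 03:29 ?        00:00:00 /usr/local/sbin/radiusd
radiusd   8239 21400  0 03:32 ?        00:00:00 /usr/local/sbin/radiusd
radiusd  16082 21400  0 03:36 ?        00:00:00 /usr/local/sbin/radiusd
root     19242 20229  0 11:42 pts/0    00:00:00 grep radiusd
radiusd  21400     1  0 Oct07 ?        00:02:40 /usr/local/sbin/radiusd
"""


def summarize_ps(ps_text, parent_pid, command="radiusd"):
    """Count processes whose PPID is parent_pid, keyed by the STIME column."""
    by_start = Counter()
    for line in ps_text.splitlines():
        # Split into at most 8 fields so the command keeps its arguments.
        fields = line.split(None, 7)
        if len(fields) < 8 or not fields[1].isdigit():
            continue  # skip headers and malformed lines
        _uid, _pid, ppid, _c, stime, _tty, _time, cmd = fields
        if ppid == str(parent_pid) and command in cmd:
            by_start[stime] += 1
    return by_start


if __name__ == "__main__":
    counts = summarize_ps(SAMPLE_PS, 21400)
    for stime in sorted(counts):
        print(stime, counts[stime])
    print("children:", sum(counts.values()))
```

Matching on the PPID column excludes both the parent itself (PPID 1) and the `grep radiusd` line, so only the children are counted.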

Thanks,
-craig





__ Information from ESET Smart Security, version of virus signature 
database 4490 (20091008) __

The message was checked by ESET Smart Security.

http://www.eset.com

-
List info/subscribe/unsubscribe? See http://www.freeradius.org/list/users.html