[Server-devel] Ejabberd CPU/RAM Spike - Crashes

2009-12-19 Thread Devon Connolly
Here is another example after it has been running all night.

http://pastebin.com/m11537281

As you can see, these runaway beam processes vary greatly in there RAM
usage.  Also, they are always using 100% of the cpu.

I will try to clear the DB now and see what happens.



On Fri, Dec 18, 2009 at 12:51 PM, Martin Langhoff martin.langh...@gmail.com
 wrote:

 On Fri, Dec 18, 2009 at 1:37 PM, Devon Connolly dev...@gmail.com wrote:
  Anyway, back on topic...  Here is that script slightly modified running
 on
  a fresh boot.  I'm going to leave this looping and post the file to
  pastebin.  Here is an initial output after only like 10 minutes.  It will
  get more interesting over time.  I'll paste another later this afternoon.

 outrageous. beam should have only ~40MB in use, total.

 if you 'clear' the mnesia db as i suggested (keep a copy for
 forensics!), does it get better?



 m
 --
  martin.langh...@gmail.com
  mar...@laptop.org -- School Server Architect
  - ask interesting questions
  - don't get distracted with shiny stuff  - working code first
  - http://wiki.laptop.org/go/User:Martinlanghoff

___
Server-devel mailing list
Server-devel@lists.laptop.org
http://lists.laptop.org/listinfo/server-devel


Re: [Server-devel] Ejabberd CPU/RAM Spike - Crashes

2009-12-19 Thread Devon Connolly
Changing the domain, I still get the following error when it tries (and
fails to shutdown ejabberd).

___
Crash dump was written to: erl_crash.dump
Kernel pid terminated (application_controller)
({application_start_failure,kernel,{shutdown,{kernel,start,[normal,[]]}}})
{error_logger,{{2009,12,19},{12,19,16}},Protocol: ~p: register error:
~p~n,[inet_tcp,{{badmatch,{error,duplicate_name}},[{inet_tcp_dist,listen,1},{net_kernel,start_protos,4},{net_kernel,start_protos,3},{net_kernel,init_node,2},{net_kernel,init,1},{gen_server,init_it,6},{proc_lib,init_p_do_apply,3}]}]}
{error_logger,{{2009,12,19},{12,19,16}},crash_report,[[{pid,0.20.0},{registered_name,net_kernel},{error_info,{exit,{error,badarg},[{gen_server,init_it,6},{proc_lib,init_p_do_apply,3}]}},{initial_call,{net_kernel,init,['Argument__1']}},{ancestors,[net_sup,kernel_sup,0.8.0]},{messages,[]},{links,[#Port0.84,0.17.0]},{dictionary,[{longnames,false}]},{trap_exit,true},{status,running},{heap_size,610},{stack_size,23},{reductions,505}],[]]}
{error_logger,{{2009,12,19},{12,19,16}},supervisor_report,[{supervisor,{local,net_sup}},{errorContext,start_error},{reason,{'EXIT',nodistribution}},{offender,[{pid,undefined},{name,net_kernel},{mfa,{net_kernel,start_link,[[ejabberdctl,shortnames]]}},{restart_type,permanent},{shutdown,2000},{child_type,worker}]}]}
{error_logger,{{2009,12,19},{12,19,16}},supervisor_report,[{supervisor,{local,kernel_sup}},{errorContext,start_error},{reason,shutdown},{offender,[{pid,undefined},{name,net_sup},{mfa,{erl_distribution,start_link,[]}},{restart_type,permanent},{shutdown,infinity},{child_type,supervisor}]}]}
{error_logger,{{2009,12,19},{12,19,16}},crash_report,[[{pid,0.7.0},{registered_name,[]},{error_info,{exit,{shutdown,{kernel,start,[normal,[]]}},[{application_master,init,4},{proc_lib,init_p_do_apply,3}]}},{initial_call,{application_master,init,['Argument__1','Argument__2','Argument__3','Argument__4']}},{ancestors,[0.6.0]},{messages,[{'EXIT',0.8.0,normal}]},{links,[0.6.0,0.5.0]},{dictionary,[]},{trap_exit,true},{status,running},{heap_size,233},{stack_size,23},{reductions,123}],[]]}
{error_logger,{{2009,12,19},{12,19,16}},std_info,[{application,kernel},{exited,{shutdown,{kernel,start,[normal,[]]}}},{type,permanent}]}
{Kernel pid
terminated,application_controller,{application_start_failure,kernel,{shutdown,{kernel,start,[normal,[]]

Crash dump was written to: erl_crash.dump
Kernel pid terminated (application_controller)
({application_start_failure,kernel,{shutdown,{kernel,start,[normal,[]]}}})
__

Beam is still consuming 100% of the cpu after a few minutes.  I'm going to
leave that script running to see what it does over the next few hours.

I imagine I now have to re-register all XO's?



On Sat, Dec 19, 2009 at 10:59 AM, Devon Connolly dev...@gmail.com wrote:


 Here is another example after it has been running all night.

 http://pastebin.com/m11537281

 As you can see, these runaway beam processes vary greatly in there RAM
 usage.  Also, they are always using 100% of the cpu.

 I will try to clear the DB now and see what happens.



 On Fri, Dec 18, 2009 at 12:51 PM, Martin Langhoff 
 martin.langh...@gmail.com wrote:

 On Fri, Dec 18, 2009 at 1:37 PM, Devon Connolly dev...@gmail.com wrote:
  Anyway, back on topic...  Here is that script slightly modified running
 on
  a fresh boot.  I'm going to leave this looping and post the file to
  pastebin.  Here is an initial output after only like 10 minutes.  It
 will
  get more interesting over time.  I'll paste another later this
 afternoon.

 outrageous. beam should have only ~40MB in use, total.

 if you 'clear' the mnesia db as i suggested (keep a copy for
 forensics!), does it get better?



 m
 --
  martin.langh...@gmail.com
  mar...@laptop.org -- School Server Architect
  - ask interesting questions
  - don't get distracted with shiny stuff  - working code first
  - http://wiki.laptop.org/go/User:Martinlanghoff




___
Server-devel mailing list
Server-devel@lists.laptop.org
http://lists.laptop.org/listinfo/server-devel


Re: [Server-devel] XO 1.5 to XS-onXO1 behavior

2009-12-19 Thread Martin Langhoff
Hi Sameer,

that's expected, unfortunately. The XS-on-XO runs an active antenna
/ mesh gateway in the sense of 802.11s. As the current XO-1.5
drivers/firmware don't talk 802.11s, it won't work. In that sense, the
XO-1.5 is the same as any other 802.11a/b/g device.

There are two paths to address this

 - get the hostap / thinfirm driver+firmware combo working again. It
was last seen working on a 2.6.17 i think.

 - get 802.11s on XO-1.5 going. this is not trivial...

cheers,

m


On Sat, Dec 19, 2009 at 12:00 AM, Sameer Verma sve...@sfsu.edu wrote:
 I haven't had much time to look into XS-on-XO1.5 (will look into it as soon
 as Fall semester finishes) but I have a XS-on-XO1 running at home and when a
 XO1 associates with it, the XO1 gets a 172.18.xxx.xxx address (school mesh
 portal). When the XO1.5 associates with school-mesh-0 (I have to click on it
 in the Neighborhood view) the association happens, but the XO1.5 gets
 169.254.xxx.xxx address, and beyond that the network is unusable.

 Thought I'd pass that along.

 Sameer
 --
 Dr. Sameer Verma, Ph.D.
 Associate Professor, Information Systems
 Director, Center for Business Solutions
 San Francisco State University
 http://verma.sfsu.edu/
 http://cbs.sfsu.edu/
 http://is.sfsu.edu/

 ___
 Server-devel mailing list
 Server-devel@lists.laptop.org
 http://lists.laptop.org/listinfo/server-devel





-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Server-devel mailing list
Server-devel@lists.laptop.org
http://lists.laptop.org/listinfo/server-devel


Re: [Server-devel] Ejabberd CPU/RAM Spike - Crashes

2009-12-19 Thread Martin Langhoff
On Sat, Dec 19, 2009 at 1:31 PM, Devon Connolly dev...@gmail.com wrote:
 Beam is still consuming 100% of the cpu after a few minutes.  I'm going to
 leave that script running to see what it does over the next few hours.

That's really abnormal.

 - Is there any disk anomaly? (Reboot forcing a fsck?)

 - Is there any problem in the binaries? If you run rpm with the
'verify' options, it'll check that no binaries have been corrupted
on-disk... It's normal to see some config files changed, but no
binaries should be different from the rpms.

 I imagine I now have to re-register all XO's?

Nope. The DB gets rebuilt automagically for you, 100%, on XS-0.6 .

cheers,



m


-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Server-devel mailing list
Server-devel@lists.laptop.org
http://lists.laptop.org/listinfo/server-devel