Re: [ath5k-devel] How to profile this strange effect under high load?

2010-10-14 Thread Bruno Randolf
On Wed October 13 2010 02:14:19 Qasim Javed wrote:
 How about using trace-cmd and kernelshark with function graph tracer
 to see what is getting called so often?
 
 -Qasim
 
 On Tue, Oct 12, 2010 at 5:33 AM, Bruno Randolf b...@einfach.org wrote:
  Hello!
  
  I am trying to debug a strange effect, I am seeing on soekris net48xx
  boards with two ath5k interfaces:
  
   * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a
  lot of traffic routed thru it
   * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively
  sending (no IP address).
  
  I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am
  sure all traffic is routed thru wlan0 (phy0). The load on the SENDING
  box gets very high:
  
  Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached
  CPU:   0% usr   0% sys   0% nic   0% idle   0% io   5% irq  93% sirq
  Load average: 1.73 1.01 2.23 2/38 3128
   PID  PPID USER STAT   VSZ %MEM %CPU COMMAND
   2772 2 root RW   0   0%  71% [phy1]
   2757 2 root SW   0   0%  11% [phy0]
   3128   436 root R  988   1%   8% top
 3 2 root SW   0   0%   5% [ksoftirqd/0]
 5 2 root SW   0   0%   3% [events/0]
56 2 root SW   0   0%   1% [bdi-default]
   1383 1 root S 1824   1%   0% /usr/bin/oprofiled
  --session-dir=/r
  
  ...and the interesting thing is that [phy1] - the INACTIVE interface -
  consumes much more CPU time than [phy0] which is actually transmitting.
  
  I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but
  it does not change much.
  
  I have dumped frames on phy1, and all it sees are a few beacon frames.
  
  I have tried wlan1 in managed mode - same effect. The effect is slightly
  less in AP mode, though...
  
  I tried oprofile, but it shows mostly tx related things.
  
  So my question is: How can I find out, what [phy1] is so busy doing? Any
  ideas how to profile this?

Thanks for your suggestions.

I think, I was able to get a hold of this using the Ftrace function tracer. 
What is happening, is that received beacons get queued up by mac80211 
(ieee80211_queue_work) and then the work queue (ieee80211_rx_mgmt_beacon) has 
many beacons to process. If we are running late, the funny thing is that most 
of these beacons in the queue will be irrelevant because they are too old. 
Obviously there is room for optimization here.

Also another effect which I think adds to the high CPU utilization of the 
[phy1] process is that functions running in interrupt context are run at the 
expense of the process which has been interrupted. So if [phy1] work has been 
interrupted by a TX completed interrupt of phy0 it will be accounted to 
[phy1].

Here is some output of ftrace, which shows my first point:

   phy1-1089  [000]   594.942525: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   594.986019: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy1-1089  [000]   595.030190: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy1-1089  [000]   595.088663: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy1-1089  [000]   595.131842: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
top-1284  [000]   595.195661: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
top-1284  [000]   595.195812: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
top-1284  [000]   595.241381: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
top-1284  [000]   595.307445: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
top-1284  [000]   595.328727: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy0-1075  [000]   595.328935: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy0-1075  [000]   595.329223: ieee80211_queue_work -
ieee80211_ibss_timer
   phy0-1075  [000]   595.395654: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy0-1075  [000]   595.395784: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy0-1075  [000]   595.440385: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
ksoftirqd/0-3 [000]   595.544013: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
ksoftirqd/0-3 [000]   595.614735: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
ksoftirqd/0-3 [000]   595.633631: ieee80211_queue_work -
ieee80211_ibss_rx_mgmt
   phy1-1089  [000]   595.642036: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642122: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642173: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642280: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642334: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642385: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642437: ieee80211_rx_mgmt_beacon -
ieee80211_ibss_work
   phy1-1089  [000]   595.642491: ieee80211_rx_mgmt_beacon -

[ath5k-devel] How to profile this strange effect under high load?

2010-10-12 Thread Bruno Randolf
Hello!

I am trying to debug a strange effect, I am seeing on soekris net48xx boards 
with two ath5k interfaces:

 * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of 
traffic routed thru it
 * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively 
sending (no IP address).

I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all 
traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very 
high:

Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached
CPU:   0% usr   0% sys   0% nic   0% idle   0% io   5% irq  93% sirq
Load average: 1.73 1.01 2.23 2/38 3128
  PID  PPID USER STAT   VSZ %MEM %CPU COMMAND
 2772 2 root RW   0   0%  71% [phy1]
 2757 2 root SW   0   0%  11% [phy0]
 3128   436 root R  988   1%   8% top 
3 2 root SW   0   0%   5% [ksoftirqd/0]
5 2 root SW   0   0%   3% [events/0]
   56 2 root SW   0   0%   1% [bdi-default]
 1383 1 root S 1824   1%   0% /usr/bin/oprofiled --session-dir=/r

...and the interesting thing is that [phy1] - the INACTIVE interface - 
consumes much more CPU time than [phy0] which is actually transmitting.

I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it 
does not change much. 

I have dumped frames on phy1, and all it sees are a few beacon frames.

I have tried wlan1 in managed mode - same effect. The effect is slightly less 
in AP mode, though...

I tried oprofile, but it shows mostly tx related things.

So my question is: How can I find out, what [phy1] is so busy doing? Any ideas 
how to profile this?

Thanks in advance,
bruno
___
ath5k-devel mailing list
ath5k-devel@lists.ath5k.org
https://lists.ath5k.org/mailman/listinfo/ath5k-devel


Re: [ath5k-devel] How to profile this strange effect under high load?

2010-10-12 Thread Bob Copeland
On Tue, Oct 12, 2010 at 6:33 AM, Bruno Randolf b...@einfach.org wrote:
 So my question is: How can I find out, what [phy1] is so busy doing? Any ideas
 how to profile this?

/proc/interrupts shows lots of interrupts on phy1 line?  Is it shared irq?

What options did you try with oprofile?

You can try perf -- as long as you limit the functions you profile it should
have about the same overhead as oprofile.  But then you should get about the
same results with either; I just find perf to be a little better these days.

One thing I know of is that all the linear interpolation we do for setting
the power curves is pretty heavy cpu-wise and we do it on every reset.  There's
some low-hanging fruit there like precomputing the reciprocal of some
scaling factors instead of some divisions...

-- 
Bob Copeland %% www.bobcopeland.com
___
ath5k-devel mailing list
ath5k-devel@lists.ath5k.org
https://lists.ath5k.org/mailman/listinfo/ath5k-devel


Re: [ath5k-devel] How to profile this strange effect under high load?

2010-10-12 Thread Qasim Javed
How about using trace-cmd and kernelshark with function graph tracer
to see what is getting called so often?

-Qasim

On Tue, Oct 12, 2010 at 5:33 AM, Bruno Randolf b...@einfach.org wrote:
 Hello!

 I am trying to debug a strange effect, I am seeing on soekris net48xx boards
 with two ath5k interfaces:

  * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of
 traffic routed thru it
  * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively
 sending (no IP address).

 I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all
 traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very
 high:

 Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached
 CPU:   0% usr   0% sys   0% nic   0% idle   0% io   5% irq  93% sirq
 Load average: 1.73 1.01 2.23 2/38 3128
  PID  PPID USER     STAT   VSZ %MEM %CPU COMMAND
  2772     2 root     RW       0   0%  71% [phy1]
  2757     2 root     SW       0   0%  11% [phy0]
  3128   436 root     R      988   1%   8% top
    3     2 root     SW       0   0%   5% [ksoftirqd/0]
    5     2 root     SW       0   0%   3% [events/0]
   56     2 root     SW       0   0%   1% [bdi-default]
  1383     1 root     S     1824   1%   0% /usr/bin/oprofiled --session-dir=/r

 ...and the interesting thing is that [phy1] - the INACTIVE interface -
 consumes much more CPU time than [phy0] which is actually transmitting.

 I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it
 does not change much.

 I have dumped frames on phy1, and all it sees are a few beacon frames.

 I have tried wlan1 in managed mode - same effect. The effect is slightly less
 in AP mode, though...

 I tried oprofile, but it shows mostly tx related things.

 So my question is: How can I find out, what [phy1] is so busy doing? Any ideas
 how to profile this?

 Thanks in advance,
 bruno
 ___
 ath5k-devel mailing list
 ath5k-devel@lists.ath5k.org
 https://lists.ath5k.org/mailman/listinfo/ath5k-devel

___
ath5k-devel mailing list
ath5k-devel@lists.ath5k.org
https://lists.ath5k.org/mailman/listinfo/ath5k-devel


Re: [ath5k-devel] How to profile this strange effect under high load?

2010-10-12 Thread Nick Kossifidis
2010/10/12 Bruno Randolf b...@einfach.org:
 Hello!

 I am trying to debug a strange effect, I am seeing on soekris net48xx boards
 with two ath5k interfaces:

  * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of
 traffic routed thru it
  * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively
 sending (no IP address).

 I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all
 traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very
 high:

 Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached
 CPU:   0% usr   0% sys   0% nic   0% idle   0% io   5% irq  93% sirq
 Load average: 1.73 1.01 2.23 2/38 3128
  PID  PPID USER     STAT   VSZ %MEM %CPU COMMAND
  2772     2 root     RW       0   0%  71% [phy1]
  2757     2 root     SW       0   0%  11% [phy0]
  3128   436 root     R      988   1%   8% top
    3     2 root     SW       0   0%   5% [ksoftirqd/0]
    5     2 root     SW       0   0%   3% [events/0]
   56     2 root     SW       0   0%   1% [bdi-default]
  1383     1 root     S     1824   1%   0% /usr/bin/oprofiled --session-dir=/r

 ...and the interesting thing is that [phy1] - the INACTIVE interface -
 consumes much more CPU time than [phy0] which is actually transmitting.

 I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it
 does not change much.

 I have dumped frames on phy1, and all it sees are a few beacon frames.

 I have tried wlan1 in managed mode - same effect. The effect is slightly less
 in AP mode, though...

 I tried oprofile, but it shows mostly tx related things.

 So my question is: How can I find out, what [phy1] is so busy doing? Any ideas
 how to profile this?

 Thanks in advance,
 bruno


When ph1 is also connected consumes more or less CPU ? What happens If
you just give it an IP address ?


-- 
GPG ID: 0xD21DB2DB
As you read this post global entropy rises. Have Fun ;-)
Nick
___
ath5k-devel mailing list
ath5k-devel@lists.ath5k.org
https://lists.ath5k.org/mailman/listinfo/ath5k-devel