Re: [ath5k-devel] How to profile this strange effect under high load?
On Wed October 13 2010 02:14:19 Qasim Javed wrote: How about using trace-cmd and kernelshark with function graph tracer to see what is getting called so often? -Qasim On Tue, Oct 12, 2010 at 5:33 AM, Bruno Randolf b...@einfach.org wrote: Hello! I am trying to debug a strange effect, I am seeing on soekris net48xx boards with two ath5k interfaces: * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of traffic routed thru it * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively sending (no IP address). I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very high: Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached CPU: 0% usr 0% sys 0% nic 0% idle 0% io 5% irq 93% sirq Load average: 1.73 1.01 2.23 2/38 3128 PID PPID USER STAT VSZ %MEM %CPU COMMAND 2772 2 root RW 0 0% 71% [phy1] 2757 2 root SW 0 0% 11% [phy0] 3128 436 root R 988 1% 8% top 3 2 root SW 0 0% 5% [ksoftirqd/0] 5 2 root SW 0 0% 3% [events/0] 56 2 root SW 0 0% 1% [bdi-default] 1383 1 root S 1824 1% 0% /usr/bin/oprofiled --session-dir=/r ...and the interesting thing is that [phy1] - the INACTIVE interface - consumes much more CPU time than [phy0] which is actually transmitting. I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it does not change much. I have dumped frames on phy1, and all it sees are a few beacon frames. I have tried wlan1 in managed mode - same effect. The effect is slightly less in AP mode, though... I tried oprofile, but it shows mostly tx related things. So my question is: How can I find out, what [phy1] is so busy doing? Any ideas how to profile this? Thanks for your suggestions. I think, I was able to get a hold of this using the Ftrace function tracer. What is happening, is that received beacons get queued up by mac80211 (ieee80211_queue_work) and then the work queue (ieee80211_rx_mgmt_beacon) has many beacons to process. If we are running late, the funny thing is that most of these beacons in the queue will be irrelevant because they are too old. Obviously there is room for optimization here. Also another effect which I think adds to the high CPU utilization of the [phy1] process is that functions running in interrupt context are run at the expense of the process which has been interrupted. So if [phy1] work has been interrupted by a TX completed interrupt of phy0 it will be accounted to [phy1]. Here is some output of ftrace, which shows my first point: phy1-1089 [000] 594.942525: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 594.986019: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy1-1089 [000] 595.030190: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy1-1089 [000] 595.088663: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy1-1089 [000] 595.131842: ieee80211_queue_work - ieee80211_ibss_rx_mgmt top-1284 [000] 595.195661: ieee80211_queue_work - ieee80211_ibss_rx_mgmt top-1284 [000] 595.195812: ieee80211_queue_work - ieee80211_ibss_rx_mgmt top-1284 [000] 595.241381: ieee80211_queue_work - ieee80211_ibss_rx_mgmt top-1284 [000] 595.307445: ieee80211_queue_work - ieee80211_ibss_rx_mgmt top-1284 [000] 595.328727: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy0-1075 [000] 595.328935: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy0-1075 [000] 595.329223: ieee80211_queue_work - ieee80211_ibss_timer phy0-1075 [000] 595.395654: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy0-1075 [000] 595.395784: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy0-1075 [000] 595.440385: ieee80211_queue_work - ieee80211_ibss_rx_mgmt ksoftirqd/0-3 [000] 595.544013: ieee80211_queue_work - ieee80211_ibss_rx_mgmt ksoftirqd/0-3 [000] 595.614735: ieee80211_queue_work - ieee80211_ibss_rx_mgmt ksoftirqd/0-3 [000] 595.633631: ieee80211_queue_work - ieee80211_ibss_rx_mgmt phy1-1089 [000] 595.642036: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642122: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642173: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642280: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642334: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642385: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642437: ieee80211_rx_mgmt_beacon - ieee80211_ibss_work phy1-1089 [000] 595.642491: ieee80211_rx_mgmt_beacon -
[ath5k-devel] How to profile this strange effect under high load?
Hello! I am trying to debug a strange effect, I am seeing on soekris net48xx boards with two ath5k interfaces: * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of traffic routed thru it * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively sending (no IP address). I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very high: Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached CPU: 0% usr 0% sys 0% nic 0% idle 0% io 5% irq 93% sirq Load average: 1.73 1.01 2.23 2/38 3128 PID PPID USER STAT VSZ %MEM %CPU COMMAND 2772 2 root RW 0 0% 71% [phy1] 2757 2 root SW 0 0% 11% [phy0] 3128 436 root R 988 1% 8% top 3 2 root SW 0 0% 5% [ksoftirqd/0] 5 2 root SW 0 0% 3% [events/0] 56 2 root SW 0 0% 1% [bdi-default] 1383 1 root S 1824 1% 0% /usr/bin/oprofiled --session-dir=/r ...and the interesting thing is that [phy1] - the INACTIVE interface - consumes much more CPU time than [phy0] which is actually transmitting. I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it does not change much. I have dumped frames on phy1, and all it sees are a few beacon frames. I have tried wlan1 in managed mode - same effect. The effect is slightly less in AP mode, though... I tried oprofile, but it shows mostly tx related things. So my question is: How can I find out, what [phy1] is so busy doing? Any ideas how to profile this? Thanks in advance, bruno ___ ath5k-devel mailing list ath5k-devel@lists.ath5k.org https://lists.ath5k.org/mailman/listinfo/ath5k-devel
Re: [ath5k-devel] How to profile this strange effect under high load?
On Tue, Oct 12, 2010 at 6:33 AM, Bruno Randolf b...@einfach.org wrote: So my question is: How can I find out, what [phy1] is so busy doing? Any ideas how to profile this? /proc/interrupts shows lots of interrupts on phy1 line? Is it shared irq? What options did you try with oprofile? You can try perf -- as long as you limit the functions you profile it should have about the same overhead as oprofile. But then you should get about the same results with either; I just find perf to be a little better these days. One thing I know of is that all the linear interpolation we do for setting the power curves is pretty heavy cpu-wise and we do it on every reset. There's some low-hanging fruit there like precomputing the reciprocal of some scaling factors instead of some divisions... -- Bob Copeland %% www.bobcopeland.com ___ ath5k-devel mailing list ath5k-devel@lists.ath5k.org https://lists.ath5k.org/mailman/listinfo/ath5k-devel
Re: [ath5k-devel] How to profile this strange effect under high load?
How about using trace-cmd and kernelshark with function graph tracer to see what is getting called so often? -Qasim On Tue, Oct 12, 2010 at 5:33 AM, Bruno Randolf b...@einfach.org wrote: Hello! I am trying to debug a strange effect, I am seeing on soekris net48xx boards with two ath5k interfaces: * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of traffic routed thru it * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively sending (no IP address). I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very high: Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached CPU: 0% usr 0% sys 0% nic 0% idle 0% io 5% irq 93% sirq Load average: 1.73 1.01 2.23 2/38 3128 PID PPID USER STAT VSZ %MEM %CPU COMMAND 2772 2 root RW 0 0% 71% [phy1] 2757 2 root SW 0 0% 11% [phy0] 3128 436 root R 988 1% 8% top 3 2 root SW 0 0% 5% [ksoftirqd/0] 5 2 root SW 0 0% 3% [events/0] 56 2 root SW 0 0% 1% [bdi-default] 1383 1 root S 1824 1% 0% /usr/bin/oprofiled --session-dir=/r ...and the interesting thing is that [phy1] - the INACTIVE interface - consumes much more CPU time than [phy0] which is actually transmitting. I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it does not change much. I have dumped frames on phy1, and all it sees are a few beacon frames. I have tried wlan1 in managed mode - same effect. The effect is slightly less in AP mode, though... I tried oprofile, but it shows mostly tx related things. So my question is: How can I find out, what [phy1] is so busy doing? Any ideas how to profile this? Thanks in advance, bruno ___ ath5k-devel mailing list ath5k-devel@lists.ath5k.org https://lists.ath5k.org/mailman/listinfo/ath5k-devel ___ ath5k-devel mailing list ath5k-devel@lists.ath5k.org https://lists.ath5k.org/mailman/listinfo/ath5k-devel
Re: [ath5k-devel] How to profile this strange effect under high load?
2010/10/12 Bruno Randolf b...@einfach.org: Hello! I am trying to debug a strange effect, I am seeing on soekris net48xx boards with two ath5k interfaces: * wlan0 [phy0] configured in ad-hoc mode, A band, and it is getting a lot of traffic routed thru it * wlan1 [phy1] also configured in ad-hoc mode, G band, but not actively sending (no IP address). I run iperf between two PCs attached by ethernet, UDP 25Mbps and I am sure all traffic is routed thru wlan0 (phy0). The load on the SENDING box gets very high: Mem: 27304K used, 99040K free, 0K shrd, 616K buff, 14128K cached CPU: 0% usr 0% sys 0% nic 0% idle 0% io 5% irq 93% sirq Load average: 1.73 1.01 2.23 2/38 3128 PID PPID USER STAT VSZ %MEM %CPU COMMAND 2772 2 root RW 0 0% 71% [phy1] 2757 2 root SW 0 0% 11% [phy0] 3128 436 root R 988 1% 8% top 3 2 root SW 0 0% 5% [ksoftirqd/0] 5 2 root SW 0 0% 3% [events/0] 56 2 root SW 0 0% 1% [bdi-default] 1383 1 root S 1824 1% 0% /usr/bin/oprofiled --session-dir=/r ...and the interesting thing is that [phy1] - the INACTIVE interface - consumes much more CPU time than [phy0] which is actually transmitting. I have disabled ANI on wlan1, to avoid getting many MIB interrupts, but it does not change much. I have dumped frames on phy1, and all it sees are a few beacon frames. I have tried wlan1 in managed mode - same effect. The effect is slightly less in AP mode, though... I tried oprofile, but it shows mostly tx related things. So my question is: How can I find out, what [phy1] is so busy doing? Any ideas how to profile this? Thanks in advance, bruno When ph1 is also connected consumes more or less CPU ? What happens If you just give it an IP address ? -- GPG ID: 0xD21DB2DB As you read this post global entropy rises. Have Fun ;-) Nick ___ ath5k-devel mailing list ath5k-devel@lists.ath5k.org https://lists.ath5k.org/mailman/listinfo/ath5k-devel