[Kernel-packages] [Bug 1886588] Re: ath10k_pci often crashes in focal

2020-08-25 Thread Alex Hung
vicamo posted a probable solution @
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1891405. See #19
for more details

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1886588

Title:
  ath10k_pci often crashes in focal

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  WIreless connectivity has been very flaky in focal ever since release.
  Thought I'd start investigating a bit. This is all I found, but it
  clearly signisfies an error. Please let med know, if I can provide
  more information.

  Description:  Ubuntu 20.04 LTS
  Release:  20.04

  
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: firmware crashed! (guid 
ba24fee9-d0cb-42e2-9aef-06f0d07a053e)
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: qca6174 hw3.2 target 
0x0503 chip_id 0x00340aff sub 1a56:1535
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: kconfig debug 0 debugfs 
1 tracing 1 dfs 0 testmode 0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: firmware ver 
WLAN.RM.4.4.1-00140-QCARMSWPZ-1 api 6 features wowlan,ignore-otp,mfp crc32 
29eb8ca1
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: board_file api 2 bmi_id 
N/A crc32 4ac0889b
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: htt-ver 3.60 wmi-op 4 
htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to get memcpy hi 
address for firmware address 4: -16
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to read firmware 
dump area: -16
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: Copy Engine register 
dump:
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [00]: 0x00034400  11  11 
  3   3
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [01]: 0x00034800   3   2 
175 176
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [02]: 0x00034c00  12  11 
 10  11
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [03]: 0x00035000  12  12 
 14  12
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [04]: 0x00035400 6557 
6549  11 203
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [05]: 0x00035800   0   0 
 64   0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [06]: 0x00035c00   3   1 
 10   8
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [07]: 0x00036000   1   0 
  1   0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to read 
hi_board_data address: -28
  [ma. juli  6 23:16:51 2020] ieee80211 phy0: Hardware restart was requested
  [ma. juli  6 23:16:51 2020] ath10k_pci :3b:00.0: unsupported HTC service 
id: 1536
  [ma. juli  6 23:16:51 2020] ath10k_pci :3b:00.0: device successfully 
recovered
  [ma. juli  6 23:19:57 2020] mce: CPU2: Core temperature above threshold, cpu 
clock throttled (total events = 6714)
  [ma. juli  6 23:19:57 2020] mce: CPU1: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU5: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU4: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU2: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU0: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU3: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU2: Core temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU1: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU5: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU4: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU2: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU0: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU3: Package temperature/speed normal
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:08 2020] veth76e34af: renamed from eth0
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:08 2020] device veth76af7ee left promiscuous mode
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:18 2020] wlp59s0: deauthenticating from 00:22:07:80:6c:2e 
by local choice (Reason: 3=DEAUTH_LEAVING)
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: pci irq msi 
oper_irq_mode 2 irq_mode 0 reset_mode 0
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: qca6174 hw3.2 target 
0x0503 chip_id 0x00340aff sub 1a56:1535
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: kconfig debug 0 debugfs 
1 tracing 1 dfs 0 testmode 0
  [ma. juli  6 

[Kernel-packages] [Bug 1886588] Re: ath10k_pci often crashes in focal

2020-07-09 Thread Alex Hung
The log in description clearly points out "firmware crashed", and that
may be something we can forward to the hardware vendor.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1886588

Title:
  ath10k_pci often crashes in focal

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  WIreless connectivity has been very flaky in focal ever since release.
  Thought I'd start investigating a bit. This is all I found, but it
  clearly signisfies an error. Please let med know, if I can provide
  more information.

  Description:  Ubuntu 20.04 LTS
  Release:  20.04

  
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: firmware crashed! (guid 
ba24fee9-d0cb-42e2-9aef-06f0d07a053e)
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: qca6174 hw3.2 target 
0x0503 chip_id 0x00340aff sub 1a56:1535
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: kconfig debug 0 debugfs 
1 tracing 1 dfs 0 testmode 0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: firmware ver 
WLAN.RM.4.4.1-00140-QCARMSWPZ-1 api 6 features wowlan,ignore-otp,mfp crc32 
29eb8ca1
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: board_file api 2 bmi_id 
N/A crc32 4ac0889b
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: htt-ver 3.60 wmi-op 4 
htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to get memcpy hi 
address for firmware address 4: -16
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to read firmware 
dump area: -16
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: Copy Engine register 
dump:
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [00]: 0x00034400  11  11 
  3   3
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [01]: 0x00034800   3   2 
175 176
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [02]: 0x00034c00  12  11 
 10  11
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [03]: 0x00035000  12  12 
 14  12
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [04]: 0x00035400 6557 
6549  11 203
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [05]: 0x00035800   0   0 
 64   0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [06]: 0x00035c00   3   1 
 10   8
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [07]: 0x00036000   1   0 
  1   0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to read 
hi_board_data address: -28
  [ma. juli  6 23:16:51 2020] ieee80211 phy0: Hardware restart was requested
  [ma. juli  6 23:16:51 2020] ath10k_pci :3b:00.0: unsupported HTC service 
id: 1536
  [ma. juli  6 23:16:51 2020] ath10k_pci :3b:00.0: device successfully 
recovered
  [ma. juli  6 23:19:57 2020] mce: CPU2: Core temperature above threshold, cpu 
clock throttled (total events = 6714)
  [ma. juli  6 23:19:57 2020] mce: CPU1: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU5: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU4: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU2: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU0: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU3: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU2: Core temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU1: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU5: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU4: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU2: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU0: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU3: Package temperature/speed normal
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:08 2020] veth76e34af: renamed from eth0
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:08 2020] device veth76af7ee left promiscuous mode
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:18 2020] wlp59s0: deauthenticating from 00:22:07:80:6c:2e 
by local choice (Reason: 3=DEAUTH_LEAVING)
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: pci irq msi 
oper_irq_mode 2 irq_mode 0 reset_mode 0
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: qca6174 hw3.2 target 
0x0503 chip_id 0x00340aff sub 1a56:1535
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: kconfig debug 0 debugfs 
1 tracing 1 dfs 0 testmode 0
  [ma. juli  6 

[Kernel-packages] [Bug 1886588] Re: ath10k_pci often crashes in focal

2020-07-08 Thread Alex Hung
I have also a Qualcomm Atheros QCA6174 and can observe it fails from
time to time, not in focal only. More discussion can be found in
LP:1872351

While there aren't solution now, I use "sudo modprobe -r ath10k_pci ;
sudo modprobe ath10k_pci" to avoid rebooting the system.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1886588

Title:
  ath10k_pci often crashes in focal

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  WIreless connectivity has been very flaky in focal ever since release.
  Thought I'd start investigating a bit. This is all I found, but it
  clearly signisfies an error. Please let med know, if I can provide
  more information.

  Description:  Ubuntu 20.04 LTS
  Release:  20.04

  
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: firmware crashed! (guid 
ba24fee9-d0cb-42e2-9aef-06f0d07a053e)
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: qca6174 hw3.2 target 
0x0503 chip_id 0x00340aff sub 1a56:1535
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: kconfig debug 0 debugfs 
1 tracing 1 dfs 0 testmode 0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: firmware ver 
WLAN.RM.4.4.1-00140-QCARMSWPZ-1 api 6 features wowlan,ignore-otp,mfp crc32 
29eb8ca1
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: board_file api 2 bmi_id 
N/A crc32 4ac0889b
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: htt-ver 3.60 wmi-op 4 
htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to get memcpy hi 
address for firmware address 4: -16
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to read firmware 
dump area: -16
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: Copy Engine register 
dump:
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [00]: 0x00034400  11  11 
  3   3
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [01]: 0x00034800   3   2 
175 176
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [02]: 0x00034c00  12  11 
 10  11
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [03]: 0x00035000  12  12 
 14  12
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [04]: 0x00035400 6557 
6549  11 203
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [05]: 0x00035800   0   0 
 64   0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [06]: 0x00035c00   3   1 
 10   8
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: [07]: 0x00036000   1   0 
  1   0
  [ma. juli  6 23:16:50 2020] ath10k_pci :3b:00.0: failed to read 
hi_board_data address: -28
  [ma. juli  6 23:16:51 2020] ieee80211 phy0: Hardware restart was requested
  [ma. juli  6 23:16:51 2020] ath10k_pci :3b:00.0: unsupported HTC service 
id: 1536
  [ma. juli  6 23:16:51 2020] ath10k_pci :3b:00.0: device successfully 
recovered
  [ma. juli  6 23:19:57 2020] mce: CPU2: Core temperature above threshold, cpu 
clock throttled (total events = 6714)
  [ma. juli  6 23:19:57 2020] mce: CPU1: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU5: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU4: Package temperature above threshold, 
cpu clock throttled (total events = 20892)
  [ma. juli  6 23:19:57 2020] mce: CPU2: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU0: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU3: Package temperature above threshold, 
cpu clock throttled (total events = 20893)
  [ma. juli  6 23:19:57 2020] mce: CPU2: Core temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU1: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU5: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU4: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU2: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU0: Package temperature/speed normal
  [ma. juli  6 23:19:57 2020] mce: CPU3: Package temperature/speed normal
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:08 2020] veth76e34af: renamed from eth0
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:08 2020] device veth76af7ee left promiscuous mode
  [ma. juli  6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled 
state
  [ma. juli  6 23:20:18 2020] wlp59s0: deauthenticating from 00:22:07:80:6c:2e 
by local choice (Reason: 3=DEAUTH_LEAVING)
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: pci irq msi 
oper_irq_mode 2 irq_mode 0 reset_mode 0
  [ma. juli  6 23:21:46 2020] ath10k_pci :3b:00.0: qca6174 hw3.2 target 
0x0503 chip_id