[Kernel-packages] [Bug 1736794] Re: iwlwifi driver for Intel 7260 crashing

2018-03-19 Thread Simon Ye
This is a duplicate of
https://bugzilla.kernel.org/show_bug.cgi?id=191601, which also contains
a script that might be able to reset your card without a reboot.

Please see my longer comments here:
https://bugs.launchpad.net/ubuntu/+source/linux-firmware/+bug/1673344.
In summary, I believe this to be a longstanding hardware-software bug in
Linux due to a hardware degraded 7260 NIC. Replacing the 7260 card fixed
it for me.

** Bug watch added: Linux Kernel Bug Tracker #191601
   https://bugzilla.kernel.org/show_bug.cgi?id=191601

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/1736794

Title:
  iwlwifi driver for Intel 7260 crashing

Status in linux-firmware package in Ubuntu:
  New

Bug description:
  Intel NUC's from Axiomtek with Intel Corporation Wireless 7260 (rev
  bb) chipsets. We have this running on almost 200 of these devices.

  Ubuntu 16.04.3 as well as Ubuntu 14.04.5

  "loaded firmware version 17.608620.0 op_mode iwlmvm"

  linux-firmware trusty: 1.127.24
  linux-firmware xenial: 1.157.14

  The wireless driver will crash at least daily, if not multiple times
  in a day depending on the machine. Swapping the chipset out for an
  Atheros works just fine.

  I have followed the many other bug reports related to this same
  chipset and they are either marked incomplete, refer to hardware we
  aren't running or have been marked as fix released.

  The latest linux-firmware did not resolve or change the behavior of
  the issue.

  Output from dmesg when the crash occurs:

  [Wed Dec  6 18:56:38 2017] iwlwifi :02:00.0: Queue 16 stuck for 1 ms.
  [Wed Dec  6 18:56:38 2017] iwlwifi :02:00.0: Current SW read_ptr 236 
write_ptr 88
  [Wed Dec  6 18:56:38 2017] [ cut here ]
  [Wed Dec  6 18:56:38 2017] WARNING: CPU: 3 PID: 0 at 
/build/linux-lts-xenial-CtFtiH/linux-lts-xenial-4.4.0/drivers/net/wireless/iwlwifi/pcie/trans.c:1552
 iwl_trans_pcie_grab_nic_access+0xf5/0x100 [iwlwifi]()
  [Wed Dec  6 18:56:38 2017] Timeout waiting for hardware access (CSR_GP_CNTRL 
0x)
  [Wed Dec  6 18:56:38 2017] Modules linked in: drbg ansi_cprng ctr ccm 
uvcvideo videobuf2_vmalloc btusb videobuf2_memops btrtl videobuf2_v4l2 btbcm 
btintel videobuf2_core hid_generic bluetooth v4l2_common videodev usbhid media 
arc4 snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic iwlmvm 
mac80211 intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel 
kvm irqbypass iwlwifi crct10dif_pclmul crc32_pclmul ghash_clmulni_intel 
cfg80211 aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd 
snd_soc_rt5640 snd_soc_rl6231 snd_soc_ssm4567 serio_raw lpc_ich i915 shpchp 
binfmt_misc snd_hda_intel snd_soc_core snd_compress snd_hda_codec ac97_bus 
snd_seq_midi snd_seq_midi_event snd_rawmidi snd_pcm_dmaengine snd_hda_core 
snd_hwdep snd_pcm 8250_fintek snd_seq video snd_seq_device snd_timer 
drm_kms_helper snd elan_i2c 8250_dw drm dw_dmac i2c_hid hid dw_dmac_core mei_me 
fb_sys_fops syscopyarea snd_soc_sst_acpi i2c_designware_platform mac_hid 
sysfillrect spi_pxa2xx_platform mei i2c_designware_core soundcore acpi_pad 
sysimgblt acpi_als kfifo_buf industrialio lp parport igb psmouse ahci e1000e 
i2c_algo_bit libahci dca ptp pps_core fjes sdhci_acpi sdhci
  [Wed Dec  6 18:56:38 2017] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 
4.4.0-101-generic #124~14.04.1-Ubuntu
  [Wed Dec  6 18:56:38 2017] Hardware name: To be filled by O.E.M. To be filled 
by O.E.M./CRESCENTBAY, BIOS 5.6.5 12/14/2016
  [Wed Dec  6 18:56:38 2017]   88012dd83d00 
813dfc6c 88012dd83d48
  [Wed Dec  6 18:56:38 2017]  c07ca6e0 88012dd83d38 
8107ea26 8800ca47
  [Wed Dec  6 18:56:38 2017]  8800ca4735f0 88012dd83dd8 
 880128108bc0
  [Wed Dec  6 18:56:38 2017] Call Trace:
  [Wed Dec  6 18:56:38 2017][] dump_stack+0x63/0x87
  [Wed Dec  6 18:56:38 2017]  [] 
warn_slowpath_common+0x86/0xc0
  [Wed Dec  6 18:56:38 2017]  [] warn_slowpath_fmt+0x4c/0x50
  [Wed Dec  6 18:56:38 2017]  [] ? iwl_read32+0x1f/0x90 
[iwlwifi]
  [Wed Dec  6 18:56:38 2017]  [] 
iwl_trans_pcie_grab_nic_access+0xf5/0x100 [iwlwifi]
  [Wed Dec  6 18:56:38 2017]  [] 
iwl_trans_pcie_read_mem+0x2f/0xa0 [iwlwifi]
  [Wed Dec  6 18:56:38 2017]  [] 
iwl_pcie_txq_stuck_timer+0xd8/0x390 [iwlwifi]
  [Wed Dec  6 18:56:38 2017]  [] ? 
intel_pstate_timer_func+0x2db/0x3a0
  [Wed Dec  6 18:56:38 2017]  [] ? 
iwl_pcie_enqueue_hcmd+0xa90/0xa90 [iwlwifi]
  [Wed Dec  6 18:56:38 2017]  [] call_timer_fn+0x35/0x130
  [Wed Dec  6 18:56:38 2017]  [] ? 
iwl_pcie_enqueue_hcmd+0xa90/0xa90 [iwlwifi]
  [Wed Dec  6 18:56:38 2017]  [] run_timer_softirq+0x20e/0x2c0
  [Wed Dec  6 18:56:38 2017]  [] __do_softirq+0xdd/0x290
  [Wed Dec  6 18:56:38 2017]  [] irq_exit+0x95/0xa0
  [Wed Dec  6 18:56:38 2017]  [] 
smp_apic_timer_interrupt+0x42/0x50
  [Wed Dec  6 18:56:38 2017]  [] 

[Kernel-packages] [Bug 1673344] Re: Ubuntu 16.04 doesn't recongnize wifi card after loosing signal completely (intel 7260) even when I try to restart network manager

2018-03-19 Thread Simon Ye
Final update on this. It turned out that using the 2.4 Ghz band was much
more stable than 5 Ghz, which got wedged within minutes, but conditions
eventually worsened until even on 2.4 Ghz without much internet usage it
would get wedged every few hours. I then tried out various kernel
versions on 16.04 LTS going all the way back to 4.4.9 and different
versions of iwlwifi ucode with no success and different variations of
dmesg output, even though I recalled the wifi working just fine back
before upgrading to 17.10.

Looking through other bugs, this one is very similar
https://bugzilla.kernel.org/show_bug.cgi?id=191601 and there is a script
that worked for me to reset the wifi without having to reboot, which was
a major improvement.

Clearly there wasn't a specific recent software change in Linux that
caused this issue. Other people reporting this bug have reported it
cropping up at different OS/kernel versions. The wifi still works
perfectly fine on Windows 10, and the script to remove/re-add the PCI
card via sysfs can recover the wifi. Based on all of these facts, I
believe this is a longstanding hardware-software bug in Linux where it
can't recover from certain conditions when there is hardware degradation
on the PCI card - perhaps some timeout deadlock - that Windows either
shrugs off or can recover from. Based on that hunch, I ordered a new
7260 half height PCI-E card, replaced it in my desktop, and wifi has
been working perfectly since (a week or so).

At this stage, this card is very old and no specific bug can be easily
pinpointed. Furthermore it requires a somehow hardware degraded 7260
card to reproduce so a fix will be unlikely. In my opinion it is your
best shot to buy a replacement 7260 card to tide you over until you
upgrade your PC.

** Bug watch added: Linux Kernel Bug Tracker #191601
   https://bugzilla.kernel.org/show_bug.cgi?id=191601

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/1673344

Title:
  Ubuntu 16.04 doesn't recongnize wifi card after loosing signal
  completely (intel 7260) even when I try to restart network manager

Status in linux package in Ubuntu:
  Confirmed
Status in linux-firmware package in Ubuntu:
  Confirmed

Bug description:
  My wireless connection completely disappears and isn't recognized
  along with my NIC card and doesn't come back until I reboot even if I
  restart network manager.

  
  I've attached the required files showing what my wifi information looks like 
connected and what it looks like when the problem occurs hopefully a fix or 
workaround can be found this happens several times a day.

  Steps:

  1) I boot up type in sudo iwconfig wlp3s0 power off (turn off power 
management)
   
  2) 
  $ iwconfig
  enp4s0f2  no wireless extensions.
  
  lono wireless extensions.
  
  wlp3s0IEEE 802.11  ESSID:"2.4ghz"  
Mode:Managed  Frequency:2.437 GHz  Access Point: 
E0:3F:29:93:3F:20   
Bit Rate=144.4 Mb/s   Tx-Power=22 dBm   
Retry short limit:7   RTS thr:off   Fragment thr:off
Power Management:off
Link Quality=60/70  Signal level=-50 dBm  
Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
Tx excessive retries:0  Invalid misc:55   Missed beacon:0
  
  3) I loose wifi using 5ghz or 2.4ghz after a certain amount of time
  
  4) sudo rmmod iwlmvm iwlwifi && sudo modprobe iwlmvm iwlwifi
  5) sudo systemctl restart network-manager.service

  I do a **iwconfig** and **I'm missing wlp3s0IEEE 802.11
  ESSID:"2.4ghz"**

  $ iwconfig
  enp4s0f2  no wireless extensions.
  
  lono wireless extensions.

  I have to reboot to get my wifi back and for ubunutu to recognize my
  wifi card back how can I prevent this.

  $ uname -a
  Linux sys76 4.8.0-41-generic #44~16.04.1-Ubuntu SMP Fri Mar 3 17:11:16 
UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

  $ dmesg | grep iwlwifi
  876.891600] iwlwifi :03:00.0: Error sending SCAN_OFFLOAD_REQUEST_CMD: 
enqueue_hcmd failed: -5
  [  876.891602] iwlwifi :03:00.0: Scan failed! ret -5
  [  877.909370] iwlwifi :03:00.0: Failed to wake NIC for hcmd
  [  877.909397] iwlwifi :03:00.0: Error sending 
SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
  [  877.909409] iwlwifi :03:00.0: Scan failed! ret -5
  [  878.928277] iwlwifi :03:00.0: Failed to wake NIC for hcmd
  [  878.928364] iwlwifi :03:00.0: Error sending 
SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
  [  878.928366] iwlwifi :03:00.0: Scan failed! ret -5
  [  879.947120] iwlwifi :03:00.0: Failed to wake NIC for hcmd
  [  879.947170] iwlwifi :03:00.0: Error sending 
SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
  [  879.947172] iwlwifi :03:00.0: Scan failed! 

[Kernel-packages] [Bug 1673344] Re: Ubuntu 16.04 doesn't recongnize wifi card after loosing signal completely (intel 7260) even when I try to restart network manager

2018-01-29 Thread Simon Ye
Also now experiencing this bug after 17.10 upgrade - never had a problem
before. Wifi still works fine on the Windows side of dual boot so
probably software issue. Messing with swcrypto, rfkill, power management
off, WOL off doesn't seem to have any effect. I tried different older
ucode versions from
https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-
firmware.git but the problem still exists, although the oldest 7260-17
ucode version on the git tree doesn't continually spam errors to dmesg
after the initial failure, just once for the initial failure.

One strange thing I noticed is that stability is much worse with an
active deluge (torrent client) process, in which iwlwifi wedges after
hours->a day. But with deluge off it can last for multiple days - maybe
indefinitely - without wedging.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/1673344

Title:
  Ubuntu 16.04 doesn't recongnize wifi card after loosing signal
  completely (intel 7260) even when I try to restart network manager

Status in linux package in Ubuntu:
  Confirmed
Status in linux-firmware package in Ubuntu:
  Confirmed

Bug description:
  My wireless connection completely disappears and isn't recognized
  along with my NIC card and doesn't come back until I reboot even if I
  restart network manager.

  
  I've attached the required files showing what my wifi information looks like 
connected and what it looks like when the problem occurs hopefully a fix or 
workaround can be found this happens several times a day.

  Steps:

  1) I boot up type in sudo iwconfig wlp3s0 power off (turn off power 
management)
   
  2) 
  $ iwconfig
  enp4s0f2  no wireless extensions.
  
  lono wireless extensions.
  
  wlp3s0IEEE 802.11  ESSID:"2.4ghz"  
Mode:Managed  Frequency:2.437 GHz  Access Point: 
E0:3F:29:93:3F:20   
Bit Rate=144.4 Mb/s   Tx-Power=22 dBm   
Retry short limit:7   RTS thr:off   Fragment thr:off
Power Management:off
Link Quality=60/70  Signal level=-50 dBm  
Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
Tx excessive retries:0  Invalid misc:55   Missed beacon:0
  
  3) I loose wifi using 5ghz or 2.4ghz after a certain amount of time
  
  4) sudo rmmod iwlmvm iwlwifi && sudo modprobe iwlmvm iwlwifi
  5) sudo systemctl restart network-manager.service

  I do a **iwconfig** and **I'm missing wlp3s0IEEE 802.11
  ESSID:"2.4ghz"**

  $ iwconfig
  enp4s0f2  no wireless extensions.
  
  lono wireless extensions.

  I have to reboot to get my wifi back and for ubunutu to recognize my
  wifi card back how can I prevent this.

  $ uname -a
  Linux sys76 4.8.0-41-generic #44~16.04.1-Ubuntu SMP Fri Mar 3 17:11:16 
UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

  $ dmesg | grep iwlwifi
  876.891600] iwlwifi :03:00.0: Error sending SCAN_OFFLOAD_REQUEST_CMD: 
enqueue_hcmd failed: -5
  [  876.891602] iwlwifi :03:00.0: Scan failed! ret -5
  [  877.909370] iwlwifi :03:00.0: Failed to wake NIC for hcmd
  [  877.909397] iwlwifi :03:00.0: Error sending 
SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
  [  877.909409] iwlwifi :03:00.0: Scan failed! ret -5
  [  878.928277] iwlwifi :03:00.0: Failed to wake NIC for hcmd
  [  878.928364] iwlwifi :03:00.0: Error sending 
SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
  [  878.928366] iwlwifi :03:00.0: Scan failed! ret -5
  [  879.947120] iwlwifi :03:00.0: Failed to wake NIC for hcmd
  [  879.947170] iwlwifi :03:00.0: Error sending 
SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
  [  879.947172] iwlwifi :03:00.0: Scan failed! ret -5

  
  $ sudo lshw -C network
  [sudo] password for rt: 
*-network   
 description: Wireless interface
 product: Wireless 7260
 vendor: Intel Corporation
 physical id: 0
 bus info: pci@:03:00.0
 logical name: wlp3s0
 version: 73
 serial: 0c:8b:fd:2e:64:8e
 width: 64 bits
 clock: 33MHz
 capabilities: pm msi pciexpress bus_master cap_list ethernet 
physical wireless
 configuration: broadcast=yes driver=iwlwifi 
driverversion=4.8.0-41-generic firmware=17.352738.0 ip=192.168.1.8 latency=0 
link=yes multicast=yes wireless=IEEE 802.11
 resources: irq:31 memory:f7d0-f7d01fff
*-network
 description: Ethernet interface
 product: RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller
 vendor: Realtek Semiconductor Co., Ltd.
 physical id: 0.2
 bus info: pci@:04:00.2
 logical name: