Hi,
We were seeing a lot of Tx hangs (once a day in lab, much more in
production) with Ubuntu 18.04.5 built-in driver (5.0.0-k and 5.1.0-k),
causing link resets etc.. Trying to isolate the problem a bit more we're
now trying the latest out-of-tree driver. Now it's
Feb 03 22:46:19 x kernel: NETDEV WATCHDOG: enp13s0 (ixgbe): transmit
queue 3 timed out
Feb 03 22:46:19 x kernel: ixgbe 0000:0d:00.0 enp13s0: Fake Tx hang
detected with timeout of 5 seconds
Feb 04 01:56:01 x kernel: ixgbe 0000:0d:00.0 enp13s0: Fake Tx hang
detected with timeout of 10 seconds
Feb 04 15:13:21 x kernel: ixgbe 0000:0d:00.0 enp13s0: Fake Tx hang
detected with timeout of 20 seconds
"Fake Tx Hang", but traffic still stops for quite a while. Setup is
ixgbe 0000:0d:00.0: enabling device (0000 -> 0002)
ixgbe 0000:0d:00.0 0000:0d:00.0 (uninitialized): ixgbe_check_options:
FCoE Offload feature enabled
ixgbe 0000:0d:00.0: Multiqueue Enabled: Rx Queue count = 8, Tx Queue
count = 8 XDP Queue count = 0
ixgbe 0000:0d:00.0: 32.000 Gb/s available PCIe bandwidth, limited by 5
GT/s x8 link at 0000:00:01.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
ixgbe 0000:0d:00.0 eth0: MAC: 2, PHY: 20, SFP+: 5, PBA No: E68787-011
ixgbe 0000:0d:00.0 eth0: Enabled Features: RxQ: 8 TxQ: 8 FdirHash
ixgbe 0000:0d:00.0 eth0: Intel(R) 10 Gigabit Network Connection
talking via 10Gbps fibre to Netgear switch to 20 clients (1GbE over copper)
Disabling irqbalance and using set_irq_affinity (-x local) results in
/proc/interrupts
132: 6932 1128 1977 2921 7857 7960
4392 3754 IR-PCI-MSI 520192-edge enp0s31f6
133: 264128 280513 208862 114758 608511
839 90376 187222 IR-PCI-MSI 6815744-edge enp13s0-TxRx-0
134: 2844471 365014 105711 1009702 423599
688773 0 589578 IR-PCI-MSI 6815745-edge enp13s0-TxRx-1
135: 95567 140332 43052 46623 86704 57976
135573 167195 IR-PCI-MSI 6815746-edge enp13s0-TxRx-2
136: 318887 197761 625025 193550 101415
102114 55755 291272 IR-PCI-MSI 6815747-edge enp13s0-TxRx-3
137: 491537 214000 158900 244269 266167 97220
25460 289919 IR-PCI-MSI 6815748-edge enp13s0-TxRx-4
138: 65767 248812 115232 238502 80103 46189
50782 81833 IR-PCI-MSI 6815749-edge enp13s0-TxRx-5
139: 207157 260936 48883 47421 216735 97253
92818 89079 IR-PCI-MSI 6815750-edge enp13s0-TxRx-6
140: 21512 424646 5111 390019 441436 4728
291370 278387 IR-PCI-MSI 6815751-edge enp13s0-TxRx-7
141: 0 0 0 3 0 0
0 0 IR-PCI-MSI 6815752-edge enp13s0
TxRx-1 on CPU0 seems a bit unbalanced, but otherwise fine?
ethtool would suggest it's not flow control related (which would explain
things?), now experimenting with tuning interrupt moderation / ring size
/ avoiding CPU0 . Is there anything else to try?
NIC statistics:
rx_packets: 188092169
tx_packets: 59581798
rx_bytes: 263463989424
tx_bytes: 25727566814
rx_errors: 0
tx_errors: 0
rx_dropped: 0
tx_dropped: 0
multicast: 1237
collisions: 0
rx_over_errors: 0
rx_crc_errors: 0
rx_frame_errors: 0
rx_fifo_errors: 0
rx_missed_errors: 0
tx_aborted_errors: 0
tx_carrier_errors: 0
tx_fifo_errors: 0
tx_heartbeat_errors: 0
rx_pkts_nic: 188092169
tx_pkts_nic: 59581798
rx_bytes_nic: 264216358100
tx_bytes_nic: 25965914840
lsc_int: 3
tx_busy: 0
non_eop_descs: 0
broadcast: 259
rx_no_buffer_count: 0
tx_timeout_count: 0
tx_restart_queue: 4
rx_length_errors: 0
rx_long_length_errors: 0
rx_short_length_errors: 0
tx_flow_control_xon: 0
rx_flow_control_xon: 0
tx_flow_control_xoff: 0
rx_flow_control_xoff: 0
rx_csum_offload_errors: 0
alloc_rx_page: 40526166
alloc_rx_page_failed: 0
alloc_rx_buff_failed: 0
rx_no_dma_resources: 0
hw_rsc_aggregated: 0
hw_rsc_flushed: 0
fdir_match: 188027415
fdir_miss: 178920
fdir_overflow: 0
fcoe_bad_fccrc: 0
fcoe_last_errors: 0
rx_fcoe_dropped: 0
rx_fcoe_packets: 0
rx_fcoe_dwords: 0
fcoe_noddp: 0
fcoe_noddp_ext_buff: 0
tx_fcoe_packets: 0
tx_fcoe_dwords: 0
os2bmc_rx_by_bmc: 0
os2bmc_tx_by_bmc: 0
os2bmc_tx_by_host: 0
os2bmc_rx_by_host: 0
tx_hwtstamp_timeouts: 0
tx_hwtstamp_skipped: 0
rx_hwtstamp_cleared: 0
tx_queue_0_packets: 2551919
tx_queue_0_bytes: 303310258
tx_queue_1_packets: 34799436
tx_queue_1_bytes: 2365301599
tx_queue_2_packets: 726053
tx_queue_2_bytes: 101796295
tx_queue_3_packets: 5109959
tx_queue_3_bytes: 4885506128
tx_queue_4_packets: 4655587
tx_queue_4_bytes: 4755028677
tx_queue_5_packets: 2533314
tx_queue_5_bytes: 2516349075
tx_queue_6_packets: 4109721
tx_queue_6_bytes: 5511688758
tx_queue_7_packets: 5095809
tx_queue_7_bytes: 5288586024
tx_queue_8_packets: 0
tx_queue_8_bytes: 0
tx_queue_9_packets: 0
tx_queue_9_bytes: 0
tx_queue_10_packets: 0
tx_queue_10_bytes: 0
tx_queue_11_packets: 0
tx_queue_11_bytes: 0
tx_queue_12_packets: 0
tx_queue_12_bytes: 0
tx_queue_13_packets: 0
tx_queue_13_bytes: 0
tx_queue_14_packets: 0
tx_queue_14_bytes: 0
tx_queue_15_packets: 0
tx_queue_15_bytes: 0
tx_queue_16_packets: 0
tx_queue_16_bytes: 0
tx_queue_17_packets: 0
tx_queue_17_bytes: 0
tx_queue_18_packets: 0
tx_queue_18_bytes: 0
tx_queue_19_packets: 0
tx_queue_19_bytes: 0
tx_queue_20_packets: 0
tx_queue_20_bytes: 0
tx_queue_21_packets: 0
tx_queue_21_bytes: 0
tx_queue_22_packets: 0
tx_queue_22_bytes: 0
tx_queue_23_packets: 0
tx_queue_23_bytes: 0
tx_queue_24_packets: 0
tx_queue_24_bytes: 0
tx_queue_25_packets: 0
tx_queue_25_bytes: 0
tx_queue_26_packets: 0
tx_queue_26_bytes: 0
tx_queue_27_packets: 0
tx_queue_27_bytes: 0
tx_queue_28_packets: 0
tx_queue_28_bytes: 0
tx_queue_29_packets: 0
tx_queue_29_bytes: 0
tx_queue_30_packets: 0
tx_queue_30_bytes: 0
tx_queue_31_packets: 0
tx_queue_31_bytes: 0
tx_queue_32_packets: 0
tx_queue_32_bytes: 0
tx_queue_33_packets: 0
tx_queue_33_bytes: 0
tx_queue_34_packets: 0
tx_queue_34_bytes: 0
tx_queue_35_packets: 0
tx_queue_35_bytes: 0
tx_queue_36_packets: 0
tx_queue_36_bytes: 0
tx_queue_37_packets: 0
tx_queue_37_bytes: 0
tx_queue_38_packets: 0
tx_queue_38_bytes: 0
tx_queue_39_packets: 0
tx_queue_39_bytes: 0
tx_queue_40_packets: 0
tx_queue_40_bytes: 0
tx_queue_41_packets: 0
tx_queue_41_bytes: 0
tx_queue_42_packets: 0
tx_queue_42_bytes: 0
tx_queue_43_packets: 0
tx_queue_43_bytes: 0
tx_queue_44_packets: 0
tx_queue_44_bytes: 0
tx_queue_45_packets: 0
tx_queue_45_bytes: 0
tx_queue_46_packets: 0
tx_queue_46_bytes: 0
tx_queue_47_packets: 0
tx_queue_47_bytes: 0
tx_queue_48_packets: 0
tx_queue_48_bytes: 0
tx_queue_49_packets: 0
tx_queue_49_bytes: 0
tx_queue_50_packets: 0
tx_queue_50_bytes: 0
tx_queue_51_packets: 0
tx_queue_51_bytes: 0
tx_queue_52_packets: 0
tx_queue_52_bytes: 0
tx_queue_53_packets: 0
tx_queue_53_bytes: 0
tx_queue_54_packets: 0
tx_queue_54_bytes: 0
tx_queue_55_packets: 0
tx_queue_55_bytes: 0
tx_queue_56_packets: 0
tx_queue_56_bytes: 0
tx_queue_57_packets: 0
tx_queue_57_bytes: 0
tx_queue_58_packets: 0
tx_queue_58_bytes: 0
tx_queue_59_packets: 0
tx_queue_59_bytes: 0
tx_queue_60_packets: 0
tx_queue_60_bytes: 0
tx_queue_61_packets: 0
tx_queue_61_bytes: 0
tx_queue_62_packets: 0
tx_queue_62_bytes: 0
tx_queue_63_packets: 0
tx_queue_63_bytes: 0
tx_queue_64_packets: 0
tx_queue_64_bytes: 0
tx_queue_65_packets: 0
tx_queue_65_bytes: 0
tx_queue_66_packets: 0
tx_queue_66_bytes: 0
tx_queue_67_packets: 0
tx_queue_67_bytes: 0
tx_queue_68_packets: 0
tx_queue_68_bytes: 0
tx_queue_69_packets: 0
tx_queue_69_bytes: 0
tx_queue_70_packets: 0
tx_queue_70_bytes: 0
rx_queue_0_packets: 11217199
rx_queue_0_bytes: 16124868046
rx_queue_1_packets: 157176240
rx_queue_1_bytes: 227080033991
rx_queue_2_packets: 1501661
rx_queue_2_bytes: 1972184904
rx_queue_3_packets: 5075867
rx_queue_3_bytes: 5607058971
rx_queue_4_packets: 5326935
rx_queue_4_bytes: 6020504920
rx_queue_5_packets: 1606940
rx_queue_5_bytes: 1282193306
rx_queue_6_packets: 2148621
rx_queue_6_bytes: 1388353208
rx_queue_7_packets: 4038706
rx_queue_7_bytes: 3988792078
rx_queue_8_packets: 0
rx_queue_8_bytes: 0
rx_queue_9_packets: 0
rx_queue_9_bytes: 0
rx_queue_10_packets: 0
rx_queue_10_bytes: 0
rx_queue_11_packets: 0
rx_queue_11_bytes: 0
rx_queue_12_packets: 0
rx_queue_12_bytes: 0
rx_queue_13_packets: 0
rx_queue_13_bytes: 0
rx_queue_14_packets: 0
rx_queue_14_bytes: 0
rx_queue_15_packets: 0
rx_queue_15_bytes: 0
rx_queue_16_packets: 0
rx_queue_16_bytes: 0
rx_queue_17_packets: 0
rx_queue_17_bytes: 0
rx_queue_18_packets: 0
rx_queue_18_bytes: 0
rx_queue_19_packets: 0
rx_queue_19_bytes: 0
rx_queue_20_packets: 0
rx_queue_20_bytes: 0
rx_queue_21_packets: 0
rx_queue_21_bytes: 0
rx_queue_22_packets: 0
rx_queue_22_bytes: 0
rx_queue_23_packets: 0
rx_queue_23_bytes: 0
rx_queue_24_packets: 0
rx_queue_24_bytes: 0
rx_queue_25_packets: 0
rx_queue_25_bytes: 0
rx_queue_26_packets: 0
rx_queue_26_bytes: 0
rx_queue_27_packets: 0
rx_queue_27_bytes: 0
rx_queue_28_packets: 0
rx_queue_28_bytes: 0
rx_queue_29_packets: 0
rx_queue_29_bytes: 0
rx_queue_30_packets: 0
rx_queue_30_bytes: 0
rx_queue_31_packets: 0
rx_queue_31_bytes: 0
rx_queue_32_packets: 0
rx_queue_32_bytes: 0
rx_queue_33_packets: 0
rx_queue_33_bytes: 0
rx_queue_34_packets: 0
rx_queue_34_bytes: 0
rx_queue_35_packets: 0
rx_queue_35_bytes: 0
rx_queue_36_packets: 0
rx_queue_36_bytes: 0
rx_queue_37_packets: 0
rx_queue_37_bytes: 0
rx_queue_38_packets: 0
rx_queue_38_bytes: 0
rx_queue_39_packets: 0
rx_queue_39_bytes: 0
rx_queue_40_packets: 0
rx_queue_40_bytes: 0
rx_queue_41_packets: 0
rx_queue_41_bytes: 0
rx_queue_42_packets: 0
rx_queue_42_bytes: 0
rx_queue_43_packets: 0
rx_queue_43_bytes: 0
rx_queue_44_packets: 0
rx_queue_44_bytes: 0
rx_queue_45_packets: 0
rx_queue_45_bytes: 0
rx_queue_46_packets: 0
rx_queue_46_bytes: 0
rx_queue_47_packets: 0
rx_queue_47_bytes: 0
rx_queue_48_packets: 0
rx_queue_48_bytes: 0
rx_queue_49_packets: 0
rx_queue_49_bytes: 0
rx_queue_50_packets: 0
rx_queue_50_bytes: 0
rx_queue_51_packets: 0
rx_queue_51_bytes: 0
rx_queue_52_packets: 0
rx_queue_52_bytes: 0
rx_queue_53_packets: 0
rx_queue_53_bytes: 0
rx_queue_54_packets: 0
rx_queue_54_bytes: 0
rx_queue_55_packets: 0
rx_queue_55_bytes: 0
rx_queue_56_packets: 0
rx_queue_56_bytes: 0
rx_queue_57_packets: 0
rx_queue_57_bytes: 0
rx_queue_58_packets: 0
rx_queue_58_bytes: 0
rx_queue_59_packets: 0
rx_queue_59_bytes: 0
rx_queue_60_packets: 0
rx_queue_60_bytes: 0
rx_queue_61_packets: 0
rx_queue_61_bytes: 0
rx_queue_62_packets: 0
rx_queue_62_bytes: 0
rx_queue_63_packets: 0
rx_queue_63_bytes: 0
rx_queue_64_packets: 0
rx_queue_64_bytes: 0
rx_queue_65_packets: 0
rx_queue_65_bytes: 0
rx_queue_66_packets: 0
rx_queue_66_bytes: 0
rx_queue_67_packets: 0
rx_queue_67_bytes: 0
rx_queue_68_packets: 0
rx_queue_68_bytes: 0
rx_queue_69_packets: 0
rx_queue_69_bytes: 0
rx_queue_70_packets: 0
rx_queue_70_bytes: 0
tx_pb_0_pxon: 0
tx_pb_0_pxoff: 0
tx_pb_1_pxon: 0
tx_pb_1_pxoff: 0
tx_pb_2_pxon: 0
tx_pb_2_pxoff: 0
tx_pb_3_pxon: 0
tx_pb_3_pxoff: 0
tx_pb_4_pxon: 0
tx_pb_4_pxoff: 0
tx_pb_5_pxon: 0
tx_pb_5_pxoff: 0
tx_pb_6_pxon: 0
tx_pb_6_pxoff: 0
tx_pb_7_pxon: 0
tx_pb_7_pxoff: 0
rx_pb_0_pxon: 0
rx_pb_0_pxoff: 0
rx_pb_1_pxon: 0
rx_pb_1_pxoff: 0
rx_pb_2_pxon: 0
rx_pb_2_pxoff: 0
rx_pb_3_pxon: 0
rx_pb_3_pxoff: 0
rx_pb_4_pxon: 0
rx_pb_4_pxoff: 0
rx_pb_5_pxon: 0
rx_pb_5_pxoff: 0
rx_pb_6_pxon: 0
rx_pb_6_pxoff: 0
rx_pb_7_pxon: 0
rx_pb_7_pxoff: 0
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel Ethernet, visit
https://forums.intel.com/s/topic/0TO0P00000018NbWAI/intel-ethernet