On 2015-05-02 00:16:43, Christian Schulte <c...@schulte.it> wrote: > >Synopsis: After some time (minutes or seconds) the em0 interface stops > >working > >Category: system > >Environment: > System : OpenBSD 5.7 > Details : OpenBSD 5.7-stable (GENERIC.MP) #0: Fri May 1 23:59:46 > CEST 2015 > > r...@t60.schulte.it:/usr/src/sys/arch/amd64/compile/GENERIC.MP > > Architecture: OpenBSD.amd64 > Machine : amd64 > >Description: > Following is the contents of the /etc/hostname.em0 file: > > inet 192.168.10.50 255.255.255.0 192.168.10.255 > > The em0 interface works as expected. After some time it stops > working. Processes currently transmitting data will show a > (Broken pipe) error. Doing ifconfig em0 down && sh /etc/netstart > the interface starts working again for some time and than hangs > again. > > >How-To-Repeat: > The issue is reproducible by simply using the em0 interface. > >Fix: > ifconfig em0 down && sh /etc/netstart > > [...] >
I wonder if this is the same thing that I mentioned here: http://marc.info/?l=openbsd-misc&m=141007061612003&w=2 Though I don't get a "(Broken pipe)" error, em0 just hangs and won't transmit packets. In a nutshell, CPU activity causes em0 to hang on both my T60 and X61t laptops. Provided I don't do CPU intensive activities, it can run fine for days, or potentially weeks at a time. However, running bogofilter (which I've stopped using since it wedged the network reliably every time I'd fetch my email), compiling a large port (smaller ones don't tend to wedge it, only larger ones that take at least several minutes to compile seem to) or just trying to do something like fetch and upgrade packages, or just going to a CPU intensive webpage like google maps will invariably cause the network to wedge. Neither machine will send or receive packets, including responding to ping requests when this happens. If I need to update packages or compile something, I'll usually just run a script like the following so I don't need to babysit the computer: while true; do sleep 25 && ifconfig em0 down up && sleep 1 && ifconfig em0 down up && echo ping; done I was contacted privately by a developer and sent a few patches to the UVM and softraid systems which caused them to KERNEL_LOCK() and KERNEL_UNLOCK() in a few key places which has helped a lot, but the network will still lock up reliably with heavy CPU activity. All I can say is that I first noticed this when I upgraded from a mid-July 2014 snap to an early-September 2014 snap, which I know is a very large window. I've unfortunately been too busy to try to go back and figure out what change caused this, and have been getting by with the above script when necessary. I've seen a lot of work done on the networking code and in UVM over the last year or so, so I've been upgrading to newer snaps as they've been released hoping that they'd fix it, but it seems like the problem may lie somewhere else or be obscure or specific to my setup. I've just assumed that since no one else has reported this problem, that there was something unique about my system or setup that was causing this, which would tend to lower the severity of this bug since as far as I knew up until now, I was the only one affected by it. I realize this is a rather poor bug report, but hopefully by at least mentioning the fact that a few key KERNEL_LOCK()/KERNEL_UNLOCK() calls sprinkled around some UVM and softraid code reduce the occurance of this bug, it at least gives someone a somewhat smaller target to look at. I can provide more information if necessary. A dmesg from the T60 (with the above mentioned patches applied) follows: -- Bryan OpenBSD 5.7-current (GENERIC.MP) #14: Sat Apr 11 00:04:25 PDT 2015 r...@shoshoni-m.shoshoni.info:/usr/src/sys/arch/i386/compile/GENERIC.MP cpu0: Intel(R) Core(TM)2 CPU T7200 @ 2.00GHz ("GenuineIntel" 686-class) 2 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,NXE,LONG,SSE3,DTES64,MWAIT,DS-CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,LAHF,PERF real mem = 2682601472 (2558MB) avail mem = 2626478080 (2504MB) mpath0 at root scsibus0 at mpath0: 256 targets mainbus0 at root bios0 at mainbus0: date 04/01/10, BIOS32 rev. 0 @ 0xfd6b0, SMBIOS rev. 2.4 @ 0xe0010 (68 entries) bios0: vendor LENOVO version "79ETE6WW (2.26 )" date 04/01/2010 bios0: LENOVO 2623D9U acpi0 at bios0: rev 2 acpi0: sleep states S0 S3 S4 S5 acpi0: tables DSDT FACP SSDT ECDT TCPA APIC MCFG HPET BOOT SSDT SSDT SSDT SSDT acpi0: wakeup devices LID_(S3) SLPB(S3) LURT(S3) DURT(S3) EXP0(S4) EXP1(S4) EXP2(S4) EXP3(S4) PCI1(S4) USB0(S3) USB1(S3) USB2(S3) USB7(S3) HDEF(S4) acpitimer0 at acpi0: 3579545 Hz, 24 bits acpiec0 at acpi0 acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat cpu0 at mainbus0: apid 0 (boot processor) mtrr: Pentium Pro MTRR support, 8 var ranges, 88 fixed ranges cpu0: apic clock running at 166MHz cpu0: mwait min=64, max=64, C-substates=0.2.2.2.2, IBE cpu1 at mainbus0: apid 1 (application processor) cpu1: Intel(R) Core(TM)2 CPU T7200 @ 2.00GHz ("GenuineIntel" 686-class) 2 GHz cpu1: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,NXE,LONG,SSE3,DTES64,MWAIT,DS-CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,LAHF,PERF ioapic0 at mainbus0: apid 1 pa 0xfec00000, version 20, 24 pins ioapic0: misconfigured as apic 2, remapped to apid 1 acpimcfg0 at acpi0 addr 0xf0000000, bus 0-63 acpihpet0 at acpi0: 14318179 Hz acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 1 (AGP_) acpiprt2 at acpi0: bus 2 (EXP0) acpiprt3 at acpi0: bus 3 (EXP1) acpiprt4 at acpi0: bus 4 (EXP2) acpiprt5 at acpi0: bus 12 (EXP3) acpiprt6 at acpi0: bus 21 (PCI1) acpicpu0 at acpi0: C3, C2, C1, PSS acpicpu1 at acpi0: C3, C2, C1, PSS acpipwrres0 at acpi0: PUBS, resource for USB0, USB2, USB7 acpitz0 at acpi0: critical temperature is 127 degC acpitz1 at acpi0: critical temperature is 99 degC acpibtn0 at acpi0: LID_ acpibtn1 at acpi0: SLPB acpibat0 at acpi0: BAT0 model "92P1139" serial 659 type LION oem "Panasonic" acpibat1 at acpi0: BAT1 not present acpiac0 at acpi0: AC unit online acpithinkpad0 at acpi0 acpidock0 at acpi0: GDCK docked (15) bios0: ROM list: 0xc0000/0xfe00 0xd0000/0x1000 0xd1000/0x1000 0xdc000/0x4000! 0xe0000/0x10000! cpu0: Enhanced SpeedStep 1995 MHz: speeds: 2000, 1667, 1333, 1000 MHz pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "Intel 82945GM Host" rev 0x03 ppb0 at pci0 dev 1 function 0 "Intel 82945GM PCIE" rev 0x03: apic 1 int 16 pci1 at ppb0 bus 1 radeondrm0 at pci1 dev 0 function 0 "ATI Radeon Mobility X1300 M52-64" rev 0x00 drm0 at radeondrm0 radeondrm0: apic 1 int 16 azalia0 at pci0 dev 27 function 0 "Intel 82801GB HD Audio" rev 0x02: msi azalia0: codecs: Analog Devices AD1981HD, Conexant/0x2bfa, using Analog Devices AD1981HD audio0 at azalia0 ppb1 at pci0 dev 28 function 0 "Intel 82801GB PCIE" rev 0x02: apic 1 int 20 pci2 at ppb1 bus 2 em0 at pci2 dev 0 function 0 "Intel 82573L" rev 0x00: msi, address 00:16:41:52:7e:81 ppb2 at pci0 dev 28 function 1 "Intel 82801GB PCIE" rev 0x02: apic 1 int 21 pci3 at ppb2 bus 3 wpi0 at pci3 dev 0 function 0 "Intel PRO/Wireless 3945ABG" rev 0x02: msi, MoW1, address 00:13:02:20:41:18 ppb3 at pci0 dev 28 function 2 "Intel 82801GB PCIE" rev 0x02: apic 1 int 22 pci4 at ppb3 bus 4 xhci0 at pci4 dev 0 function 0 "Renesas uPD720202 xHCI" rev 0x02: msi usb0 at xhci0: USB revision 3.0 uhub0 at usb0 "Renesas xHCI root hub" rev 3.00/1.00 addr 1 ppb4 at pci0 dev 28 function 3 "Intel 82801GB PCIE" rev 0x02: apic 1 int 23 pci5 at ppb4 bus 12 uhci0 at pci0 dev 29 function 0 "Intel 82801GB USB" rev 0x02: apic 1 int 16 uhci1 at pci0 dev 29 function 1 "Intel 82801GB USB" rev 0x02: apic 1 int 17 uhci2 at pci0 dev 29 function 2 "Intel 82801GB USB" rev 0x02: apic 1 int 18 uhci3 at pci0 dev 29 function 3 "Intel 82801GB USB" rev 0x02: apic 1 int 19 ehci0 at pci0 dev 29 function 7 "Intel 82801GB USB" rev 0x02: apic 1 int 19 usb1 at ehci0: USB revision 2.0 uhub1 at usb1 "Intel EHCI root hub" rev 2.00/1.00 addr 1 ppb5 at pci0 dev 30 function 0 "Intel 82801BAM Hub-to-PCI" rev 0xe2 pci6 at ppb5 bus 21 cbb0 at pci6 dev 0 function 0 "TI PCI1510 CardBus" rev 0x00: apic 1 int 16 cardslot0 at cbb0 slot 0 flags 0 cardbus0 at cardslot0: bus 22 device 0 cacheline 0x8, lattimer 0xb0 pcmcia0 at cardslot0 ichpcib0 at pci0 dev 31 function 0 "Intel 82801GBM LPC" rev 0x02: PM disabled pciide0 at pci0 dev 31 function 1 "Intel 82801GB IDE" rev 0x02: DMA, channel 0 configured to compatibility, channel 1 configured to compatibility atapiscsi0 at pciide0 channel 0 drive 0 scsibus1 at atapiscsi0: 2 targets cd0 at scsibus1 targ 0 lun 0: <HL-DT-ST, DVDRAM GSA-U10N, 1.05> ATAPI 5/cdrom removable cd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2 pciide0: channel 1 ignored (disabled) ahci0 at pci0 dev 31 function 2 "Intel 82801GBM AHCI" rev 0x02: msi, AHCI 1.1 ahci0: port 0: 1.5Gb/s scsibus2 at ahci0: 32 targets sd0 at scsibus2 targ 0 lun 0: <ATA, INTEL SSDSC2CW24, 400i> SCSI3 0/direct fixed naa.5001517bb2a98d08 sd0: 228936MB, 512 bytes/sector, 468862128 sectors, thin ichiic0 at pci0 dev 31 function 3 "Intel 82801GB SMBus" rev 0x02: apic 1 int 23 iic0 at ichiic0 usb2 at uhci0: USB revision 1.0 uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1 usb3 at uhci1: USB revision 1.0 uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1 usb4 at uhci2: USB revision 1.0 uhub4 at usb4 "Intel UHCI root hub" rev 1.00/1.00 addr 1 usb5 at uhci3: USB revision 1.0 uhub5 at usb5 "Intel UHCI root hub" rev 1.00/1.00 addr 1 isa0 at ichpcib0 isadma0 at isa0 com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo pckbc0 at isa0 port 0x60/5 pckbd0 at pckbc0 (kbd slot) pckbc0: using irq 1 for kbd slot wskbd0 at pckbd0: console keyboard pms0 at pckbc0 (aux slot) pckbc0: using irq 12 for aux slot wsmouse0 at pms0 mux 0 pcppi0 at isa0 port 0x61 spkr0 at pcppi0 aps0 at isa0 port 0x1600/31 npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16 com4 at pcmcia0 function 0 "U.S. Robotics, XJ/CC1560, Megahertz 56kbps Modem" port 0xa3f8/8: ns16550a, 16 byte fifo com4: probed fifo depth: 0 bytes uhub6 at uhub1 port 6 "IBM product 0x4485" rev 2.00/0.01 addr 2 uhidev0 at uhub6 port 2 configuration 1 interface 0 "Logitech USB-PS/2 Optical Mouse" rev 2.00/20.00 addr 3 uhidev0: iclass 3/1 ums0 at uhidev0: 3 buttons, Z dir wsmouse1 at ums0 mux 0 uhidev1 at uhub6 port 3 configuration 1 interface 0 "Gravis GamePad Pro USB" rev 1.00/2.00 addr 4 uhidev1: iclass 3/0 uhid0 at uhidev1: input=4, output=0, feature=0 ugen0 at uhub5 port 2 "STMicroelectronics Biometric Coprocessor" rev 1.00/0.01 addr 2 vscsi0 at root scsibus3 at vscsi0: 256 targets softraid0 at root scsibus4 at softraid0: 256 targets sd1 at scsibus4 targ 1 lun 0: <OPENBSD, SR CRYPTO, 005> SCSI2 0/direct fixed sd1: 200595MB, 512 bytes/sector, 410819160 sectors softraid0: volume sd1 is roaming, it used to be sd2, updating metadata root on sd1a (bfe3b486511fab55.a) swap on sd1b dump on sd1b radeondrm0: 1600x1200 wsdisplay0 at radeondrm0 mux 1: console (std, vt100 emulation), using wskbd0 wsdisplay0: screen 1-5 added (std, vt100 emulation) wpi0: radio is disabled by hardware switch wpi0: could not initialize hardware