Greetings,

I administer a web server for my institution, and last night we had a problem. The server is a 1U Intel Xeon E5620 machine with an on-board Intel 82574L Gigabit Controller, running Scientific Linux 6.4 with kernel 2.6.32-431.1.2.el6.x86_64. At some point last night the network interface stopped working, and the kernel logged a backtrace in dev_watchdog. I could not restart the network service; it complained that the interface eth0 was not available. I also tried to reconfigure the interface with NetworkManager, without success. A full system reboot restored the network, but I could not identify the cause. I do not know if this matters, but this problem never occurred before the last yum update.

Below is the portion of /var/log/messages that relates to the problem:
=================================================================
Dec 25 20:01:52 veritasm xinetd[1966]: EXIT: nrpe status=0 pid=20943 duration=0(sec)
Dec 25 20:02:21 veritasm xinetd[1966]: START: nrpe pid=20947 from=::ffff:199.104.151.131
Dec 25 20:02:21 veritasm xinetd[1966]: EXIT: nrpe status=0 pid=20947 duration=0(sec)
Dec 26 02:18:37 veritasm kernel: ------------[ cut here ]------------
Dec 26 02:18:37 veritasm kernel: WARNING: at net/sched/sch_generic.c:261 dev_watchdog+0x26b/0x280() (Not tainted)
Dec 26 02:18:37 veritasm kernel: Hardware name: X8DTL
Dec 26 02:18:37 veritasm kernel: NETDEV WATCHDOG: eth0 (e1000e): transmit queue 0 timed out
Dec 26 02:18:37 veritasm kernel: Modules linked in: autofs4 8021q sunrpc garp stp llc cpufreq_ondemand acpi_cpufreq freq_table mperf ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 microcode iTCO_wdt iTCO_vendor_support sg i2c_i801 i2c_core lpc_ich mfd_core e1000e ptp pps_core ioatdma dca i7core_edac edac_core shpchp ext4 jbd2 mbcache raid1 sr_mod cdrom sd_mod crc_t10dif pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
Dec 26 02:18:37 veritasm kernel: Pid: 130, comm: kipmi0 Not tainted 2.6.32-431.1.2.el6.x86_64 #1
Dec 26 02:18:37 veritasm kernel: Call Trace:
Dec 26 02:18:37 veritasm kernel: <IRQ> [<ffffffff81071e27>] ? warn_slowpath_common+0x87/0xc0
Dec 26 02:18:37 veritasm kernel: [<ffffffff81071f16>] ? warn_slowpath_fmt+0x46/0x50
Dec 26 02:18:37 veritasm kernel: [<ffffffff8147b75b>] ? dev_watchdog+0x26b/0x280
Dec 26 02:18:37 veritasm kernel: [<ffffffff8105dd5c>] ? scheduler_tick+0xcc/0x260
Dec 26 02:18:37 veritasm kernel: [<ffffffff8147b4f0>] ? dev_watchdog+0x0/0x280
Dec 26 02:18:37 veritasm kernel: [<ffffffff81084b07>] ? run_timer_softirq+0x197/0x340
Dec 26 02:18:37 veritasm kernel: [<ffffffff810ac905>] ? tick_dev_program_event+0x65/0xc0
Dec 26 02:18:37 veritasm kernel: [<ffffffff8107a8e1>] ? __do_softirq+0xc1/0x1e0
Dec 26 02:18:37 veritasm kernel: [<ffffffff810ac9da>] ? tick_program_event+0x2a/0x30
Dec 26 02:18:37 veritasm kernel: [<ffffffff8100c30c>] ? call_softirq+0x1c/0x30
Dec 26 02:18:37 veritasm kernel: [<ffffffff8100fa75>] ? do_softirq+0x65/0xa0
Dec 26 02:18:37 veritasm kernel: [<ffffffff8107a795>] ? irq_exit+0x85/0x90
Dec 26 02:18:37 veritasm kernel: [<ffffffff815310ba>] ? smp_apic_timer_interrupt+0x4a/0x60
Dec 26 02:18:37 veritasm kernel: [<ffffffff8100bb93>] ? apic_timer_interrupt+0x13/0x20
Dec 26 02:18:37 veritasm kernel: <EOI> [<ffffffff8152a367>] ? _spin_unlock_irqrestore+0x17/0x20
Dec 26 02:18:37 veritasm kernel: [<ffffffff812e7790>] ? ipmi_thread+0x70/0x1c0
Dec 26 02:18:37 veritasm kernel: [<ffffffff812e7720>] ? ipmi_thread+0x0/0x1c0
Dec 26 02:18:37 veritasm kernel: [<ffffffff8109af06>] ? kthread+0x96/0xa0
Dec 26 02:18:37 veritasm kernel: [<ffffffff8100c20a>] ? child_rip+0xa/0x20
Dec 26 02:18:37 veritasm kernel: [<ffffffff8109ae70>] ? kthread+0x0/0xa0
Dec 26 02:18:37 veritasm kernel: [<ffffffff8100c200>] ? child_rip+0x0/0x20
Dec 26 02:18:37 veritasm kernel: ---[ end trace fc057a7fca6eff49 ]---
Dec 26 02:18:37 veritasm kernel: e1000e 0000:06:00.0: eth0: Reset adapter unexpectedly
Dec 26 02:18:37 veritasm NetworkManager[1724]: <info> (eth0): carrier now OFF (device state 8, deferring action for 4 seconds)
Dec 26 02:18:38 veritasm kernel: e1000e 0000:06:00.0: eth0: Timesync Tx Control register not set as expected
Dec 26 02:18:41 veritasm NetworkManager[1724]: <info> (eth0): device state change: 8 -> 2 (reason 40)
Dec 26 02:18:41 veritasm NetworkManager[1724]: <info> (eth0): deactivating device (reason: 40).
Dec 26 02:18:43 veritasm ntpd[1974]: Deleting interface #5 eth0, 199.104.151.141#123, interface stats: received=789, sent=886, dropped=0, active_time=232100 secs
Dec 26 08:58:36 veritasm kernel: fuse init (API version 7.13)
Dec 26 08:58:36 veritasm rtkit-daemon[2379]: Sucessfully made thread 23005 of process 23005 (/usr/bin/pulseaudio) owned by '500' high priority at nice level -11.
Dec 26 08:58:37 veritasm rtkit-daemon[2379]: Sucessfully made thread 23039 of process 23039 (/usr/bin/pulseaudio) owned by '500' high priority at nice level -11.
Dec 26 08:58:37 veritasm pulseaudio[23039]: pid.c: Daemon already running.
=====================================================================

And the dmesg portion too:

======================================================================
e1000e 0000:06:00.0: eth0: registered PHC clock
e1000e 0000:06:00.0: eth0: (PCI Express:2.5GT/s:Width x1) 00:25:90:c2:ec:00
e1000e 0000:06:00.0: eth0: Intel(R) PRO/1000 Network Connection
e1000e 0000:06:00.0: eth0: MAC: 3, PHY: 8, PBA No: 0101FF-0FF
ADDRCONF(NETDEV_UP): eth0: link is not ready
e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
8021q: adding VLAN 0 to HW filter on device eth0
eth0: no IPv6 routers present
NETDEV WATCHDOG: eth0 (e1000e): transmit queue 0 timed out
e1000e 0000:06:00.0: eth0: Reset adapter unexpectedly
e1000e 0000:06:00.0: eth0: Timesync Tx Control register not set as expected
=======================================================================

Hope somebody has an idea of where the problem might be.

Regards,
Nicola Galante