Am 22.07.2015 um 09:23 schrieb Stefan Priebe - Profihost AG:

Am 21.07.2015 um 23:15 schrieb Thomas Gleixner:
On Tue, 21 Jul 2015, Stefan Priebe wrote:
Am 20.07.2015 um 12:53 schrieb Thomas Gleixner:
On Mon, 20 Jul 2015, Stefan Priebe - Profihost AG wrote:
Hello list,

i've 36 servers all running vanilla 3.18.18 kernel which have a very
high disk and network load.

Since a few days i encounter regular the following error messages and
pretty often completely hanging disk i/o:
[535040.439859] do_IRQ: 0.126 No irq handler for vector (irq -1)

All systems are Single E5 Xeons and I'm running irqbalance on them.

Does it stop if you disable irqbalance ?

No. The machines still crash.

crash as in running into a BUG? Or is it just that disk I/O is stalled?

Sorry i meant I/O is stalled. It crashes to me as i can't login anymore
due to hanging I/O.

Can you please provide the full dmesg output of such a machine?

Yes (this time from a machine using 3.18.14) =>
http://pastebin.com/raw.php?i=S6kAk0iS

I'll cook up a debug patch for that against 3.18.18.

Do you have any special upstream commits in mind?

Stefan


That would be great!

Stefan

Thanks,

        tglx

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to