date:20151121

Re: Kernel 4.1.12 crash

2015-11-21 Thread Andrew

Memory corruption, if it is happening, IMHO shouldn't be
hardware-related: almost all of these boxes, except the H61M-based box
from the 1st log, have been running for a long time with uptimes of more
than a year, and only the software on them was changed; the H61M-based
box ran memtest86 for tens of hours without any error. If hardware were
the cause, they should have crashed much earlier.


Rarely, on different servers, I have seen 'zram decompression error'
messages (in this case I got such a message on the H61M-based box).


Also, other people who use accel-ppp as BRAS software see various
kernel panics/bugs/oopses on recent kernels.


I'll try to apply these patches, and I'll try to switch back to kernels 
that were stable on some boxes.


21.11.2015 01:13, Alexander Duyck wrote:

On 11/20/2015 05:58 AM, Andrew wrote:

Hi all.

Today some BRASes running the 4.1.12 kernel crashed.

Here's crash traces: http://pastebin.com/p68hNS8R
http://pastebin.com/36ieRAM2 http://pastebin.com/3BRTVEB6

On the 3.2 kernel the same hardware works OK; the trouble was noticed
after the kernel upgrade.

What additional info is needed?


Looking over the traces there seem to be two areas called out.

The first is the fib_trie resize BUG_ON that was triggered due to the 
parent and child not being associated.  I think that might be due to 
memory corruption as I cannot find any spots where we are resizing 
without correctly setting up the parent-child relationship of the 
nodes first.
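
The parent-child invariant described above can be pictured with a small
user-space sketch. This is not the kernel's fib_trie code: struct
demo_node, demo_put_child() and demo_link_is_consistent() are
hypothetical names, and the fixed fan-out of 2 is an assumption made
only to keep the example short.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define DEMO_FANOUT 2 /* assumption: tiny fixed fan-out for illustration */

/* Hypothetical node type; the kernel's trie nodes look different. */
struct demo_node {
    struct demo_node *parent;
    struct demo_node *child[DEMO_FANOUT];
};

/* Link child into slot idx and update the back pointer in the same step. */
void demo_put_child(struct demo_node *parent, size_t idx,
                    struct demo_node *child)
{
    assert(idx < DEMO_FANOUT);
    parent->child[idx] = child;
    if (child)
        child->parent = parent;
}

/* True if the node is a root or if its parent really points back at it. */
bool demo_link_is_consistent(const struct demo_node *node)
{
    const struct demo_node *parent = node->parent;
    size_t i;

    if (!parent)
        return true;
    for (i = 0; i < DEMO_FANOUT; i++)
        if (parent->child[i] == node)
            return true;
    return false;
}
```

If every update goes through a helper like demo_put_child(), the check
can only fail when something else overwrites a slot or a back pointer,
which is why a failing assertion of this kind points toward corruption
rather than toward the resize logic itself.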


The other spot that is showing up is ppp_shutdown_interface and its
related path.  It looks like there are a couple of patches you could
try back-porting to see if they resolve the issue.  If they do, then
perhaps they should be considered candidates for stable:


8cb775bc0a3 ("ppp: fix device unregistration upon netns deletion")
58a89ecaca5 ("ppp: fix lockdep splat in ppp_dev_uninit()")

- Alex


--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[patch] net/hsr: fix a warning message

2015-11-21 Thread Dan Carpenter

WARN_ON_ONCE() takes a condition, it doesn't take an error message.  I
have converted this to WARN() instead.

Signed-off-by: Dan Carpenter 

diff --git a/net/hsr/hsr_device.c b/net/hsr/hsr_device.c
index 35a9788..c7d1adc 100644
--- a/net/hsr/hsr_device.c
+++ b/net/hsr/hsr_device.c
@@ -312,7 +312,7 @@ static void send_hsr_supervision_frame(struct hsr_port *master, u8 type)
    return;
 
 out:
-   WARN_ON_ONCE("HSR: Could not send supervision frame\n");
+   WARN_ONCE(1, "HSR: Could not send supervision frame\n");
    kfree_skb(skb);
 }
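
The difference between the two macros can be illustrated outside the
kernel. The sketch below uses hypothetical user-space stand-ins
(MOCK_WARN_ON_ONCE, MOCK_WARN_ONCE, mock_report); the real macros also
dump a stack trace and are not reproduced here. It only shows that a
message string placed in the condition slot is a non-NULL pointer, so
the warning fires but the text is never printed.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

static char last_text[128]; /* text of the most recent mock warning */
static int warn_count;      /* number of warnings actually reported */

/* Hypothetical helper: report once per call site if cond is true. */
static void mock_report(bool *seen, bool cond, const char *fmt, ...)
{
    va_list ap;

    assert(seen != NULL && fmt != NULL);
    if (!cond || *seen)
        return;
    *seen = true;
    warn_count++;
    va_start(ap, fmt);
    vsnprintf(last_text, sizeof(last_text), fmt, ap);
    va_end(ap);
    fprintf(stderr, "WARNING: %s", last_text);
}

/* Condition only: there is no parameter that could carry a message. */
#define MOCK_WARN_ON_ONCE(cond) do {                                  \
        static bool seen_;                                            \
        mock_report(&seen_, (cond) ? true : false,                    \
                    "condition hit at line %d\n", __LINE__);          \
    } while (0)

/* Condition first, then a printf-style message. */
#define MOCK_WARN_ONCE(cond, ...) do {                                \
        static bool seen_;                                            \
        mock_report(&seen_, (cond) ? true : false, __VA_ARGS__);      \
    } while (0)

/* Before the patch: the message sits where the condition belongs. */
void send_failed_old(void)
{
    const char *text = "HSR: Could not send supervision frame\n";

    MOCK_WARN_ON_ONCE(text); /* non-NULL pointer, so always true */
}

/* After the patch: explicit condition, message passed separately. */
void send_failed_new(void)
{
    MOCK_WARN_ONCE(1, "HSR: Could not send supervision frame\n");
}

int warnings_reported(void)
{
    return warn_count;
}

bool last_warning_mentions(const char *needle)
{
    return strstr(last_text, needle) != NULL;
}
```

Calling send_failed_old() reports a warning that contains only the
generic location text, while send_failed_new() reports the HSR message;
repeated calls from the same site stay silent in both cases.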
 

Linux 3.16.7: WARNING at kernfs_get+0x2a/0x30() and ida_remove+0xdd/0x120() in loop

2015-11-21 Thread Eugene A. Kravtsov

Last week I ran into a problem after updating the kernel on my PPPoE BRAS
server from Linux 3.2.0-4-686-pae #1 SMP Debian 3.2.68-1+deb7u1 i686 GNU/Linux
to 3.16.0-4-686-pae #1 Debian 3.16.7-ckt11-1+deb8u5.
At arbitrary times, for no apparent reason and under any load, the server
goes down with an oops with the following trace:
http://spec.oborona.net/bras_panic

The server software is rp-pppoe + pppoe + tc htb shapers on ppp interfaces,
called from the ip-ip script. No special settings are in use.

I tried installing the newer Debian kernel 4.2.6, but that did not solve the
problem. Things got worse: the server ran on 4.2.6 for about 15 minutes and
then panicked (no trace; the network rsyslog is empty).

After that, I decided to try a stable kernel from kernel.org, 4.2.3,
and saw the same panic, with no network logs, as with 4.2.6.

Many technical staff at Russian and UA ISPs have confirmed these problems
with ppp in all new kernels (accel-ppp and rp-pppoe).
https://translate.google.ru/translate?hl=ru&sl=ru&tl=en&u=http%3A%2F%2Fforum.nag.ru%2Fforum%2Findex.php%3Fshowtopic%3D45266%26st%3D6220
A UA colleague reported the same problem:
http://www.spinics.net/lists/netdev/msg352992.html

The last kernel with no problems, for me, is 3.2.68-1+deb7u1 i686 GNU/Linux.
With the 3.2 kernel the server stays up indefinitely.

Can I help with additional information?
