[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-23 Thread Launchpad Bug Tracker
This bug was fixed in the package irqbalance - 1.0.6-2ubuntu0.14.04.4

---
irqbalance (1.0.6-2ubuntu0.14.04.4) trusty; urgency=medium

  * d/p/NUMA-is-not-available-fix.patch: Avoid crashes when NUMA
is not available. (LP: #1469214)

 -- dann frazier   Thu, 10 Sep 2015 13:11:21
-0600

** Changed in: irqbalance (Ubuntu Trusty)
   Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-23 Thread Launchpad Bug Tracker
This bug was fixed in the package irqbalance - 1.0.6-3ubuntu1.1

---
irqbalance (1.0.6-3ubuntu1.1) vivid; urgency=medium

  * d/p/NUMA-is-not-available-fix.patch: Avoid crashes when NUMA
is not available. (LP: #1469214)

 -- dann frazier   Thu, 10 Sep 2015 13:01:56 -0600

** Changed in: irqbalance (Ubuntu Vivid)
   Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-21 Thread Colin Ian King
Tested with wily 1.0.6-3ubuntu3, bug fixed.

** Tags removed: verification-needed
** Tags added: verification-done

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-21 Thread Colin Ian King
Tested with vivid 1.0.6-3ubuntu1.1, bug is fixed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-21 Thread Colin Ian King
Andrew, I'm still testing it for vivid, will be done in a few hours.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-21 Thread Andrew Cloke
Great! Many thanks...

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-21 Thread Colin Ian King
I've tested 1.0.6-2ubuntu0.14.04.4 for several hours and the problem is
fixed, I can't reproduce this at all.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-21 Thread Andrew Cloke
Hi Colin, I believe you now have access to the necessary hardware, but
please let me know if this is still an issue. Thanks.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-11 Thread Colin Ian King
Hi there,

I need access to the machine to test this,  any hints on the machine
name and how to access it would be useful. Thanks.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-11 Thread Launchpad Bug Tracker
** Branch linked: lp:ubuntu/vivid-proposed/irqbalance

** Branch linked: lp:ubuntu/trusty-proposed/irqbalance

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-11 Thread Timo Aaltonen
Hello Colin, or anyone else affected,

Accepted irqbalance into vivid-proposed. The package will build now and
be available at
https://launchpad.net/ubuntu/+source/irqbalance/1.0.6-3ubuntu1.1 in a
few hours, and then in the -proposed repository.

Please help us by testing this new package.  See
https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to
enable and use -proposed.  Your feedback will aid us getting this update
out to other Ubuntu users.

If this package fixes the bug for you, please add a comment to this bug,
mentioning the version of the package you tested, and change the tag
from verification-needed to verification-done. If it does not fix the
bug for you, please add a comment stating that, and change the tag to
verification-failed.  In either case, details of your testing will help
us make a better decision.

Further information regarding the verification process can be found at
https://wiki.ubuntu.com/QATeam/PerformingSRUVerification .  Thank you in
advance!

** Changed in: irqbalance (Ubuntu Vivid)
   Status: In Progress => Fix Committed

** Tags added: verification-needed

** Changed in: irqbalance (Ubuntu Trusty)
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-10 Thread dann frazier
** Changed in: irqbalance (Ubuntu Vivid)
   Status: Triaged => In Progress

** Changed in: irqbalance (Ubuntu Trusty)
   Status: Triaged => In Progress

** Changed in: irqbalance (Ubuntu Utopic)
   Status: Triaged => Won't Fix

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-09 Thread Launchpad Bug Tracker
This bug was fixed in the package irqbalance - 1.0.6-3ubuntu3

---
irqbalance (1.0.6-3ubuntu3) wily; urgency=medium

  * d/p/NUMA-is-not-available-fix.patch: Avoid crashes when NUMA
is not available. (LP: #1469214)

 -- dann frazier   Wed, 09 Sep 2015 17:35:26 -0600

** Changed in: irqbalance (Ubuntu Wily)
   Status: In Progress => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-09 Thread Launchpad Bug Tracker
** Branch linked: lp:ubuntu/wily-proposed/irqbalance

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-09-09 Thread dann frazier
** Changed in: irqbalance (Ubuntu Wily)
   Status: Triaged => In Progress

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-16 Thread Ming Lei
Dann,

I have figured out patches for fixing wily kernel, see following link:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1474171/comments/4

so you can reproduce the issue on a totally clean wily distribution, :-)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-14 Thread Ming Lei
> I prepared a wily package w/ the proposed upstream backport for testing:
>   lp:~dannf/ubuntu/wily/irqbalance/lp1469214

> Unfortunately, I'm still seeing irqbalance crash even with this
backport:

I guess you still test irqbalance on c33, looks that upgrade from trusty isn't 
good, and
I can see lots of this kind of falut in different processes(sshd, stress-ng, 
systemd...)
just after a fresh boot with irqbalance disabled(see attachment), and sounds 
like a bad upgrade.


If you verify the patch on trusty/utopic/vivid, it does fix the issue according 
to my tests.

** Attachment added: "wily.log"
   
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+attachment/4429049/+files/wily.log

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-14 Thread dann frazier
Ming was able to help me reliable reproduce this with the command:
  stress-ng --sequential 0 --seq-start 86 --seq-end 90 -t 60 --syslog --metrics 
--times -v

I prepared a wily package w/ the proposed upstream backport for testing:
   lp:~dannf/ubuntu/wily/irqbalance/lp1469214

Unfortunately, I'm still seeing irqbalance crash even with this
backport:

[ 2461.635168] irqbalance[558]: unhandled input address range fault (11) at 
0x20202020202034, esr 0x9204
[ 2461.635175] pgd = ffcfab3f3000
[ 2461.675979] [20202020202034] *pgd=

[ 2461.733566] CPU: 4 PID: 558 Comm: irqbalance Not tainted 3.13.0-57-generic 
#95-Ubuntu
[ 2461.733570] task: ffcfa9cdcd00 ti: ffcfa9df8000 task.ti: 
ffcfa9df8000
[ 2461.733577] PC is at 0x40605c
[ 2461.733580] LR is at 0x4040e4
[ 2461.733582] pc : [<0040605c>] lr : [<004040e4>] pstate: 
8000
[ 2461.733584] sp : 007fd95cf7a0
[ 2461.733585] x29: 007fd95cf7a0 x28: 0041a000 
[ 2461.733588] x27: 0041a000 x26: 00409510 
[ 2461.733591] x25: 0041a000 x24: 00405000 
[ 2461.733593] x23: 0041acf8 x22: 0041a000 
[ 2461.733596] x21: 14ab0130 x20: 0041a000 
[ 2461.733598] x19: 14a9f0e0 x18:  
[ 2461.733601] x17: 007fa72118ec x16: 007fa75a72e0 
[ 2461.733603] x15: 003bcfb11b54656b x14: 2030203020302030 
[ 2461.733606] x13: 2030203020302030 x12: 2030203020302030 
[ 2461.733608] x11: 2030203020302030 x10: 2030203020302030 
[ 2461.733611] x9 : 2030203020302030 x8 : 14a9bc80 
[ 2461.733613] x7 : 0020 x6 : 14a9bc90 
[ 2461.733616] x5 : 0001 x4 : 007fa722a2a0 
[ 2461.733618] x3 : 14a9b880 x2 : 0001 
[ 2461.733620] x1 : 4320202020202020 x0 : 3355000a

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-14 Thread Launchpad Bug Tracker
** Branch linked: lp:~dannf/ubuntu/wily/irqbalance/lp1469214

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-13 Thread Ming Lei
On Mon, Jul 13, 2015 at 9:27 AM, Ming Lei <1469...@bugs.launchpad.net> wrote:
> Dann,
>
> Please follow the steps in #12, in which you should trigger the crash in
> 4 minutes.

> I've been running that in a loop and I'm currently on iteration #76
> w/o a crash :(

The issue is nothing to do with kernel, and it should be made sure that 
irqbalance
is running first.

I can reproduce the issue on trusty, utopic and vivid easily with the
approach in #12.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-13 Thread Ming Lei
> BTW, looks wily kernel can't boot to shell prompt on mcdivitt.

That kernel(v4.0) isn't the final kernel for wily, so do we need to pay
attention to that?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-13 Thread dann frazier
On Mon, Jul 13, 2015 at 9:27 AM, Ming Lei <1469...@bugs.launchpad.net> wrote:
> Dann,
>
> Please follow the steps in #12, in which you should trigger the crash in
> 4 minutes.

I've been running that in a loop and I'm currently on iteration #76
w/o a crash :(

Maybe it's
Linux ms10-33-mcdivittB0 3.19.0-22-generic #22-Ubuntu SMP Tue Jun 16
17:18:17 UTC 2015 aarch64 aarch64 aarch64 GNU/Linux

> BTW, looks wily kernel can't boot to shell prompt on mcdivitt.

OK - mind filing a separate bug for that?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-13 Thread Ming Lei
Dann,

Please follow the steps in #12, in which you should trigger the crash in
4 minutes.

BTW, looks wily kernel can't boot to shell prompt on mcdivitt.

Thanks,

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-09 Thread dann frazier
On Tue, Jul 7, 2015 at 2:25 AM, Ming Lei <1469...@bugs.launchpad.net> wrote:
> On Tue, Jul 7, 2015 at 11:16 AM, Ming Lei  wrote:
>> Looks there are two kinds of translation fault from irqbalance:
>>
>> 1) happend in place_irq_in_node() which can reproduce in vivid package
>>
>> 2) the 2nd one happened in glib2, which  is built by myself, because
>> irqbalance can choose to use its own local glib if there isn't glib2 
>> available,
>> and the glib2 does exist in my server in which I build irqbalance.
>
>
> Both of two above reports can be fixed by the following irqbalance commit:
>
> NUMA is not available fix
>
> https://github.com/Irqbalance/irqbalance/commit/a3c812eb6cd627cd3fae45b8345538558b86973c
>
> Looks stress-ng can't only find kernel bug, but also userspace
> issue, :-)

I was looking to upload a fix for wily, but I haven't been able to
reproduce it to in order to verify the fix. I ran 'stress-ng --seq 0
-t 60 --syslog --metrics --times -v' overnight in a loop, but
irqbalance never crashed. How long should I expect this to take on
average? Does it usually crash in a single run?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-08 Thread Ming Lei
** Description changed:

- Running stress-ng on a HP ProLiant m400 server can cause unhandled level
- 3 translations faults:
+ 
+ [Impact]
+ irqbalance can be crashed(got signal of segment fault) on trusty, utopic, 
vivid and wily.
+ 
+ [Test Case]
+ stress-ng --seq 0 -t 60 --syslog --metrics --times -v
+ 
+ [Regression Potential]
+ The proposed patch has been merged irqbalance upstream 1.0.7, so there 
shouldn't be potential regression.
+ 
+ 
https://github.com/Irqbalance/irqbalance/commit/a3c812eb6cd627cd3fae45b8345538558b86973c
+ 
+ 
+ [Other Info]
+ 
+ See following about the segmentation fault log.
+ 
+ 
+ 
+ 
+ Running stress-ng on a HP ProLiant m400 server can cause unhandled level 3 
translations faults:
  
  use stress-ng from git://kernel.ubuntu.com/cking/stress-ng
  
  ./stress-ng --seq 0 -t 60 -v
  
  and after some time this trips the following:
  
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922560] 
systemd-timesyn[481]: unhandled level 3 translation fault (7) at 0x7fa8ea6008, 
esr 0x9207
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922561] pgd = 
ffcfb563f000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922563] [7fa8ea6008] 
*pgd=004fb4f28003, *pud=004fb4f28003, *pmd=004fb4f38003, 
*pte=1d151c00
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922566]
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922569] CPU: 6 PID: 481 
Comm: systemd-timesyn Not tainted 3.19.0-21-generic #21-Ubuntu
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922571] Hardware name: HP 
ProLiant m400 Server Cartridge (DT)
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922573] task: 
ffcfb4e3b100 ti: ffcfb4d2c000 task.ti: ffcfb4d2c000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922588] PC is at 0x7fa8d81824
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922589] LR is at 0x7fa8e3b3e4
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922590] pc : 
[<007fa8d81824>] lr : [<007fa8e3b3e4>] pstate: 8000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922591] sp : 007ff120d660
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922592] x29: 
007ff120d660 x28: 007fa8f1c000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922594] x27: 
007fa8f32084 x26: 007fa8f32000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922595] x25: 
007fa8f1d788 x24: 007fa8f1d888
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922597] x23: 
0001 x22: 007fa8f1faa0
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922599] x21: 
007ff120d7f0 x20: 007ff120d7d0
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922600] x19: 
007fa8f31000 x18: 007fa8f1e000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922602] x17: 
007fa8e3b3b8 x16: 007fa8ea6000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922603] x15: 
003b9aca x14: 00219bbdd000
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922605] x13: 
aa751223 x12: 
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922607] x11: 
0101010101010101 x10: 7f7f7f7f7f7f7f7f
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922609] x9 : 
37333c43484f5e46 x8 : 007ff120d818
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922610] x7 : 
007ff120d8f0 x6 : 007ff120d828
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922612] x5 : 
ff80ffd0 x4 : 007ff120d8c0
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922613] x3 : 
007ff120d7d0 x2 : 007fa8f1faa0
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922615] x1 : 
0001 x0 : 0064
  Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922616]

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Alberto Salvia Novella
** Changed in: irqbalance (Ubuntu Trusty)
   Status: Confirmed => Triaged

** Changed in: irqbalance (Ubuntu Utopic)
   Status: Confirmed => Triaged

** Changed in: irqbalance (Ubuntu Vivid)
   Status: Confirmed => Triaged

** Changed in: irqbalance (Ubuntu Wily)
   Status: Confirmed => Triaged

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Alberto Salvia Novella
** Changed in: irqbalance (Ubuntu Trusty)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: irqbalance (Ubuntu Utopic)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: irqbalance (Ubuntu Vivid)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: irqbalance (Ubuntu Wily)
 Assignee: (unassigned) => dann frazier (dannf)

** Changed in: irqbalance (Ubuntu Trusty)
   Importance: Undecided => Medium

** Changed in: irqbalance (Ubuntu Utopic)
   Importance: Undecided => Medium

** Changed in: irqbalance (Ubuntu Vivid)
   Importance: Undecided => Medium

** Changed in: irqbalance (Ubuntu Wily)
   Importance: Undecided => Medium

** No longer affects: linux (Ubuntu)

** No longer affects: linux (Ubuntu Trusty)

** No longer affects: linux (Ubuntu Utopic)

** No longer affects: linux (Ubuntu Vivid)

** No longer affects: linux (Ubuntu Wily)

** Tags added: trusty utopic vivid wily

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: irqbalance (Ubuntu Utopic)
   Status: New => Confirmed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: irqbalance (Ubuntu Trusty)
   Status: New => Confirmed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: irqbalance (Ubuntu)
   Status: New => Confirmed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Ming Lei
** Changed in: irqbalance (Ubuntu Vivid)
   Status: In Progress => Confirmed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Ubuntu Foundations Team Bug Bot
The attachment "0001-stress-ng-support-sequential-range.patch" seems to
be a patch.  If it isn't, please remove the "patch" flag from the
attachment, remove the "patch" tag, and if you are a member of the
~ubuntu-reviewers, unsubscribe the team.

[This is an automated message performed by a Launchpad user owned by
~brian-murray, for any issues please contact him.]

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Andrew Cloke
Following Ming's identification of an irqbalance patch that fixes this
issue, I'm marking the "Affected" status on "linux (Ubuntu)" as being
"invalid".

** Changed in: linux (Ubuntu Trusty)
   Status: New => Invalid

** Changed in: linux (Ubuntu Utopic)
   Status: New => Invalid

** Changed in: linux (Ubuntu Vivid)
   Status: New => Invalid

** Changed in: linux (Ubuntu Wily)
   Status: Triaged => Invalid

** Changed in: irqbalance (Ubuntu Vivid)
   Status: New => In Progress

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread dann frazier
** Also affects: irqbalance (Ubuntu Trusty)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Trusty)
   Importance: Undecided
   Status: New

** Also affects: irqbalance (Ubuntu Utopic)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Utopic)
   Importance: Undecided
   Status: New

** Also affects: irqbalance (Ubuntu Wily)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Wily)
   Importance: Medium
 Assignee: dann frazier (dannf)
   Status: Triaged

** Also affects: irqbalance (Ubuntu Vivid)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Vivid)
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Andrew Cloke
** Also affects: irqbalance (Ubuntu)
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Colin Ian King
Thanks Ming for finding the fix. I was going to do a bisect on the
upstream code but ran out of time last night.  Nice find!

Colin

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-07 Thread Ming Lei
On Tue, Jul 7, 2015 at 11:16 AM, Ming Lei  wrote:
> Looks there are two kinds of translation fault from irqbalance:
>
> 1) happend in place_irq_in_node() which can reproduce in vivid package
>
> 2) the 2nd one happened in glib2, which  is built by myself, because
> irqbalance can choose to use its own local glib if there isn't glib2 
> available,
> and the glib2 does exist in my server in which I build irqbalance.


Both of two above reports can be fixed by the following irqbalance commit:

NUMA is not available fix

https://github.com/Irqbalance/irqbalance/commit/a3c812eb6cd627cd3fae45b8345538558b86973c

Looks stress-ng can't only find kernel bug, but also userspace
issue, :-)

Thanks,
Ming

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-06 Thread Ming Lei
Looks there are two kinds of translation fault from irqbalance:

1) happend in place_irq_in_node() which can reproduce in vivid package

2) the 2nd one happened in glib2, which  is built by myself, because
irqbalance can choose to use its own local glib if there isn't glib2 available,
and the glib2 does exist in my server in which I build irqbalance.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-06 Thread Ming Lei
On Tue, Jul 7, 2015 at 2:37 AM, Colin Ian King
<1469...@bugs.launchpad.net> wrote:
> captured irqbalance segfaulting:
>
> Program received signal SIGSEGV, Segmentation fault.
> 0x00408f8c in place_irq_in_node (info=0x2c3d0050, data=0x0) at 
> placement.c:145
> 145 if (irq_numa_node(info)->number != -1) {
> (gdb) where
> #0  0x00408f8c in place_irq_in_node (info=0x2c3d0050, data=0x0) at 
> placement.c:145
> #1  0x00405154 in for_each_irq (list=0x2c3df660, cb=0x408f4c 
> , data=0x0)
> at classify.c:508
> #2  0x0040923c in calculate_placement () at placement.c:196
> #3  0x00407800 in main (argc=2, argv=0x7fcd014928) at irqbalance.c:372
>
> (gdb) print info
> $1 = (struct irq_info *) 0x2c3d0050

Suppose info is one address in heap, then it is valid, and the segfault
should be caused by invalid info->numa_node.

Thanks

>
> --
> You received this bug notification because you are subscribed to linux
> in Ubuntu.
> https://bugs.launchpad.net/bugs/1469214
>
> Title:
>   HP ProLiant m400 Server crashes with unhandled level 3 translation
>   fault
>
> Status in linux package in Ubuntu:
>   Triaged
>
> Bug description:
>   Running stress-ng on a HP ProLiant m400 server can cause unhandled
>   level 3 translations faults:
>
>   use stress-ng from git://kernel.ubuntu.com/cking/stress-ng
>
>   ./stress-ng --seq 0 -t 60 -v
>
>   and after some time this trips the following:
>
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922560] 
> systemd-timesyn[481]: unhandled level 3 translation fault (7) at 
> 0x7fa8ea6008, esr 0x9207
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922561] pgd = 
> ffcfb563f000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922563] [7fa8ea6008] 
> *pgd=004fb4f28003, *pud=004fb4f28003, *pmd=004fb4f38003, 
> *pte=1d151c00
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922566]
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922569] CPU: 6 PID: 481 
> Comm: systemd-timesyn Not tainted 3.19.0-21-generic #21-Ubuntu
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922571] Hardware name: HP 
> ProLiant m400 Server Cartridge (DT)
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922573] task: 
> ffcfb4e3b100 ti: ffcfb4d2c000 task.ti: ffcfb4d2c000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922588] PC is at 
> 0x7fa8d81824
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922589] LR is at 
> 0x7fa8e3b3e4
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922590] pc : 
> [<007fa8d81824>] lr : [<007fa8e3b3e4>] pstate: 8000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922591] sp : 
> 007ff120d660
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922592] x29: 
> 007ff120d660 x28: 007fa8f1c000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922594] x27: 
> 007fa8f32084 x26: 007fa8f32000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922595] x25: 
> 007fa8f1d788 x24: 007fa8f1d888
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922597] x23: 
> 0001 x22: 007fa8f1faa0
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922599] x21: 
> 007ff120d7f0 x20: 007ff120d7d0
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922600] x19: 
> 007fa8f31000 x18: 007fa8f1e000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922602] x17: 
> 007fa8e3b3b8 x16: 007fa8ea6000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922603] x15: 
> 003b9aca x14: 00219bbdd000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922605] x13: 
> aa751223 x12: 
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922607] x11: 
> 0101010101010101 x10: 7f7f7f7f7f7f7f7f
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922609] x9 : 
> 37333c43484f5e46 x8 : 007ff120d818
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922610] x7 : 
> 007ff120d8f0 x6 : 007ff120d828
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922612] x5 : 
> ff80ffd0 x4 : 007ff120d8c0
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922613] x3 : 
> 007ff120d7d0 x2 : 007fa8f1faa0
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922615] x1 : 
> 0001 x0 : 0064
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922616]
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-06 Thread Colin Ian King
captured irqbalance segfaulting:

Program received signal SIGSEGV, Segmentation fault.
0x00408f8c in place_irq_in_node (info=0x2c3d0050, data=0x0) at 
placement.c:145
145 if (irq_numa_node(info)->number != -1) {
(gdb) where
#0  0x00408f8c in place_irq_in_node (info=0x2c3d0050, data=0x0) at 
placement.c:145
#1  0x00405154 in for_each_irq (list=0x2c3df660, cb=0x408f4c 
, data=0x0)
at classify.c:508
#2  0x0040923c in calculate_placement () at placement.c:196
#3  0x00407800 in main (argc=2, argv=0x7fcd014928) at irqbalance.c:372

(gdb) print info
$1 = (struct irq_info *) 0x2c3d0050

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-06 Thread Ubuntu Foundations Team Bug Bot
** Tags added: patch

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-06 Thread Ming Lei
On Mon, Jul 6, 2015 at 9:28 PM, Colin Ian King
<1469...@bugs.launchpad.net> wrote:
> I re-ran this today with the following script as a non-root user:
>
> #!/bin/bash
> tests="affinity aio bigheap brk bsearch cache chdir chmod clock context cpu 
> crypt dentry dir dup epoll eventfd fstat fallocate fault fifo flock fork 
> futex get getrandom hdd hsearch inotify io itimer kcmp kill lease link lockf 
> longjmp lsearch malloc matrix memcpy memfd mincore mlock mmap mmapmany mremap 
> msg mq nice null open pipe poll procfs pthread qsort readahead rename rlimit 
> seek sem sem-sysv sendfile shm-sysv sigfd sigfpe sigq sigsegv sock splice 
> stack str switch symlink sysinfo sysfs tee timer timerfd tsearch udp 
> udp-flood urandom utime vecmath vfork vm vm-rw vm-splice wcs wait yield xattr 
> zero zombie"
>
> for t in $tests
> do
> echo $t
> echo $t | sudo tee /dev/kmsg
> ./stress-ng --$t 0 -v -t 60
> done
>
> and hit this issue:
>
> [14098.848615] urandom
> [14111.696335] irqbalance[828]: unhandled level 2 translation fault (11) at 
> 0x4f64, esr 0x9206
> [14111.696341] pgd = ffcfef71b000
> [14111.737149] [4f64] *pgd=004fef1f3003, *pud=004fef1f3003, 
> *pmd=
>

As I suggested, it should be helpful to provide /proc/$(pidof
irqbalance)/maps, otherwise we can't know where both the faulted
and PC address are.

Finally I have figured out one simple way to reproduce the issue:

1) apply the attached debug patch to stress-ng

2) run the following script:

sudo cat /proc/$(pidof irqbalance)/maps
/home/ubuntu/git/stress-ng/stress-ng --sequential 0 --seq-start 80
--seq-end 84 -t 60 --syslog --metrics --times -v

And the above command just runs the following 4 stresses in 4 minutes:

stress-ng: info:  [1067] dispatching hogs: 8 tsearch, 8 udp, 8 udp-flood,
8  urandom

3) the above may trigger the following faults from irqbalance with
~3/4 probability, and the faulted address is in heap, and PC points to
code of libglib-2.0.so, so looks like a use-after-free in irqbalance or
libglib? And no information shows it is related with kernel, also
the four stresses are quite simple and shouldn't cause trouble to
kernel.


# irqbalance memory maps
0040-0040a000 r-xp  08:02 10496929
  /usr/sbin/irqbalance
00419000-0041a000 r-xp 9000 08:02 10496929
  /usr/sbin/irqbalance
0041a000-0041b000 rwxp a000 08:02 10496929
  /usr/sbin/irqbalance
16294000-162b5000 rwxp  00:00 0  [heap]
162b5000-162ce000 rwxp  00:00 0  [heap]
7f8fbf9000-7f8fbfb000 rwxp  00:00 0
7f8fbfb000-7f8fc11000 r-xp  08:02 4722034
  /lib/aarch64-linux-gnu/libpthread-2.21.so
7f8fc11000-7f8fc2 ---p 00016000 08:02 4722034
  /lib/aarch64-linux-gnu/libpthread-2.21.so
7f8fc2-7f8fc21000 r-xp 00015000 08:02 4722034
  /lib/aarch64-linux-gnu/libpthread-2.21.so
7f8fc21000-7f8fc22000 rwxp 00016000 08:02 4722034
  /lib/aarch64-linux-gnu/libpthread-2.21.so
7f8fc22000-7f8fc26000 rwxp  00:00 0
7f8fc26000-7f8fc7f000 r-xp  08:02 4718668
  /lib/aarch64-linux-gnu/libpcre.so.3.13.1
7f8fc7f000-7f8fc8f000 ---p 00059000 08:02 4718668
  /lib/aarch64-linux-gnu/libpcre.so.3.13.1
7f8fc8f000-7f8fc9 r-xp 00059000 08:02 4718668
  /lib/aarch64-linux-gnu/libpcre.so.3.13.1
7f8fc9-7f8fc91000 rwxp 0005a000 08:02 4718668
  /lib/aarch64-linux-gnu/libpcre.so.3.13.1
7f8fc91000-7f8fdc1000 r-xp  08:02 4722027
  /lib/aarch64-linux-gnu/libc-2.21.so
7f8fdc1000-7f8fdd ---p 0013 08:02 4722027
  /lib/aarch64-linux-gnu/libc-2.21.so
7f8fdd-7f8fdd4000 r-xp 0012f000 08:02 4722027
  /lib/aarch64-linux-gnu/libc-2.21.so
7f8fdd4000-7f8fdd6000 rwxp 00133000 08:02 4722027
  /lib/aarch64-linux-gnu/libc-2.21.so
7f8fdd6000-7f8fdda000 rwxp  00:00 0
7f8fdda000-7f8fde3000 r-xp  08:02 10885206
  /usr/lib/aarch64-linux-gnu/libnuma.so.1.0.0
7f8fde3000-7f8fdf2000 ---p 9000 08:02 10885206
  /usr/lib/aarch64-linux-gnu/libnuma.so.1.0.0
7f8fdf2000-7f8fdf3000 r-xp 8000 08:02 10885206
  /usr/lib/aarch64-linux-gnu/libnuma.so.1.0.0
7f8fdf3000-7f8fdf4000 rwxp 9000 08:02 10885206
  /usr/lib/aarch64-linux-gnu/libnuma.so.1.0.0
7f8fdf4000-7f8fdf8000 rwxp  00:00 0
7f8fdf8000-7f8fe89000 r-xp  08:02 4722041
  /lib/aarch64-linux-gnu/libm-2.21.so
7f8fe89000-7f8fe98000 ---p 00091000 08:02 4722041
  /lib/aarch64-linux-gnu/libm-2.21.so
7f8fe98000-7f8fe99000 r-xp 0009 08:02 4722041
  /lib/aarch64-linux-gnu/libm-2.21.so
7f8fe99000-7f8fe9a000 rwxp 00091000 08:02 4722041
  /lib/aarch64-linux-gnu/libm-2.21.so
7f8fe9a000-7f8ff8c000 r-xp  08:02 4718610
  /lib/aarch64-linux-gnu/libglib-2.0.so.0.4400.1
7f8ff8c000-7f8ff9c000 ---p 000f2000 08:02 4718610
  /lib/aarch64-linux-gnu/libglib-2.0.so.0.4400.1
7f8ff9c000-7f8ff9d000 r-xp 000f2000 08:02 4718610
  /lib/aarch64-linux-gnu/libglib-2.0.so.0.4400.1
7f8ff9d000-7f8ff9e000 rwxp 000f3000 08:02 4718610
  /lib/aarch64-linux-gnu/libglib-2.0.

[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-06 Thread Colin Ian King
I re-ran this today with the following script as a non-root user:

#!/bin/bash
tests="affinity aio bigheap brk bsearch cache chdir chmod clock context cpu 
crypt dentry dir dup epoll eventfd fstat fallocate fault fifo flock fork futex 
get getrandom hdd hsearch inotify io itimer kcmp kill lease link lockf longjmp 
lsearch malloc matrix memcpy memfd mincore mlock mmap mmapmany mremap msg mq 
nice null open pipe poll procfs pthread qsort readahead rename rlimit seek sem 
sem-sysv sendfile shm-sysv sigfd sigfpe sigq sigsegv sock splice stack str 
switch symlink sysinfo sysfs tee timer timerfd tsearch udp udp-flood urandom 
utime vecmath vfork vm vm-rw vm-splice wcs wait yield xattr zero zombie"

for t in $tests
do
echo $t
echo $t | sudo tee /dev/kmsg
./stress-ng --$t 0 -v -t 60
done

and hit this issue:

[14098.848615] urandom
[14111.696335] irqbalance[828]: unhandled level 2 translation fault (11) at 
0x4f64, esr 0x9206
[14111.696341] pgd = ffcfef71b000
[14111.737149] [4f64] *pgd=004fef1f3003, *pud=004fef1f3003, 
*pmd=

[14111.836705] CPU: 0 PID: 828 Comm: irqbalance Not tainted 3.19.0-21-generic 
#21-Ubuntu
[14111.836707] Hardware name: HP ProLiant m400 Server Cartridge (DT)
[14111.836710] task: ffcfefb0bd40 ti: ffcfb452c000 task.ti: 
ffcfb452c000
[14111.836723] PC is at 0x7fb1061834
[14111.836725] LR is at 0x7fb10617f4
[14111.836728] pc : [<007fb1061834>] lr : [<007fb10617f4>] pstate: 
8000
[14111.836729] sp : 007fc7cef6e0
[14111.836731] x29: 007fc7cef6e0 x28: 004095a0 
[14111.836735] x27: 00409548 x26: 0041a000 
[14111.836737] x25: 0001 x24: 0010 
[14111.836740] x23: 004e58a0 x22: 004e5880 
[14111.836750] x21: 0018 x20: 007fb10fd000 
[14111.836762] x19: 0002 x18:  
[14111.836765] x17: 007fb0d678ec x16: 007fb10fc2e0 
[14111.836768] x15: 0020 x14: 0001 
[14111.836770] x13:  x12:  
[14111.836773] x11: 007fc7ced250 x10: 0010 
[14111.836775] x9 : 00a0 x8 : 0007 
[14111.836778] x7 : 0033 x6 : 004e5c80 
[14111.836780] x5 : 0001 x4 : 007fb0d802a0 
[14111.836783] x3 : 004e5880 x2 : 0001 
[14111.836785] x1 : 03fa x0 : 4f5c

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-03 Thread Ming Lei
Hi Colin,

On Sat, Jul 4, 2015 at 12:43 AM, Colin Ian King
<1469...@bugs.launchpad.net> wrote:
> I was able to hit the following translation fault running sudo ./stress-
> ng --seq 0 -t 60 --syslog --metrics --times -v

I suggest to not run stress-ng as root, otherwise it can be less
serious because:

  - root user can do bad things easily, and it is quite easy to kill any
of process
  - in reality most of loads are run as non-root

If some system processes(irqbalance, systemd-*) are only killed
becasue stress-ng is running as root, it can be a low priority issue.
Otherwise we need pay close attention to the issue.

And I always run 'stress-ng' as ubuntu user without sudo, that may
be the reason why it is difficult for me to reproduce that.

Even with the two new approaches, it is still not easy for me to
reproduce that. I only see one time of translation fault by your
first approach(./stress-ng --seq 0 ...)  in 6 hours, and can't trigger
that with your 2nd approach(by bash script).

Folllows the log[1] I triggered, and I think it is very likely a userspace
issue. From irqbalanc-dbgsym package, we can easily find 'PC is at
0x406078' is one address in text section, and it should be inside
function of 'place_irq_in_node' because the exec file isn't built as
relocation. One thing I still can't understand is that why the fault
address is '0x0040' in the context.


[1]
[ 3616.92] Bits 55-60 of /proc/PID/pagemap entries are about to
stop being page-shift some time soon. See the
linux/Documentation/vm/pagemap.txt for details.
[ 3616.93] Bits 55-60 of /proc/PID/pagemap entries are about to
stop being page-shift some time soon. See the
linux/Documentation/vm/pagemap.txt for details.
[ 5316.367265] irqbalance[1457]: unhandled level 2 translation fault
(11) at 0x0040, esr 0x9206
[ 5316.476937] pgd = ffcfb5478000
[ 5316.520692] [0040] *pgd=004fb4a3c003,
*pud=004fb4a3c003, *pmd=
[ 5316.620270]
[ 5316.638140] CPU: 7 PID: 1457 Comm: irqbalance Not tain-21-generic #21-Ubuntu
[ 5316.733212] Hardware name: HP ProLiant m400 Server Cartridge (DT)
[ 5316.806382] task: ffcfb55e6e40 ti: ffcfa72b task.ti:
ffcfa72b
[ 5316.896258] PC is at 0x406078
[ 5316.931865] LR is at 0x404100
[ 5316.967457] pc : [<00406078>] lr : [<00404100>]
pstate: 2000
[ 5317.056268] sp : 007fc07ff2d0
[ 5317.096038] x29: 007fc07ff2d0 x28: 004095a0
[ 5317.160023] x27: 00409548 x26: 0041a000
[ 5317.223897] x25: 00405000 x24: 0041acf8
[ 5317.287868] x23: 0041a000 x22: 0041a000
[ 5317.351841] x21: 2e0d6050 x20: 0041a000
[ 5317.415744] x19: 2e0e9020 x18: 
[ 5317.479620] x17: 007fb5ac287c x16: 0041a188
[ 5317.543490] x15: 003bdd2370f74a1c x14: 2030203020302030
[ 5317.607373] x13: 2030203020302030 x12: 2030203020302030
[ 5317.671263] x11: 2030203020302030 x10: 2030203020302030
[ 5317.735137] x9 : 00a0 x8 : 0001
[ 5317.799113] x7 : 0033 x6 : 2e0d6e08
[ 5317.862983] x5 : 0040 x4 : 
[ 5317.926867] x3 : 2e0d7008 x2 : 
[ 5317.990840] x1 : 002c x0 : 0003
[ 5318.054713]

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-03 Thread Ming Lei
Hi Colin,

That looks one progress, but still takes time to reproduce that,
and I will use your new approach to reproduce that.

When you are doing that, could you dump the file of /proc/$(pidof
irqbalance)/maps so that we can see where the faulted address are
in the process's vm space?

thanks,


On Sat, Jul 4, 2015 at 4:10 AM, Colin Ian King
<1469...@bugs.launchpad.net> wrote:
> Running the following:
>
> #!/bin/bash
> tests="affinity aio bigheap brk bsearch cache chdir chmod clock context cpu 
> crypt dentry dir dup epoll eventfd fstat fallocate fault fifo flock fork 
> futex get getrandom hdd hsearch inotify io itimer kcmp kill lease link lockf 
> longjmp lsearch malloc matrix memcpy memfd mincore mlock mmap mmapmany mremap 
> msg mq nice null open pipe poll procfs pthread qsort readahead rename rlimit 
> seek sem sem-sysv sendfile shm-sysv sigfd sigfpe sigq sigsegv sock splice 
> stack str switch symlink sysinfo sysfs tee timer timerfd tsearch udp 
> udp-flood urandom utime vecmath vfork vm vm-rw vm-splice wcs wait yield xattr 
> zero zombie"
>
> for t in $tests
> do
> echo $t
> echo $t > /dev/kmsg
> ./stress-ng --$t 0 -v -t 60
> done
>
> eventually tripped the translation fault in irqbalance.  I ran this
> after a clean reboot.
>
> [ 4901.799846] timerfd
> [ 4961.807050] tsearch
> [ 5021.884456] udp
> [ 5081.895058] udp-flood
> [ 5141.674365] irqbalance[827]: unhandled level 2 translation fault (11) at 
> 0x002d6da4, esr 0x9206
> [ 5141.674376] pgd = ffcfb51a
> [ 5141.715215] [002d6da4] *pgd=004fb677e003, *pud=004fb677e003, 
> *pmd=
>
> [ 5141.816183] CPU: 0 PID: 827 Comm: irqbalance Not tainted 3.19.0-21-generic 
> #21-Ubuntu
> [ 5141.816185] Hardware name: HP ProLiant m400 Server Cartridge (DT)
> [ 5141.816188] task: ffcfac088000 ti: ffcfab71 task.ti: 
> ffcfab71
> [ 5141.816206] PC is at 0x7f88287834
> [ 5141.816208] LR is at 0x7f882877f4
> [ 5141.816210] pc : [<007f88287834>] lr : [<007f882877f4>] pstate: 
> 8000
> [ 5141.816212] sp : 007ff2e46b30
> [ 5141.816214] x29: 007ff2e46b30 x28: 004095a0
> [ 5141.816217] x27: 00409548 x26: 0041a000
> [ 5141.816220] x25: 0001 x24: 0010
> [ 5141.816222] x23: 2d6c98a0 x22: 2d6c9880
> [ 5141.816225] x21: 0018 x20: 007f88323000
> [ 5141.816228] x19: 0002 x18: 
> [ 5141.816230] x17: 007f87f8d8ec x16: 007f883222e0
> [ 5141.816233] x15: 0020 x14: 0001
> [ 5141.816235] x13:  x12: 
> [ 5141.816237] x11: 007ff2e446a0 x10: 0010
> [ 5141.816240] x9 : 00a0 x8 : 0007
> [ 5141.816242] x7 : 0033 x6 : 2d6c9c80
> [ 5141.816245] x5 : 0001 x4 : 007f87fa62a0
> [ 5141.816247] x3 : 2d6c9880 x2 : 0001
> [ 5141.816250] x1 : 03fa x0 : 002d6d9c
>
> [ 5141.907792] urandom
> [ 5201.928712] utime
> [ 5261.934534] vecmath
> [ 5321.940302] vfork
> [ 5381.947904] vm
> [ 5441.991784] vm-rw
> [ 5502.017614] vm-splice
> [ 5562.023334] wcs
> [ 5622.037054] wait
> [ 5682.043302] yield
> [ 5742.056595] xattr
> [ 5802.075772] zero
> [ 5862.087396] zombie
>
> --
> You received this bug notification because you are subscribed to linux
> in Ubuntu.
> https://bugs.launchpad.net/bugs/1469214
>
> Title:
>   HP ProLiant m400 Server crashes with unhandled level 3 translation
>   fault
>
> Status in linux package in Ubuntu:
>   Triaged
>
> Bug description:
>   Running stress-ng on a HP ProLiant m400 server can cause unhandled
>   level 3 translations faults:
>
>   use stress-ng from git://kernel.ubuntu.com/cking/stress-ng
>
>   ./stress-ng --seq 0 -t 60 -v
>
>   and after some time this trips the following:
>
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922560] 
> systemd-timesyn[481]: unhandled level 3 translation fault (7) at 
> 0x7fa8ea6008, esr 0x9207
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922561] pgd = 
> ffcfb563f000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922563] [7fa8ea6008] 
> *pgd=004fb4f28003, *pud=004fb4f28003, *pmd=004fb4f38003, 
> *pte=1d151c00
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922566]
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922569] CPU: 6 PID: 481 
> Comm: systemd-timesyn Not tainted 3.19.0-21-generic #21-Ubuntu
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922571] Hardware name: HP 
> ProLiant m400 Server Cartridge (DT)
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922573] task: 
> ffcfb4e3b100 ti: ffcfb4d2c000 task.ti: ffcfb4d2c000
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922588] PC is at 
> 0x7fa8d81824
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922589] LR is at 
> 0x7fa8e3b3e4
>   Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922590] pc : 
> [<007fa8d81824>]

[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-03 Thread Colin Ian King
Running the following:

#!/bin/bash
tests="affinity aio bigheap brk bsearch cache chdir chmod clock context cpu 
crypt dentry dir dup epoll eventfd fstat fallocate fault fifo flock fork futex 
get getrandom hdd hsearch inotify io itimer kcmp kill lease link lockf longjmp 
lsearch malloc matrix memcpy memfd mincore mlock mmap mmapmany mremap msg mq 
nice null open pipe poll procfs pthread qsort readahead rename rlimit seek sem 
sem-sysv sendfile shm-sysv sigfd sigfpe sigq sigsegv sock splice stack str 
switch symlink sysinfo sysfs tee timer timerfd tsearch udp udp-flood urandom 
utime vecmath vfork vm vm-rw vm-splice wcs wait yield xattr zero zombie"

for t in $tests
do
echo $t
echo $t > /dev/kmsg
./stress-ng --$t 0 -v -t 60
done

eventually tripped the translation fault in irqbalance.  I ran this
after a clean reboot.

[ 4901.799846] timerfd
[ 4961.807050] tsearch
[ 5021.884456] udp
[ 5081.895058] udp-flood
[ 5141.674365] irqbalance[827]: unhandled level 2 translation fault (11) at 
0x002d6da4, esr 0x9206
[ 5141.674376] pgd = ffcfb51a
[ 5141.715215] [002d6da4] *pgd=004fb677e003, *pud=004fb677e003, 
*pmd=

[ 5141.816183] CPU: 0 PID: 827 Comm: irqbalance Not tainted 3.19.0-21-generic 
#21-Ubuntu
[ 5141.816185] Hardware name: HP ProLiant m400 Server Cartridge (DT)
[ 5141.816188] task: ffcfac088000 ti: ffcfab71 task.ti: 
ffcfab71
[ 5141.816206] PC is at 0x7f88287834
[ 5141.816208] LR is at 0x7f882877f4
[ 5141.816210] pc : [<007f88287834>] lr : [<007f882877f4>] pstate: 
8000
[ 5141.816212] sp : 007ff2e46b30
[ 5141.816214] x29: 007ff2e46b30 x28: 004095a0 
[ 5141.816217] x27: 00409548 x26: 0041a000 
[ 5141.816220] x25: 0001 x24: 0010 
[ 5141.816222] x23: 2d6c98a0 x22: 2d6c9880 
[ 5141.816225] x21: 0018 x20: 007f88323000 
[ 5141.816228] x19: 0002 x18:  
[ 5141.816230] x17: 007f87f8d8ec x16: 007f883222e0 
[ 5141.816233] x15: 0020 x14: 0001 
[ 5141.816235] x13:  x12:  
[ 5141.816237] x11: 007ff2e446a0 x10: 0010 
[ 5141.816240] x9 : 00a0 x8 : 0007 
[ 5141.816242] x7 : 0033 x6 : 2d6c9c80 
[ 5141.816245] x5 : 0001 x4 : 007f87fa62a0 
[ 5141.816247] x3 : 2d6c9880 x2 : 0001 
[ 5141.816250] x1 : 03fa x0 : 002d6d9c 

[ 5141.907792] urandom
[ 5201.928712] utime
[ 5261.934534] vecmath
[ 5321.940302] vfork
[ 5381.947904] vm
[ 5441.991784] vm-rw
[ 5502.017614] vm-splice
[ 5562.023334] wcs
[ 5622.037054] wait
[ 5682.043302] yield
[ 5742.056595] xattr
[ 5802.075772] zero
[ 5862.087396] zombie

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-07-03 Thread Colin Ian King
I was able to hit the following translation fault running sudo ./stress-
ng --seq 0 -t 60 --syslog --metrics --times -v

[90103.913447] irqbalance[807]: unhandled level 2 translation fault (11) at 
0x001754a4, esr 0x9206
[90103.913454] pgd = ffcfb5926000
[90103.954271] [001754a4] *pgd=004fb5a8b003, *pud=004fb5a8b003, 
*pmd=

[90104.053696] CPU: 1 PID: 807 Comm: irqbalance Not tainted 3.19.0-21-generic 
#21-Ubuntu
[90104.053698] Hardware name: HP ProLiant m400 Server Cartridge (DT)
[90104.053701] task: ffcfb59c4980 ti: ffcfb5814000 task.ti: 
ffcfb5814000
[90104.053717] PC is at 0x7f95548834
[90104.053719] LR is at 0x7f955487f4
[90104.053721] pc : [<007f95548834>] lr : [<007f955487f4>] pstate: 
8000
[90104.053723] sp : 007fcf72a410
[90104.053725] x29: 007fcf72a410 x28: 004095a0 
[90104.053728] x27: 00409548 x26: 0041a000 
[90104.053731] x25: 0001 x24: 0010 
[90104.053733] x23: 175398a0 x22: 17539880 
[90104.053736] x21: 0018 x20: 007f955e4000 
[90104.053738] x19: 0002 x18:  
[90104.053741] x17: 007f9524e8ec x16: 007f955e32e0 
[90104.053743] x15: 0020 x14: 0001 
[90104.053745] x13:  x12:  
[90104.053748] x11: 007fcf727f80 x10: 0010 
[90104.053750] x9 : 00a0 x8 : 0007 
[90104.053753] x7 : 0033 x6 : 17539c80 
[90104.053755] x5 : 0001 x4 : 007f952672a0 
[90104.053758] x3 : 17539880 x2 : 0001 
[90104.053760] x1 : 03fa x0 : 0017549c

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-06-30 Thread Ming Lei
Oops, the test result in #4 is for LP1469218 instead of this one.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-06-29 Thread Ming Lei
I can't reproduce it after running half a day on ms10-36, and OOM is often 
triggered .

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-06-29 Thread Colin Ian King
Hrm, OK, I'll see if I can find a better reproducer.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-06-29 Thread dann frazier
fyi, I ran this in a loop over the weekend and the issue has not
reproduced.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-06-26 Thread Joseph Salisbury
** Changed in: linux (Ubuntu)
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu)
   Status: Incomplete => Triaged

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

2015-06-26 Thread Colin Ian King
** Summary changed:

- HP ProLiant m400 Server 
+ HP ProLiant m400 Server crashes with unhandled level 3 translation fault

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1469214

Title:
  HP ProLiant m400 Server crashes with unhandled level 3 translation
  fault

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469214/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs