Just happened two days ago on Trusty 14.04.4 with Linux 4.4.0-31 on one of Ceph Jewel OSD server. It ran fine for 8 days though and suddenly the CPU load spiked to 600.
The server is from SuperMicro SuperStorage Server SSG-6048R-E1CR36L with these following specs: 2x Intel Xeon E5-2630 v3 @ 2.4GHz 16C/32T 128GB DDR3 ECC 2x 80GB Intel SSD S3500 series for OS Drive in mdraid1 mode 2x 800GB Intel PCIe SSD S3700 series for Ceph OSD Journal 36x 6TB Samsung NAS 7200rpm SAS drives for Ceph OSDs 4x 10GbE SFP+ Intel 82599ES ethernet with LACP bonding mode The apport-collect log is attached. Hope this helps. /chrone ** Attachment added: "ubuntu trusty hwe lts xenial - ceph jewel osd - apport-collect.zip" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1568729/+attachment/4708813/+files/ubuntu%20trusty%20hwe%20lts%20xenial%20-%20ceph%20jewel%20osd%20-%20apport-collect.zip -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1568729 Title: divide error: 0000 [#1] SMP in task_numa_migrate - handle_mm_fault To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1568729/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs