Public bug reported:

Boot failure with 5.4.0-7-generic on an OPAL POWER box:

I was running ADT tests when the machine hung or rebooted, and I could
no longer log in. After I rebooted the machine with the ipmi tool, it
crashed during boot with the following console output:

[   51.081421774,5] SkiBoot skiboot-5.4.8-5787ad3 starting...
[   51.081426316,5] initial console log level: memory 7, driver 5
[   51.081429224,6] CPU: P8 generation processor(max 8 threads/core)
[   51.081432044,7] CPU: Boot CPU PIR is 0x0470 PVR is 0x004d0200
[   51.081435009,7] CPU: Initial max PIR set to 0x1fff
[   51.082535316,5] OPAL table: 0x300bfc40 .. 0x300c0110, branch table: 0x30002000
[   51.082543101,5] FDT: Parsing fdt @0xff00000
[   51.087692296,5] XSCOM: chip 0x0 at 0x3fc0000000000 [P8 DD2.0]
[   51.087702232,5] XSCOM: chip 0x8 at 0x3fc4000000000 [P8 DD2.0]
[   51.087709775,6] XSTOP: XSCOM addr = 0x2010c82, FIR bit = 31
[   51.087713185,6] MFSI 0:0: Initialized
[   51.087715462,6] MFSI 0:2: Initialized
[   51.087717669,6] MFSI 0:1: Initialized
[   51.087720203,6] MFSI 8:0: Initialized
[   51.087722365,6] MFSI 8:2: Initialized
[   51.087724518,6] MFSI 8:1: Initialized
[   51.088044434,5] LPC: LPC[000]: Initialized, access via XSCOM @0xb0020
[   51.088162270,5] LPC: LPC: Default bus on chip 0x0
[   51.088303476,6] MEM: parsing reserved memory from node /ibm,hostboot/reserved-memory
[   51.088313438,7] HOMER: Init chip 0
[   51.088316406,7]   PBA BAR0 : 0x00000007fd800000
[   51.088319108,7]   PBA MASK0: 0x0000000000300000
[   51.088321761,7]   HOMER Image at 0x7fd800000 size 4MB
[   51.088325579,7]   PBA BAR2 : 0x40000007fda00000
[   51.088328358,7]   PBA MASK2: 0x0000000000000000
[   51.088330928,7]   SLW Image at 0x7fda00000 size 1MB
[   51.088334409,7]   PBA BAR3 : 0x00000007ff800000
[   51.088337060,7]   PBA MASK3: 0x0000000000700000
[   51.088339732,7]   OCC Common Area at 0x7ff800000 size 8MB
[   51.088342594,7] HOMER: Init chip 8
[   51.088345257,7]   PBA BAR0 : 0x00000007fdc00000
[   51.088347872,7]   PBA MASK0: 0x0000000000300000
[   51.088350519,7]   HOMER Image at 0x7fdc00000 size 4MB
[   51.088354173,7]   PBA BAR2 : 0x40000007fde00000
[   51.088356860,7]   PBA MASK2: 0x0000000000000000
[   51.088359365,7]   SLW Image at 0x7fde00000 size 1MB
[   51.088362788,7]   PBA BAR3 : 0x00000007ff800000
[   51.088365419,7]   PBA MASK3: 0x0000000000700000
[   51.088367946,7]   OCC Common Area at 0x7ff800000 size 8MB
[   51.088387526,7] CPU idle state device tree init
[   51.088391002,4] SLW: HB-provided idle states property found
[   51.088567406,7] AST: PNOR LPC offset: 0x0c000000
[   51.088650577,5] PLAT: Using virtual UART
[   51.088977615,7] UART: Using LPC IRQ 4
[   51.203625382,5] PLAT: Detected Firestone platform
[   51.219765305,5] PLAT: Detected BMC platform AMI
[   51.239417466,5] CENTAUR: Found centaur for chip 0x0 channel 4
[   51.239524825,5] CENTAUR:   FSI host: 0x0 cMFSI0 port 7
[   51.241283553,5] CENTAUR: Found centaur for chip 0x0 channel 5
[   51.241759761,5] CENTAUR:   FSI host: 0x0 cMFSI0 port 6
[   51.242362656,5] PSI[0x000]: Found PSI bridge [active=0]
[   51.242690427,5] PSI[0x008]: Found PSI bridge [active=0]
[   51.245117930,5] CPU: All 128 processors called in...
[    2.472212005,5] FLASH: Found system flash: Macronix MXxxL51235F id:0
[    2.472354468,5] BT: Interface initialized, IO 0x00e4
[    3.421491873,5] NVRAM: Size is 576 KB
[    4.095942958,5] STB: secure mode off
[    4.096004331,5] STB: trusted mode off
[    4.096965839,5] CAPI: Preloading ucode 200ea
[    4.097023615,5] FLASH: Queueing preload of 2/200ea
[    4.097202595,5] FLASH: Queueing preload of 0/0
[    4.097723471,5] FLASH: Queueing preload of 1/0
[    4.097739635,7] FFS: Partition map size: 0x1000
[    4.101069429,7] FLASH: CAPP partition has ECC
[    4.117588444,5] STB: sb_verify skipped resource 2, secure_mode=0
[    4.117607170,5] Chip 0 Found PBCQ0 at /xscom@3fc0000000000/pbcq@2012000
[    4.117610665,7] PHB3[0:0]: X[PE]=0x02012000 X[PCI]=0x09012000 X[SPCI]=0x09013c00
[    4.117690635,7] PHB3[0:0] REGS     = 0x0003fffe40000000 [4k]
[    4.124862367,7] PHB3[0:0] PCIBAR   = 0x0003fffe40000000
[    4.144741905,7] PHB3[0:0] MMIO0    = 0x0000200000000000 [0x0000010000000000]
[    4.147663099,7] PHB3[0:0] MMIO1    = 0x00003fe000000000 [0x0000000080000000]
[    4.151015049,7] PHB3[0:0] BAREN    = 0xf800000000000000
[    4.151018735,7] PHB3[0:0] NEWBAREN = 0xf800000000000000
[    4.152491015,7] PHB3[0:0] IRSNC    = 0x0100000000000000
[    4.177266431,5] STB: tb_measure skipped resource 2, trusted_mode=0
[    4.177266472,7] PHB3[0:0] IRSNM    = 0xff00000000000000
[    4.177269336,7] PHB3[0:0] LSI      = 0xff00000000000000
[    4.177278668,5] Chip 0 Found PBCQ1 at /xscom@3fc0000000000/pbcq@2012400
[    4.177282022,7] PHB3[0:1]: X[PE]=0x02012400 X[PCI]=0x09012400 X[SPCI]=0x09013c40
[    4.178715842,7] PHB3[0:1] REGS     = 0x0003fffe40100000 [4k]
[    4.183043807,7] PHB3[0:1] PCIBAR   = 0x0003fffe40100000
[    4.190163295,5] Chip 8 Found PBCQ0 at /xscom@3fc4000000000/pbcq@2012000
[    4.208231423,5] Chip 8 Found PBCQ1 at /xscom@3fc4000000000/pbcq@2012400
[    7.170627939,5] Chip 8 Found PBCQ2 at /xscom@3fc4000000000/pbcq@2012800
[    8.269331117,3] PHB#0000: Base location code not found !
[   13.422844377,5] STB: sb_verify skipped resource 0, secure_mode=0
[   13.422853191,7] BT: seq 0x05 netfn 0x06 cmd 0x06: Message sent to host
[   13.423112031,5] STB: tb_measure skipped resource 0, trusted_mode=0
[   13.425274455,3] FLASH: No ROOTFS partition
[   13.435729110,3] PHB#0001: Base location code not found !
[   13.497563875,3] PHB#0020: Base location code not found !
[   14.047321740,3] PHB#0021: Base location code not found !
[   14.109002459,3] PHB#0022: Base location code not found !
[   14.170907665,5] PCI: Resetting PHBs...
[   15.273761743,5] PCI: Probing slots...
[   16.432898479,5] PHB#0000:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff SLOT=Slot5
[   16.434393023,5] PHB#0001:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff SLOT=Slot4
[   16.434910029,5] PHB#0020:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..ff SLOT=Slot2
[   16.435845882,5] PHB#0021:00:00.0 [ROOT] 1014 03dc R:00 C:060400 B:01..15 SLOT=Backplane PLX
[   16.438571433,5] PHB#0021:01:00.0 [SWUP] 10b5 8725 R:ca C:060400 B:02..15 LOC_CODE=Backplane PLX
[   16.440061205,5] PHB#0021:02:01.0 [SWDN] 10b5 8725 R:ca C:060400 B:03..07 SLOT=Slot3
[   16.445628911,5] PHB#0021:02:08.0 [SWDN] 10b5 8725 R:ca C:060400 B:08..0c
[   16.447124810,5] PHB#0021:02:09.0 [SWDN] 10b5 8725 R:ca C:060400 B:0d..0d SLOT=Backplane USB
[   16.449944597,5] PHB#0021:0d:00.0 [EP  ] 104c 8241 R:02 C:0c0330 (      usb-xhci) LOC_CODE=Backplane USB
[   16.451469746,5] PHB#0021:02:0a.0 [SWDN] 10b5 8725 R:ca C:060400 B:0e..0e SLOT=Backplane SATA
[   16.485306975,5] PHB#0021:0e:00.0 [LGCY] 1b4b 9235 R:11 C:010601 (          sata) LOC_CODE=Backplane SATA
[   16.490275695,5] PHB#0021:02:0b.0 [SWDN] 10b5 8725 R:ca C:060400 B:0f..10 SLOT=Backplane BMC
[   16.491741344,5] PHB#0021:0f:00.0 [ETOX] 1a03 1150 R:03 C:060400 B:10..10 LOC_CODE=Backplane BMC
[   16.493424358,5] PHB#0021:10:00.0 [PCID] 1a03 2000 R:30 C:030000 (           vga) LOC_CODE=Backplane BMC
[   16.496259662,5] PHB#0021:02:0c.0 [SWDN] 10b5 8725 R:ca C:060400 B:11..15 
 Petitboot (v1.4.4-e414dbe)                           8335-GTA 0000000000000000
 ──────────────────────────────────────────────────────────────────────────────
    Ubuntu, with Linux 5.4.0-7-generic (recovery mode)
    Ubuntu, with Linux 5.4.0-7-generic
    Ubuntu
  [Network: enP4p1s0f0 / 98:be:94:01:1f:a4]
    netboot enP4p1s0f0 (pxelinux.0)
  [Network: enP4p1s0f2 / 98:be:94:01:1f:a6]
    netboot enP4p1s0f2 (pxelinux.0)
  [Network: enP4p1s0f1 / 98:be:94:01:1f:a5]
    netboot enP4p1s0f1 (pxelinux.0)
  [Network: enP4p1s0f3 / 98:be:94:01:1f:a7]
    netboot enP4p1s0f3 (pxelinux.0)

  System information
  System configuration
  System status log
  Language
  Rescan devices
  Retrieve config from URL
 *Exit to shell                                        
 ──────────────────────────────────────────────────────────────────────────────
 Enter=accept, e=edit, n=new, x=exit, l=language, g=log, h=help
The system is going down NOW!ig from tftp://10.245.71.3/ppc64el/pxelinux.cfg/01-
Sent SIGTERM to all processes
Sent SIGKILL to all processes
cpu 0x78: Vector: 300 (Data Access) at [c0000007ff3b3a00]
    pc: c0000000004a5a0c: xhci_irq+0x44c/0x18a0
    lr: c0000000004a5604: xhci_irq+0x44/0x18a0
    sp: c0000007ff3b3c80
   msr: 9000000000009033
   dar: b0
 dsisr: 40000000
  current = 0xc000000001300280
  paca    = 0xc00000000fe96800   softe: 0        irq_happened: 0x01
    pid   = 0, comm = swapper/120
enter ? for help
[c0000007ff3b3c80] c0000000004a6bcc xhci_irq+0x160c/0x18a0 (unreliable)
[c0000007ff3b3e00] c000000000096bd8 handle_irq_event_percpu+0x58/0x170
[c0000007ff3b3eb0] c000000000096d5c handle_irq_event+0x6c/0x9c
[c0000007ff3b3ee0] c00000000009ac18 handle_fasteoi_irq+0xc8/0x184
[c0000007ff3b3f10] c0000000000961a0 generic_handle_irq+0x34/0x54
[c0000007ff3b3f30] c00000000000df28 __do_irq+0xb4/0xd0
[c0000007ff3b3f90] c000000000019d58 call_do_irq+0x14/0x24
[c00000000133ba80] c00000000000dfd4 do_IRQ+0x90/0xcc
[c00000000133bad0] c0000000000021a8 hardware_interrupt_common+0x128/0x180
--- Exception: 501 (Hardware Interrupt) at c00000000000d8f0 arch_local_irq_restore+0x70/0x80
[c00000000133bdc0] 0000000000000001 (unreliable)
[c00000000133bde0] c0000000004db3e0 cpuidle_enter_state+0x1c8/0x238
[c00000000133be30] c00000000008c39c cpu_startup_entry+0x250/0x2ec
[c00000000133bee0] c00000000000b4d8 rest_init+0x9c/0xb0
[c00000000133bf00] c0000000007b3bf8 start_kernel+0x510/0x518
[c00000000133bf90] c000000000008c60 start_here_common+0x20/0x440
78:mon> 

I was able to reboot into a previous kernel without any issues.

** Affects: linux (Ubuntu)
     Importance: High
         Status: New

** Changed in: linux (Ubuntu)
   Importance: Undecided => High

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1855143

Title:
  5.4.0-7 kernel crash on boot on power box

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1855143/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
