------- Comment From vipar...@in.ibm.com 2017-07-07 17:18 EDT-------
This is same issue being debugged under LTC bug 149014 / LP1659111

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702998

Title:
  Ubuntu 17.04: Guest crashed @writeback_sb_inodes+0x310/0x590

Status in The Ubuntu-power-systems project:
  New
Status in linux package in Ubuntu:
  New

Bug description:
  == Comment: #0 - Lata Kuntal <lakun...@in.ibm.com> - 2017-03-03 00:50:54 ==
  Ubuntu 17.04 guest dropped at xmon after crashing at 
writeback_sb_inodes+0x310/0x590. 
  The guest is having XFS rootfs and NPIV disk. It crashed after 30+ hrs of 
BASE and NFS stress test .

  Crash logs
  =======
  root@guskvm:~# virsh console gusg1 --force
  Connected to domain gusg1
  Escape character is ^]

  0:mon>
  0:mon> t
  [c0000000a4bc7940] c00000000036f790 writeback_sb_inodes+0x310/0x590
  [c0000000a4bc7a50] c00000000036faf4 __writeback_inodes_wb+0xe4/0x150
  [c0000000a4bc7ab0] c00000000036ff1c wb_writeback+0x2cc/0x440
  [c0000000a4bc7b80] c000000000370c30 wb_workfn+0x150/0x560
  [c0000000a4bc7c90] c0000000000ed8c0 process_one_work+0x2b0/0x5a0
  [c0000000a4bc7d20] c0000000000edc58 worker_thread+0xa8/0x650
  [c0000000a4bc7dc0] c0000000000f67b4 kthread+0x154/0x1a0
  [c0000000a4bc7e30] c00000000000b4e8 ret_from_kernel_thread+0x5c/0x74
  0:mon> r
  R00 = c00000000036f790   R16 = c0000000eca70300
  R01 = c0000000a4bc78e0   R17 = c0000000f7035240
  R02 = c00000000143c900   R18 = 0000000000000000
  R03 = c0000000f7035150   R19 = 0000000000000000
  R04 = 0000000000000019   R20 = c0000000a4bc4000
  R05 = 0000000000000100   R21 = ffffffffffffff7f
  R06 = 0000000000000000   R22 = c00000000433d758
  R07 = 0000000000000000   R23 = c00000000433d738
  R08 = 0000000000034995   R24 = 0000000000000000
  R09 = 0000000000000000   R25 = 0000000000000000
  R10 = 0000000080000000   R26 = c0000000f70351d8
  R11 = c0000000a4bc7a40   R27 = 0000000000000000
  R12 = 0000000000002200   R28 = 0000000000000001
  R13 = c00000000fb80000   R29 = c00000000433d728
  R14 = 0000000000000000   R30 = c0000000f7035150
  R15 = c0000000f70351d8   R31 = 0000000000000000
  pc  = c00000000036c120 locked_inode_to_wb_and_lock_list+0x50/0x290
  cfar= c0000000000b2a14 kvmppc_save_tm+0x168/0x16c
  lr  = c00000000036f790 writeback_sb_inodes+0x310/0x590
  msr = 8000000000009033   cr  = 24002482
  ctr = c000000000381e30   xer = 0000000000000000   trap =  300
  dar = 0000000000000000   dsisr = 40000000
  0:mon> e
  cpu 0x0: Vector: 300 (Data Access) at [c0000000a4bc7660]
      pc: c00000000036c120: locked_inode_to_wb_and_lock_list+0x50/0x290
      lr: c00000000036f790: writeback_sb_inodes+0x310/0x590
      sp: c0000000a4bc78e0
     msr: 8000000000009033
     dar: 0
   dsisr: 40000000
    current = 0xc0000000fbe96000
    paca    = 0xc00000000fb80000   softe: 0        irq_happened: 0x01
      pid   = 17305, comm = kworker/u16:0
  Linux version 4.10.0-8-generic (buildd@bos01-ppc64el-001) (gcc version 6.3.0 
20161229 (Ubuntu 6.3.0-2ubuntu1) ) #10-Ubuntu SMP Mon Feb 13 14:00:06 UTC 2017 
(Ubuntu 4.10.0-8.10-generic 4.10.0-rc8)
  0:mon> d
  0000000000000000 **************** ****************  |                |
  0:mon>

  
  Host and guest kernel build
  =====================
  4.10.0-8-generic

  
  OPAL firmware version
  ----------------------------------------
    T side    : FW860.20 (SV860_078)
    Boot side : FW860.20 (SV860_078)

  
  == Comment: #4 - VIPIN K. PARASHAR <vipar...@in.ibm.com> - 2017-03-03 
02:55:20 ==
  [140071.761707] Adding 153536k swap on /dev/loop0.  Priority:-2 extents:1 
across:153536k FS
  [140072.153143] Adding 153472k swap on /dev/loop0.  Priority:-2 extents:1 
across:153472k FS
  [140072.441833] Unable to handle kernel paging request for data at address 
0x00000000
  [140072.442064] Faulting instruction address: 0xc00000000036c120
  0:mon>

  0:mon> e
  cpu 0x0: Vector: 300 (Data Access) at [c0000000a4bc7660]
      pc: c00000000036c120: locked_inode_to_wb_and_lock_list+0x50/0x290
      lr: c00000000036f790: writeback_sb_inodes+0x310/0x590
      sp: c0000000a4bc78e0
     msr: 8000000000009033
     dar: 0
   dsisr: 40000000
    current = 0xc0000000fbe96000
    paca    = 0xc00000000fb80000         softe: 0        irq_happened: 0x01
      pid   = 17305, comm = kworker/u16:0
  Linux version 4.10.0-8-generic (buildd@bos01-ppc64el-001) (gcc version 6.3.0 
20161229 (Ubuntu 6.3.0-2ubuntu1) ) #10-Ubuntu SMP Mon Feb 13 14:00:06 UTC 2017 
(Ubuntu 4.10.0-8.10-generic 4.10.0-rc8)
  0:mon> t
  [c0000000a4bc7940] c00000000036f790 writeback_sb_inodes+0x310/0x590
  [c0000000a4bc7a50] c00000000036faf4 __writeback_inodes_wb+0xe4/0x150
  [c0000000a4bc7ab0] c00000000036ff1c wb_writeback+0x2cc/0x440
  [c0000000a4bc7b80] c000000000370c30 wb_workfn+0x150/0x560
  [c0000000a4bc7c90] c0000000000ed8c0 process_one_work+0x2b0/0x5a0
  [c0000000a4bc7d20] c0000000000edc58 worker_thread+0xa8/0x650
  [c0000000a4bc7dc0] c0000000000f67b4 kthread+0x154/0x1a0
  [c0000000a4bc7e30] c00000000000b4e8 ret_from_kernel_thread+0x5c/0x74
  0:mon> r
  R00 = c00000000036f790   R16 = c0000000eca70300
  R01 = c0000000a4bc78e0   R17 = c0000000f7035240
  R02 = c00000000143c900   R18 = 0000000000000000
  R03 = c0000000f7035150   R19 = 0000000000000000
  R04 = 0000000000000019   R20 = c0000000a4bc4000
  R05 = 0000000000000100   R21 = ffffffffffffff7f
  R06 = 0000000000000000   R22 = c00000000433d758
  R07 = 0000000000000000   R23 = c00000000433d738
  R08 = 0000000000034995   R24 = 0000000000000000
  R09 = 0000000000000000   R25 = 0000000000000000
  R10 = 0000000080000000   R26 = c0000000f70351d8
  R11 = c0000000a4bc7a40   R27 = 0000000000000000
  R12 = 0000000000002200   R28 = 0000000000000001
  R13 = c00000000fb80000   R29 = c00000000433d728
  R14 = 0000000000000000   R30 = c0000000f7035150
  R15 = c0000000f70351d8   R31 = 0000000000000000
  pc  = c00000000036c120 locked_inode_to_wb_and_lock_list+0x50/0x290
  cfar= c0000000000b2a14 kvmppc_save_tm+0x168/0x16c
  lr  = c00000000036f790 writeback_sb_inodes+0x310/0x590
  msr = 8000000000009033   cr  = 24002482
  ctr = c000000000381e30   xer = 0000000000000000   trap =  300
  dar = 0000000000000000   dsisr = 40000000
  0:mon> S
  msr    = 8000000000001033  sprg0 = 0000000000000000
  pvr    = 00000000004b0201  sprg1 = c00000000fb80000
  dec    = 00000000b56746ff  sprg2 = c00000000fb80000
  sp     = c0000000a4bc7100  sprg3 = 0000000000000000
  toc    = c00000000143c900  dar   = 0000000000000400
  srr0   = 000000000008c59c  srr1  = 0000000000001033 dsisr  = 40000000
  dscr   = 0000000000000000  ppr   = 0000000000000000 pir    = 00000030
  dpdes  = 0000000000000000  tir   = 0000000000000000 cir    = 00000000
  fscr   = 0000000000000180  tar   = 0000000000000000 pspb   = 00000000
  mmcr0  = 0000000080000000  mmcr1 = 0000000000000000 mmcr2  = 0000000000000000
  pmc1   = 00000000 pmc2 = 00000000  pmc3 = 00000000  pmc4   = 00000000
  mmcra  = 0000000000000000   siar = 0000000000000000 pmc5   = b9ad0e28
  sdar   = 0000000000000000   sier = 0000000000000000 pmc6   = 7f0fdfbe
  ebbhr  = 0000000000000000  ebbrr = 0000000000000000 bescr  = 0000000000000000
  0:mon> 

  
  Crash is due to Kernel hitting a DSI  while executing 
locked_inode_to_wb_and_lock_list routine.

  == Comment: #8 - VIPIN K. PARASHAR <vipar...@in.ibm.com> - 2017-03-03 
05:07:03 ==
  Its crashing at fs/fs-writeback.c

  static struct bdi_writeback *
  locked_inode_to_wb_and_lock_list(struct inode *inode)
          __releases(&inode->i_lock)
          __acquires(&wb->list_lock)
  {
          while (true) {
                  struct bdi_writeback *wb = inode_to_wb(inode);

                  /*
                   * inode_to_wb() association is protected by both
                   * @inode->i_lock and @wb->list_lock but list_lock nests
                   * outside i_lock.  Drop i_lock and verify that the
                   * association hasn't changed after acquiring list_lock.
                   */
                  wb_get(wb);                                                   
 <-----------
                  spin_unlock(&inode->i_lock);
                  spin_lock(&wb->list_lock);

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1702998/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to