Andrea,  thanks for the updated kernels.

On the first one, I got 23 installs before I ran into an issue; I'll
test the newer kernel next.

https://paste.ubuntu.com/p/2B4Kk3wbvQ/

[ 5436.870482] BUG: unable to handle kernel NULL pointer dereference at 
00000000000009b8
[ 5436.873374] IP: cache_set_flush+0xf6/0x190 [bcache]
[ 5436.875208] PGD 0 P4D 0 
[ 5436.876488] Oops: 0000 [#1] SMP PTI
[ 5436.877842] Modules linked in: zfs(PO) zunicode(PO) zavl(PO) icp(PO) 
zcommon(PO) znvpair(PO) spl(O) nls_utf8 isofs ppdev nls_iso8859_1 kvm_intel kvm 
irqbypass joydev input_leds serio_raw parport_pc parport qemu_fw_cfg mac_hid 
sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp 
libiscsi scsi_transport_iscsi virtio_rng ip_tables x_tables autofs4 btrfs 
zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor 
async_tx xor raid6_pq libcrc32c raid0 multipath linear raid1 bcache psmouse 
nvme nvme_core virtio_blk virtio_net virtio_scsi floppy i2c_piix4 pata_acpi
[ 5436.896104] CPU: 0 PID: 11216 Comm: kworker/0:1 Tainted: P           O     
4.15.0-54-generic #58+lp1796292+1
[ 5436.899985] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 
0.0.0 02/06/2015
[ 5436.902645] Workqueue: events cache_set_flush [bcache]
[ 5436.904374] RIP: 0010:cache_set_flush+0xf6/0x190 [bcache]
[ 5436.906183] RSP: 0018:ffffab52826bbe58 EFLAGS: 00010202
[ 5436.909050] RAX: 0000000000000000 RBX: ffff94d104aa0cc0 RCX: 0000000000000000
[ 5436.911939] RDX: 0000000000000000 RSI: 0000000020000001 RDI: 0000000000000000
[ 5436.914448] RBP: ffffab52826bbe78 R08: ffff94d13f61ac30 R09: ffff94d13f342b98
[ 5436.917113] R10: ffffab52803b3d10 R11: 00000000000002c6 R12: 0000000000000001
[ 5436.919210] R13: ffff94d13f622140 R14: ffff94d104aa0db8 R15: 0000000000000000
[ 5436.921401] FS:  0000000000000000(0000) GS:ffff94d13f600000(0000) 
knlGS:0000000000000000
[ 5436.923743] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5436.926299] CR2: 00000000000009b8 CR3: 0000000038252000 CR4: 00000000000006f0
[ 5436.929093] Call Trace:
[ 5436.930818]  process_one_work+0x1de/0x410
[ 5436.932818]  worker_thread+0x32/0x410
[ 5436.935332]  kthread+0x121/0x140
[ 5436.937309]  ? process_one_work+0x410/0x410
[ 5436.939393]  ? kthread_create_worker_on_cpu+0x70/0x70
[ 5436.941263]  ret_from_fork+0x35/0x40
[ 5436.943060] Code: b8 00 00 00 a8 02 74 c8 31 f6 4c 89 e7 e8 43 0e ff ff eb 
bc 66 83 bb 4c f7 ff ff 00 48 8b 83 58 ff ff ff 74 31 41 bc 01 00 00 00 <48> 8b 
b8 b8 09 00 00 48 85 ff 74 05 e8 f9 9d 0d d3 0f b7 8b 4c 
[ 5436.950188] RIP: cache_set_flush+0xf6/0x190 [bcache] RSP: ffffab52826bbe58
[ 5436.952796] CR2: 00000000000009b8
[ 5436.954567] ---[ end trace b771415397e98c3d ]---

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1796292

Title:
  Tight timeout for bcache removal causes spurious failures

Status in curtin:
  Fix Released
Status in linux package in Ubuntu:
  Confirmed
Status in linux source package in Bionic:
  Confirmed
Status in linux source package in Cosmic:
  Confirmed
Status in linux source package in Disco:
  Confirmed
Status in linux source package in Eoan:
  Confirmed

Bug description:
  I've had a number of deployment faults where curtin would report
  Timeout exceeded for removal of /sys/fs/bcache/xxx when doing a mass-
  deployment of 30+ nodes. Upon retrying the node would usually deploy
  fine. Experimentally I've set the timeout ridiculously high, and it
  seems I'm getting no faults with this. I'm wondering if the timeout
  for removal is set too tight, or might need to be made configurable.

  --- curtin/util.py~     2018-05-18 18:40:48.000000000 +0000
  +++ curtin/util.py      2018-10-05 09:40:06.807390367 +0000
  @@ -263,7 +263,7 @@
       return _subp(*args, **kwargs)
   
   
  -def wait_for_removal(path, retries=[1, 3, 5, 7]):
  +def wait_for_removal(path, retries=[1, 3, 5, 7, 1200, 1200]):
       if not path:
           raise ValueError('wait_for_removal: missing path parameter')

To manage notifications about this bug go to:
https://bugs.launchpad.net/curtin/+bug/1796292/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to