@kaihengfeng So v5.7 was fine, and after many reboots the bisect shows that the commit below introduced the issue.
Do I also need to find when the issue was resolved (somewhere between v5.8-rc1 and v5.9.10), or is this information enough?

54b2fcee1db041a83b52b51752dade6090cf952f is the first bad commit
commit 54b2fcee1db041a83b52b51752dade6090cf952f
Author: Keith Busch <kbu...@kernel.org>
Date:   Mon Apr 27 11:54:46 2020 -0700

    nvme-pci: remove last_sq_tail

    The nvme driver does not have enough tags to wrap the queue, and
    blk-mq will no longer call commit_rqs() when there are no new
    submissions to notify.

    Signed-off-by: Keith Busch <kbu...@kernel.org>
    Reviewed-by: Sagi Grimberg <s...@grimberg.me>
    Signed-off-by: Christoph Hellwig <h...@lst.de>
    Signed-off-by: Jens Axboe <ax...@kernel.dk>

 drivers/nvme/host/pci.c | 23 ++++-------------------
 1 file changed, 4 insertions(+), 19 deletions(-)

And my $ git bisect log is the following, FWIW.

git bisect start
# good: [3d77e6a8804abcc0504c904bd6e5cdf3a5cf8162] Linux 5.7
git bisect good 3d77e6a8804abcc0504c904bd6e5cdf3a5cf8162
# bad: [b3a9e3b9622ae10064826dccb4f7a52bd88c7407] Linux 5.8-rc1
git bisect bad b3a9e3b9622ae10064826dccb4f7a52bd88c7407
# bad: [ee01c4d72adffb7d424535adf630f2955748fa8b] Merge branch 'akpm' (patches from Andrew)
git bisect bad ee01c4d72adffb7d424535adf630f2955748fa8b
# bad: [16d91548d1057691979de4686693f0ff92f46000] Merge tag 'xfs-5.8-merge-8' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux
git bisect bad 16d91548d1057691979de4686693f0ff92f46000
# good: [cfa3b8068b09f25037146bfd5eed041b78878bee] Merge tag 'for-linus-hmm' of git://git.kernel.org/pub/scm/linux/kernel/git/rdma/rdma
git bisect good cfa3b8068b09f25037146bfd5eed041b78878bee
# good: [3fd911b69b3117e03181262fc19ae6c3ef6962ce] Merge tag 'drm-misc-next-2020-05-07' of git://anongit.freedesktop.org/drm/drm-misc into drm-next
git bisect good 3fd911b69b3117e03181262fc19ae6c3ef6962ce
# good: [1966391fa576e1fb2701be8bcca197d8f72737b7] mm/migrate.c: attach_page_private already does the get_page
git bisect good 1966391fa576e1fb2701be8bcca197d8f72737b7
# bad: [0c8d3fceade2ab1bbac68bca013e62bfdb851d19] bcache: configure the asynchronous registertion to be experimental
git bisect bad 0c8d3fceade2ab1bbac68bca013e62bfdb851d19
# bad: [84b8d0d7aa159652dc191d58c4d353b6c9173c54] nvmet: use type-name map for ana states
git bisect bad 84b8d0d7aa159652dc191d58c4d353b6c9173c54
# good: [72e6329f86c714785ac195d293cb19dd24507880] nvme-fc and nvmet-fc: revise LLDD api for LS reception and LS request
git bisect good 72e6329f86c714785ac195d293cb19dd24507880
# good: [e4fcc72c1a420bdbe425530dd19724214ceb44ec] nvmet-fc: slight cleanup for kbuild test warnings
git bisect good e4fcc72c1a420bdbe425530dd19724214ceb44ec
# good: [31fdad7be18992606078caed6ff71741fa76310a] nvme: consolodate io settings
git bisect good 31fdad7be18992606078caed6ff71741fa76310a
# bad: [2a5bcfdd41d68559567cec3c124a75e093506cc1] nvme-pci: align io queue count with allocted nvme_queue in nvme_probe
git bisect bad 2a5bcfdd41d68559567cec3c124a75e093506cc1
# good: [6623c5b3dfa5513190d729a8516db7a5163ec7de] nvme: clean up error handling in nvme_init_ns_head
git bisect good 6623c5b3dfa5513190d729a8516db7a5163ec7de
# good: [74943d45eef4db64b1e5c9f7ad1d018576e113c5] nvme-pci: remove volatile cqes
git bisect good 74943d45eef4db64b1e5c9f7ad1d018576e113c5
# bad: [54b2fcee1db041a83b52b51752dade6090cf952f] nvme-pci: remove last_sq_tail
git bisect bad 54b2fcee1db041a83b52b51752dade6090cf952f
# first bad commit: [54b2fcee1db041a83b52b51752dade6090cf952f] nvme-pci: remove last_sq_tail

--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1910866

Title:
  nvme drive fails after some time

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1910866/+subscriptions

--
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
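On the question above of finding where the issue was resolved: if that turns out to be needed, a reversed bisect could be set up roughly as sketched below. This is only a sketch under assumptions: the clone must contain both v5.8-rc1 and v5.9.10 (for example a linux-stable checkout, which is an assumption about the tree in use), and the helper name start_fix_bisect is purely illustrative. It relies on git bisect's custom terms, so "broken" marks the old state and "fixed" marks the new one.

```shell
# Sketch: start a "reverse" bisect that hunts for the commit that FIXED the bug.
# start_fix_bisect is a hypothetical helper name, not part of git.
start_fix_bisect() {
    broken_rev=$1   # oldest revision known to still show the bug
    fixed_rev=$2    # newest revision known to work again

    # Custom terms replace the usual good/bad wording: the old state is
    # "broken" and the new state is "fixed".
    git bisect start --term-old=broken --term-new=fixed &&
    git bisect broken "$broken_rev" &&
    git bisect fixed "$fixed_rev"
}

# After building and booting each kernel that git checks out, mark it with:
#   git bisect broken   # the drive still fails
#   git bisect fixed    # the drive keeps working
# and finish with "git bisect reset" once the first fixed commit is reported.
```

For this bug the call would presumably be `start_fix_bisect v5.8-rc1 v5.9.10`, followed by the same reboot-and-mark routine as in the log above.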