On 4/18/2023 6:17 PM, Felix Kuehling wrote:

On 2023-04-13 23:27, Chen, Xiaogang wrote:

On 4/13/2023 3:08 PM, Felix Kuehling wrote:
Am 2023-04-12 um 02:14 schrieb Xiaogang.Chen:
From: Xiaogang Chen<xiaogang.c...@amd.com>

The userptr buffer restore process has the following issues:

1: amdgpu_ttm_tt_get_user_pages can fail (-EFAULT). If it fails we should not mark the buffer valid (mem->invalid = 0); in this case mem has no hmm range or user pages associated.

We don't want to suspend the process indefinitely when this happens. This can happen if usermode calls munmap before unregistering the userptr. What we want to happen in this case is, the process should resume. If it accesses the virtual address, it will result in a page fault, which alerts the application to its mistake. If it doesn't access the virtual address, then there is no harm.

It's a good catch that there is no useful hmm_range in this case to check validity, so we should not warn about it in confirm_valid_user_pages_locked.

Not sure why you said "suspend the process indefinitely". When mem (kgd_mem) has no hmm_range because amdgpu_ttm_tt_get_user_pages failed, the patch does not mark it valid (mem->invalid stays != 0) and keeps it on userptr_inval_list. The process has not been suspended.

User mode queues are stopped. Until the queues are restarted, the process is effectively suspended (for GPU execution). If invalid userptr mappings cause restore to fail, that means, the GPU queues will remain stopped. That's what I mean with "suspend the process indefinitely".


My understanding is that your concern is the restore process rescheduling itself indefinitely. In confirm_valid_user_pages_locked the real question is whether mem has an hmm range associated. If it has one and mem->invalid is true, I reschedule the next attempt. If mem has no hmm range, it is kept on the invalid list and does not trigger a reschedule.

Yes, in this customer application case amdgpu_ttm_tt_get_user_pages failed at vma_lookup. I think the application unmapped its buffer before unregistering it from KFD. This patch handles amdgpu_ttm_tt_get_user_pages failure in general: do not mark the buffer valid (mem->invalid != 0), keep it on userptr_inval_list, then in confirm_valid_user_pages_locked check whether mem has an hmm range associated before the WARN.

I think it would be easier to just mark it as valid. mem->invalid == 0 means, it's safe to resume the user mode queues. For userptrs without a valid VMA this is the case as the corresponding page table entries have been invalidated (V=0).

I keep mem->invalid != 0 because the buffer has no hmm range associated and stays on the invalid list. I think that is consistent with its status.


2: The mmu notifier can run concurrently and update mem->range->notifier->invalidate_seq, but not mem->range->notifier_seq. That leaves mem->range->notifier_seq stale when mem is on process_info->userptr_inval_list and amdgpu_amdkfd_restore_userptr_worker gets interrupted. At the next rescheduled attempt we compare the stale mem->range->notifier_seq with mem->range->notifier->invalidate_seq.

amdgpu_hmm_range_get_pages updates mem->range->notifier_seq with the current mem->range->notifier->invalidate_seq. If an eviction happens after this, there is a collision and the range needs to be revalidated. I think when you say "mem->range->notifier_seq is stale", it means there was a collision. When this happens, mem->invalid should be set to true at the same time. So confirm_valid_user_pages_locked should not complain because mem->invalid and amdgpu_ttm_tt_get_user_pages_done should agree that the range is invalid.
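The invariant Felix describes (a collision bumps the notifier sequence and marks the range invalid together, so the later comparison and mem->invalid always agree) can be modeled in plain userspace C. This is a toy sketch with invented names, not the actual amdgpu/hmm structures:

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for the kernel structures under discussion */
struct notifier { unsigned long invalidate_seq; };
struct range {
	struct notifier *notifier;
	unsigned long    notifier_seq;  /* seq recorded when pages were fetched */
};
struct kmem {
	struct range range;
	int          invalid;           /* mirrors mem->invalid */
};

/* models amdgpu_hmm_range_get_pages: fetch pages and record the
 * current sequence number in the same step */
static void get_user_pages(struct kmem *m)
{
	m->range.notifier_seq = m->range.notifier->invalidate_seq;
	m->invalid = 0;
}

/* models an eviction (evict_userptr): bump the sequence AND mark the
 * buffer invalid under the same lock */
static void evict(struct kmem *m)
{
	m->range.notifier->invalidate_seq++;
	m->invalid = 1;
}

/* models amdgpu_ttm_tt_get_user_pages_done: the pages are still valid
 * iff no invalidation happened since they were fetched */
static bool pages_done(struct kmem *m)
{
	return m->range.notifier_seq == m->range.notifier->invalidate_seq;
}
```

Because evict() updates both fields atomically (in the model, in one function; in the kernel, under notifier_lock), pages_done() and m->invalid can never disagree, which is why the WARN in confirm_valid_user_pages_locked should be unreachable in this path.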

Yes, "mem->range->notifier_seq is stale" means it differs from mem->range->notifier->invalidate_seq. It is caused by an mmu notifier firing concurrently on a buffer that is still in the restore process. For this case the patch updates mem->range->notifier_seq:

+            if (mem->range)
+                mem->range->notifier_seq = mem->range->notifier->invalidate_seq;

You should not update mem->range->notifier_seq without also getting an up-to-date page address list. Matching sequence numbers indicate that your page list is up to date. If you just update the sequence number, you're basically lying to yourself.

You need to call amdgpu_hmm_range_get_pages to get an updated page list and update the mem->range->notifier_seq at the same time. There is no need to do this in more than one place. That's in update_invalid_user_pages.
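Felix's point here can be shown with another toy C model (invented names, not the real API): the sequence number only certifies the page list that was fetched with it, so copying the sequence number alone leaves a stale page list that now passes the check.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model: the page list has a version; an invalidation
 * moves the current version forward and bumps invalidate_seq together. */
struct mm_state { unsigned long invalidate_seq; unsigned long pages_version; };
struct userptr  { unsigned long notifier_seq;   unsigned long snapshot; };

static void invalidate(struct mm_state *mm)
{
	mm->invalidate_seq++;
	mm->pages_version++;  /* the previously fetched page addresses are now wrong */
}

/* Correct update site (models amdgpu_hmm_range_get_pages in
 * update_invalid_user_pages): refetch the pages and record the
 * sequence number at the same time */
static void get_pages(struct userptr *u, struct mm_state *mm)
{
	u->snapshot     = mm->pages_version;
	u->notifier_seq = mm->invalidate_seq;
}

static bool seq_matches(struct userptr *u, struct mm_state *mm)
{
	return u->notifier_seq == mm->invalidate_seq;
}

static bool pages_correct(struct userptr *u, struct mm_state *mm)
{
	return u->snapshot == mm->pages_version;
}
```

If, after an invalidate(), only u->notifier_seq is copied from mm->invalidate_seq, seq_matches() returns true while pages_correct() is false: the check now claims an up-to-date page list that was never refetched, which is the "lying to yourself" failure mode.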


OK, it may be redundant. In the next revision I will remove it and depend on the next scheduled attempt to update mem->range->notifier_seq via amdgpu_ttm_tt_get_user_pages.

Then the restore process skips confirm_valid_user_pages_locked and jumps to the next scheduled attempt: "goto unlock_notifier_out".

"At next rescheduled next attempt we use stale mem->range->notifier_seq": This is not really stale. The notifier_seq indicates whether the pages returned by the last call to amdgpu_hmm_range_get_pages are still valid. If it's "stale", it means an invalidation (evict_userptr) happened and we need to amdgpu_hmm_range_get_pages again. In theory, if an invalidation happened since the last call, then mem->invalid should also be true. So again, the sequence numbers and mem->invalid should agree and there should be no warning.

When an invalidation (evict_userptr) happens concurrently, the restore process schedules the next attempt. mem->invalid is set to true by evict_userptr, and the sequence numbers are updated as well. Both agree, and with this patch we do not see the WARN.

Why do they disagree without this patch? I think what you did there (updating the sequence number without getting updated pages) is incorrect. If the sequence number and mem->invalid are updated together under the same lock, they should always agree. There should be no need to mess with the sequence numbers after the fact.

I did not mean that mem->invalid and the sequence number disagree. I meant that mem->range->notifier_seq and mem->range->notifier->invalidate_seq disagree. We can update mem->range->notifier_seq at the next attempt.
Regards,
  Felix


The warning messages printed in confirm_valid_user_pages_locked indicate that there is a mismatch between the sequence numbers and mem->invalid. As I understand it, such a mismatch should be impossible. Unless there are some bad assumptions in the code. I haven't figured out what those bad assumptions are yet. Other than the case for -EFAULT you pointed out above.

From my debugging of this customer app, the warnings are due to not handling the case where amdgpu_hmm_range_get_pages returns -EFAULT and an mmu notifier fires on the same buffer concurrently. That led to using a mem with no hmm range associated, and a stale mem->range->notifier_seq, in the following operations. With this patch there are no warning messages, and I do not see other errors.

Xiaogang

Regards,
  Felix



Signed-off-by: Xiaogang Chen<xiaogang.c...@amd.com>
---
  .../gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c  | 45 +++++++++++++++----
  1 file changed, 37 insertions(+), 8 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
index 7b1f5933ebaa..6881f1b0844c 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
@@ -2444,7 +2444,9 @@ static int update_invalid_user_pages(struct amdkfd_process_info *process_info,
              ret = -EAGAIN;
              goto unlock_out;
          }
-        mem->invalid = 0;
+         /* set mem valid if mem has hmm range associated */
+        if (mem->range)
+            mem->invalid = 0;
      }
    unlock_out:
@@ -2576,16 +2578,28 @@ static int confirm_valid_user_pages_locked(struct amdkfd_process_info *process_i
      list_for_each_entry_safe(mem, tmp_mem,
                   &process_info->userptr_inval_list,
                   validate_list.head) {
-        bool valid = amdgpu_ttm_tt_get_user_pages_done(
-                mem->bo->tbo.ttm, mem->range);
+        /* Only check mem with hmm range associated */
+        bool valid;
-        mem->range = NULL;
-        if (!valid) {
-            WARN(!mem->invalid, "Invalid BO not marked invalid");
+        if (mem->range) {
+            valid = amdgpu_ttm_tt_get_user_pages_done(
+                    mem->bo->tbo.ttm, mem->range);
+
+            mem->range = NULL;
+            if (!valid) {
+                WARN(!mem->invalid, "Invalid BO not marked invalid");
+                ret = -EAGAIN;
+                continue;
+            }
+        } else
+            /* keep mem without hmm range at userptr_inval_list */
+            continue;
+
+        if (mem->invalid) {
+            WARN(1, "Valid BO is marked invalid");
              ret = -EAGAIN;
              continue;
          }
-        WARN(mem->invalid, "Valid BO is marked invalid");
          list_move_tail(&mem->validate_list.head,
                 &process_info->userptr_valid_list);
@@ -2644,8 +2658,23 @@ static void amdgpu_amdkfd_restore_userptr_worker(struct work_struct *work)
       * reference counting inside KFD will handle this case.
       */
      mutex_lock(&process_info->notifier_lock);
-    if (process_info->evicted_bos != evicted_bos)
+    if (process_info->evicted_bos != evicted_bos) {
+        /* The mmu notifier interrupted amdgpu_amdkfd_restore_userptr_worker;
+         * before rescheduling the next attempt, update the stale
+         * mem->range->notifier_seq of BOs on userptr_inval_list
+         */
+        struct kgd_mem *mem, *tmp_mem;
+
+        list_for_each_entry_safe(mem, tmp_mem,
+                &process_info->userptr_inval_list,
+                validate_list.head) {
+
+            if (mem->range)
+                mem->range->notifier_seq = mem->range->notifier->invalidate_seq;
+        }
+
          goto unlock_notifier_out;
+    }
        if (confirm_valid_user_pages_locked(process_info)) {
          WARN(1, "User pages unexpectedly invalid");
