Public bug reported:

When running Comfy-UI with Strix Halo an illegal memory access occurs
intermittently like below:

  0%| | 0/20 [00:00<?, ?it/s]
  5%|▌ | 1/20 [00:01<00:23, 1.21s/it]
 10%|â–ˆ | 2/20 [00:06<01:01, 3.43s/it]
 15%|█▌ | 3/20 [00:11<01:10, 4.14s/it]
 20%|██ | 4/20 [00:16<01:11, 4.49s/it]
 25%|██▌ | 5/20 [00:21<01:13, 4.90s/it]
 30%|███ | 6/20 [00:26<01:09, 4.93s/it]
 35%|███▌ | 7/20 [00:31<01:04, 4.96s/it]
 40%|████ | 8/20 [00:36<00:59, 4.98s/it]
 45%|████▌ | 9/20 [00:41<00:54, 4.98s/it]
 50%|█████ | 10/20 [00:46<00:49, 4.99s/it]
 55%|█████▌ | 11/20 [00:52<00:46, 5.20s/it]
 60%|██████ | 12/20 [00:57<00:41, 5.14s/it]
 65%|██████▌ | 13/20 [01:02<00:35, 5.09s/it]
 70%|███████ | 14/20 [01:07<00:30, 5.07s/it]
 75%|███████▌ | 15/20 [01:12<00:25, 5.06s/it]
 80%|████████ | 16/20 [01:18<00:21, 5.26s/it]
 85%|████████▌ | 17/20 [01:23<00:15, 5.24s/it]
 90%|█████████ | 18/20 [01:28<00:10, 5.24s/it]
 95%|█████████▌| 19/20 [01:33<00:05, 5.24s/it]
100%|██████████| 20/20 [01:39<00:00, 5.24s/it]
100%|██████████| 20/20 [01:39<00:00, 4.96s/it]
/home/taccuser/workspace/7.1-18/dart/scripts/rocm-on-radeon/semi-automated-scripts/pyt_rocm_flux_inf_comfy-ui/ComfyUI/comfy/sd.py:694:
 UserWarning: HIP warning: an illegal memory access was encountered (Triggered 
internally at 
/pytorch/aten/src/ATen/hip/impl/HIPGuardImplMasqueradingAsCUDA.h:83.)
  out = self.process_output(self.first_stage_model.decode(samples, 
**vae_options).to(self.output_device).float())
!!! Exception during processing !!! HIP error: an illegal memory access was 
encountered

This issue is caused by a mismatch of expectations between the KFD and
thunk CWSR save area size. It is fixed by this change:

https://lore.kernel.org/amd-
gfx/[email protected]/

** Affects: linux-oem-6.14 (Ubuntu)
     Importance: Undecided
         Status: Invalid

** Affects: linux-oem-6.17 (Ubuntu)
     Importance: Undecided
         Status: Invalid

** Affects: linux-oem-6.14 (Ubuntu Noble)
     Importance: Undecided
         Status: New

** Affects: linux-oem-6.17 (Ubuntu Noble)
     Importance: Undecided
         Status: New


** Tags: originate-from-2131159

** Also affects: linux-oem-6.17 (Ubuntu)
   Importance: Undecided
       Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2131160

Title:
  Illegal memory access with Strix Halo

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-oem-6.14/+bug/2131160/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to