[PATCH] D89980: [hip] Remove kernel argument coercion.

Matt Arsenault via Phabricator via cfe-commits Tue, 27 Oct 2020 09:31:23 -0700

arsenm added inline comments.


================
Comment at: clang/test/CodeGenCUDA/amdgpu-kernel-arg-pointer-type.cu:19
+// COMMON-LABEL: define amdgpu_kernel void @_Z7kernel1Pi(i32*{{.*}} %x)
+// OPT: [[VAL:%.*]] = load i32, i32* %x, align 4
 // OPT: [[INC:%.*]] = add nsw i32 [[VAL]], 1
----------------
hliao wrote:
> arsenm wrote:
> > This is still a regression. Fixing up AA does not solve the problem this 
> > promotions this is intended to solve. Generic accesses are worse 
> > independently of the aliasing properties
> Do you mean FLAT load/store has worse addressing mode than GLOBAL ones?
Yes. The flat offsets have a smaller range, and do not have the saddr mode. 
Flat accesses also won't avoid the extra lgmkcnt wait


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D89980/new/

https://reviews.llvm.org/D89980

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[PATCH] D89980: [hip] Remove kernel argument coercion.

Reply via email to