================
@@ -849,19 +849,15 @@ void AMDGPUToolChain::addClangTargetOptions(
     CC1Args.push_back("-fapply-global-visibility-to-externs");
   }
 
-  // For SPIR-V we want to retain the pristine output of Clang CodeGen, since
-  // optimizations might lose structure / information that is necessary for
-  // generating optimal concrete AMDGPU code.
-  // TODO: using the below option is a temporary placeholder until Clang
-  //       provides the required functionality, which essentially boils down to
-  //       -O0 being refactored / reworked to not imply optnone / remove TBAA.
-  //       Once that is added, we should pivot to that functionality, being
-  //       mindful to not corrupt the user provided and subsequently embedded
-  //       command-line (i.e. if the user asks for -O3 this is what the
-  //       finalisation should use).
+  // For SPIR-V, enable basic optimizations but disable target-specific
+  // transformations that could harm JIT performance.
   if (getTriple().isSPIRV() &&
----------------
arsenm wrote:

Just remove the whole block. Optimization decisions are for the target pass 
pipeline, clang shouldn't be randomly picking and choosing debug flags to set 

https://github.com/llvm/llvm-project/pull/189383
_______________________________________________
cfe-commits mailing list
[email protected]
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to