tra abandoned this revision.
tra added a comment.
Emitting IR is not sufficient to ensure that the kernels survive GDCE, so the
patch does not work with optimizations on.
http://reviews.llvm.org/D11666 would have to do for now.
http://reviews.llvm.org/D11993
_
eliben accepted this revision.
eliben added a comment.
lgtm
Comment at: test/CodeGenCUDA/ptx-kernels.cu:26
@@ -16,1 +25,2 @@
// CHECK: !{{[0-9]+}} = !{void ()* @global_function, !"kernel", i32 1}
+// CHECK: !{{[0-9]+}} = !{void (i32)* @_Z16templated_kernelIiEvT_, !"kernel",
i3
echristo accepted this revision.
echristo added a comment.
This revision is now accepted and ready to land.
LGTM. Thanks for working on this.
-eric
http://reviews.llvm.org/D11993
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.
tra created this revision.
tra added reviewers: echristo, rsmith, eliben.
tra added a subscriber: cfe-commits.
This is a somewhat different way to do it than D11666 which got rolled back.
Codegen postpones emitting instantiated kernel function template until it's
used.
If kernel is used only fro