[clang] 8bd7e41 - Replace separator in OpenMP variant name mangling.

2020-06-03 Thread Shilei Tian via cfe-commits
Author: Lukas Sommer Date: 2020-06-03T16:36:32-04:00 New Revision: 8bd7e4188a096b063065aac70ce39129c479f124 URL: https://github.com/llvm/llvm-project/commit/8bd7e4188a096b063065aac70ce39129c479f124 DIFF: https://github.com/llvm/llvm-project/commit/8bd7e4188a096b063065aac70ce39129c479f124.diff

[clang] 0f0ce3c - [OpenMP][NVPTX] Take functions in `deviceRTLs` as `convergent`

2021-02-03 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-02-03T20:58:12-05:00 New Revision: 0f0ce3c12edefd25448e39c4d20718a10d3d42c1 URL: https://github.com/llvm/llvm-project/commit/0f0ce3c12edefd25448e39c4d20718a10d3d42c1 DIFF: https://github.com/llvm/llvm-project/commit/0f0ce3c12edefd25448e39c4d20718a10d3d42c1.diff L

[clang] ebb1092 - [Clang][OpenMP] Added support for nowait target in CodeGen via regular task

2020-09-25 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2020-09-25T22:10:36-04:00 New Revision: ebb1092a2875739d3e9bb6b1fb230c0e0d88ebff URL: https://github.com/llvm/llvm-project/commit/ebb1092a2875739d3e9bb6b1fb230c0e0d88ebff DIFF: https://github.com/llvm/llvm-project/commit/ebb1092a2875739d3e9bb6b1fb230c0e0d88ebff.diff L

[clang] e20d64c - [Clang][OpenMP] Fixed an issue of segment fault when using target nowait

2020-10-26 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2020-10-26T22:33:22-04:00 New Revision: e20d64c3d9d81cad701f31d8481367222c76c787 URL: https://github.com/llvm/llvm-project/commit/e20d64c3d9d81cad701f31d8481367222c76c787 DIFF: https://github.com/llvm/llvm-project/commit/e20d64c3d9d81cad701f31d8481367222c76c787.diff L

[clang] d38788b - [Clang][OpenMP] Avoid unnecessary privatization of mapper array when there is no user defined mapper

2020-10-26 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2020-10-27T00:02:32-04:00 New Revision: d38788b357ad29083eb3b03027db908559aae2a6 URL: https://github.com/llvm/llvm-project/commit/d38788b357ad29083eb3b03027db908559aae2a6 DIFF: https://github.com/llvm/llvm-project/commit/d38788b357ad29083eb3b03027db908559aae2a6.diff L

[clang] 0661328 - [Clang][OpenMP] Added the support for target data nowait

2020-10-28 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2020-10-28T15:53:30-04:00 New Revision: 0661328d7efb81a8ac7f2ca0734a65f9be105f29 URL: https://github.com/llvm/llvm-project/commit/0661328d7efb81a8ac7f2ca0734a65f9be105f29 DIFF: https://github.com/llvm/llvm-project/commit/0661328d7efb81a8ac7f2ca0734a65f9be105f29.diff L

[clang] 82e537a - [Clang][OpenMP] Fixed an issue that clang crashed when compiling OpenMP program in device only mode without host IR

2021-01-19 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-01-19T14:18:42-05:00 New Revision: 82e537a9d28a2c18bd1637e2eac0e0af658ed829 URL: https://github.com/llvm/llvm-project/commit/82e537a9d28a2c18bd1637e2eac0e0af658ed829 DIFF: https://github.com/llvm/llvm-project/commit/82e537a9d28a2c18bd1637e2eac0e0af658ed829.diff L

[clang] 3809e5d - [Clang][OpenMP] Use `clang_cc1` test for `declare_target_device_only_compilation.cpp`

2021-01-20 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-01-20T20:34:10-05:00 New Revision: 3809e5dac965e7c25f3c286884a7af6e48946865 URL: https://github.com/llvm/llvm-project/commit/3809e5dac965e7c25f3c286884a7af6e48946865 DIFF: https://github.com/llvm/llvm-project/commit/3809e5dac965e7c25f3c286884a7af6e48946865.diff L

[clang] 5ad038a - [Clang][OpenMP][NVPTX] Replace `libomptarget-nvptx-path` with `libomptarget-nvptx-bc-path`

2021-01-23 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-01-23T14:42:38-05:00 New Revision: 5ad038aafa3a07a4491bf12cf6edf2026f3f17d1 URL: https://github.com/llvm/llvm-project/commit/5ad038aafa3a07a4491bf12cf6edf2026f3f17d1 DIFF: https://github.com/llvm/llvm-project/commit/5ad038aafa3a07a4491bf12cf6edf2026f3f17d1.diff L

[clang] 63b42a0 - [NFC] clang/test/openMP/target_codegen.cpp should not depend on ssa name

2021-01-07 Thread Shilei Tian via cfe-commits
Author: Jeroen Dobbelaere Date: 2021-01-07T16:39:17-05:00 New Revision: 63b42a0514567d24df617e4587e80e4564ebf120 URL: https://github.com/llvm/llvm-project/commit/63b42a0514567d24df617e4587e80e4564ebf120 DIFF: https://github.com/llvm/llvm-project/commit/63b42a0514567d24df617e4587e80e4564ebf120.d

[clang] 3274cdc - [Clang][OpenMP] Remove the mandatory flush for capture for OpenMP 5.1

2021-07-26 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-07-26T11:00:44-04:00 New Revision: 3274cdc83ecdf2af569ad4f564d55d0e43b1072e URL: https://github.com/llvm/llvm-project/commit/3274cdc83ecdf2af569ad4f564d55d0e43b1072e DIFF: https://github.com/llvm/llvm-project/commit/3274cdc83ecdf2af569ad4f564d55d0e43b1072e.diff L

[clang] 33d6609 - [Clang][OpenMP] Update driver test case for OpenMP offload to use sm_35

2021-02-20 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-02-20T15:14:13-05:00 New Revision: 33d660939d9acb027f9941f037802936124dad8c URL: https://github.com/llvm/llvm-project/commit/33d660939d9acb027f9941f037802936124dad8c DIFF: https://github.com/llvm/llvm-project/commit/33d660939d9acb027f9941f037802936124dad8c.diff L

[clang] 76151ac - [Clang][OpenMP] Require CUDA 9.2+ for OpenMP offloading on NVPTX target

2021-02-22 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-02-22T11:00:33-05:00 New Revision: 76151acf893ac7bdaf460fdfe8a8494cf477157a URL: https://github.com/llvm/llvm-project/commit/76151acf893ac7bdaf460fdfe8a8494cf477157a DIFF: https://github.com/llvm/llvm-project/commit/76151acf893ac7bdaf460fdfe8a8494cf477157a.diff L

[clang] c41ae24 - [OpenMP][Clang][NVPTX] Only build one bitcode library for each SM

2021-03-08 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-03-08T12:03:04-05:00 New Revision: c41ae246ac673e97ec1abdc2b9cbe1989f8682fe URL: https://github.com/llvm/llvm-project/commit/c41ae246ac673e97ec1abdc2b9cbe1989f8682fe DIFF: https://github.com/llvm/llvm-project/commit/c41ae246ac673e97ec1abdc2b9cbe1989f8682fe.diff L

[clang] 53d474a - [Clang][OpenMP][NVPTX] Fixed failure in openmp-offload-gpu.c if the system has CUDA

2021-04-13 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-04-13T13:22:49-04:00 New Revision: 53d474abc92c42bed5d0cab5c79ee9ea5666aad1 URL: https://github.com/llvm/llvm-project/commit/53d474abc92c42bed5d0cab5c79ee9ea5666aad1 DIFF: https://github.com/llvm/llvm-project/commit/53d474abc92c42bed5d0cab5c79ee9ea5666aad1.diff L

[clang] 2a95cb5 - [Clang][Docs] Claim the atomic compare

2021-04-15 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-04-15T11:10:15-04:00 New Revision: 2a95cb5858f8bc0fed3956f72bd51bf6a948e660 URL: https://github.com/llvm/llvm-project/commit/2a95cb5858f8bc0fed3956f72bd51bf6a948e660 DIFF: https://github.com/llvm/llvm-project/commit/2a95cb5858f8bc0fed3956f72bd51bf6a948e660.diff L

[clang] 2c6ffb4 - [NFC] clang-format -i clang/lib/CodeGen/CGStmtOpenMP.cpp

2021-08-22 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-08-22T22:57:05-04:00 New Revision: 2c6ffb4eb20a2598148fdcc819e62f856ccaa373 URL: https://github.com/llvm/llvm-project/commit/2c6ffb4eb20a2598148fdcc819e62f856ccaa373 DIFF: https://github.com/llvm/llvm-project/commit/2c6ffb4eb20a2598148fdcc819e62f856ccaa373.diff L

[clang] 148bc25 - [Clang][OpenMP] Use enum to dereference children data array in OMPAtomicDirective

2021-08-24 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2021-08-24T16:00:24-04:00 New Revision: 148bc251f48ee68af9b7c7eb725d4ed888629b5c URL: https://github.com/llvm/llvm-project/commit/148bc251f48ee68af9b7c7eb725d4ed888629b5c DIFF: https://github.com/llvm/llvm-project/commit/148bc251f48ee68af9b7c7eb725d4ed888629b5c.diff L

[clang] 114df24 - [Clang][Doc] Update the release note for clang

2022-07-26 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-07-26T15:39:21-04:00 New Revision: 114df244ec50ce0145702974335965c3aa2c3dcc URL: https://github.com/llvm/llvm-project/commit/114df244ec50ce0145702974335965c3aa2c3dcc DIFF: https://github.com/llvm/llvm-project/commit/114df244ec50ce0145702974335965c3aa2c3dcc.diff L

[clang] 52e6a27 - Clean up `OMPAtomicDirective::Create`

2022-04-15 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-04-15T11:41:26-04:00 New Revision: 52e6a27690ca8e5f07cc646716c3736475b7746b URL: https://github.com/llvm/llvm-project/commit/52e6a27690ca8e5f07cc646716c3736475b7746b DIFF: https://github.com/llvm/llvm-project/commit/52e6a27690ca8e5f07cc646716c3736475b7746b.diff L

[clang] e8760b5 - [Clang][OpenMP] Use bitfields for flags in `OMPAtomicDirective`

2022-04-15 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-04-15T21:34:28-04:00 New Revision: e8760b51ee0f972587cb0af922a3f828ab6926d6 URL: https://github.com/llvm/llvm-project/commit/e8760b51ee0f972587cb0af922a3f828ab6926d6 DIFF: https://github.com/llvm/llvm-project/commit/e8760b51ee0f972587cb0af922a3f828ab6926d6.diff L

[clang] 20a9fb9 - [Clang][OpenMP] Fix the issue that temp cubin files are not removed after compilation when using new OpenMP driver

2022-04-22 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-04-22T18:07:28-04:00 New Revision: 20a9fb953e46b1d97aaee7b182b0f3d48f340bd1 URL: https://github.com/llvm/llvm-project/commit/20a9fb953e46b1d97aaee7b182b0f3d48f340bd1 DIFF: https://github.com/llvm/llvm-project/commit/20a9fb953e46b1d97aaee7b182b0f3d48f340bd1.diff L

[clang] 9c1085c - [Clang][OpenMP] Add the support for floating-point variables for specific atomic clauses

2022-05-03 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-05-03T11:30:54-04:00 New Revision: 9c1085c7e20bdd7c4a487f50313ebeeb2b6683b8 URL: https://github.com/llvm/llvm-project/commit/9c1085c7e20bdd7c4a487f50313ebeeb2b6683b8 DIFF: https://github.com/llvm/llvm-project/commit/9c1085c7e20bdd7c4a487f50313ebeeb2b6683b8.diff L

[clang] eb673be - [OMPIRBuilder] Add the support for compare capture

2022-06-01 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-06-01T19:53:43-04:00 New Revision: eb673be5ac8510646692f82a606a1f2c10f24828 URL: https://github.com/llvm/llvm-project/commit/eb673be5ac8510646692f82a606a1f2c10f24828 DIFF: https://github.com/llvm/llvm-project/commit/eb673be5ac8510646692f82a606a1f2c10f24828.diff L

[clang] 3a96256 - [Clang][OpenMP] Avoid using `IgnoreImpCasts` if possible

2022-06-02 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-06-02T17:45:02-04:00 New Revision: 3a96256b7e292de15ad57438202aeea4818956ba URL: https://github.com/llvm/llvm-project/commit/3a96256b7e292de15ad57438202aeea4818956ba DIFF: https://github.com/llvm/llvm-project/commit/3a96256b7e292de15ad57438202aeea4818956ba.diff L

[clang] b917433 - [NFC][Doc] Finish atomic compare

2022-06-02 Thread Shilei Tian via cfe-commits
Author: Shilei Tian Date: 2022-06-02T21:50:07-04:00 New Revision: b917433835473d5a9dff5b1034e861cf5dc9e5bc URL: https://github.com/llvm/llvm-project/commit/b917433835473d5a9dff5b1034e861cf5dc9e5bc DIFF: https://github.com/llvm/llvm-project/commit/b917433835473d5a9dff5b1034e861cf5dc9e5bc.diff L

[clang] [Clang][OpenMP] Fix private variables registration in `simd` (PR #74105)

2023-12-05 Thread Shilei Tian via cfe-commits
https://github.com/shiltian closed https://github.com/llvm/llvm-project/pull/74105 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-12-11 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/68373 >From ef3d7309feb1fcedb27f10bd11273eeb08ebf7aa Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Mon, 11 Dec 2023 19:25:01 -0500 Subject: [PATCH] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for b

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-12-11 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/68373 >From 380046a1117cef08e40f9bcdce2c51c3fe73a26f Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Tue, 12 Dec 2023 00:11:13 -0500 Subject: [PATCH] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for b

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-11-13 Thread Shilei Tian via cfe-commits
@@ -14633,6 +14633,26 @@ StmtResult Sema::ActOnOpenMPTargetTeamsDirective(ArrayRef Clauses, } setFunctionHasBranchProtectedScope(); + bool HasBareClause = false; + bool HasThreadLimitClause = false; + bool HasNumTeamsClause = false; + OMPClause *BareClause = nullptr;

[libc] [clang-tools-extra] [libcxx] [clang] [lld] [lldb] [flang] [llvm] [OpenACC] Initial commits to support OpenACC (PR #70234)

2023-11-17 Thread Shilei Tian via cfe-commits
shiltian wrote: We will want to have OpenACC label(s) such that people who are interested can be notified. https://github.com/llvm/llvm-project/pull/70234 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/l

[clang] [clang-tools-extra] [libcxx] [flang] [lldb] [llvm] [libc] [lld] [OpenACC] Initial commits to support OpenACC (PR #70234)

2023-11-17 Thread Shilei Tian via cfe-commits
shiltian wrote: You might also want to update the team "pr-subscribers-openacc" because currently it only reflects Flang. https://github.com/llvm/llvm-project/pull/70234 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cg

[openmp] [clang] [OpenMP] Add support for Solaris/x86_64 (PR #70593)

2023-11-02 Thread Shilei Tian via cfe-commits
https://github.com/shiltian approved this pull request. LG https://github.com/llvm/llvm-project/pull/70593 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP]: fixed crash due to invalid binary expression in checking atomic semantics (PR #71480)

2023-11-06 Thread Shilei Tian via cfe-commits
shiltian wrote: This doesn't look like the right place to fix this issue to me. @alexey-bataev might have better suggestion. https://github.com/llvm/llvm-project/pull/71480 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org

[clang] [Clang][OpenMP] fixed crash due to invalid binary expression in checking atomic semantics (PR #71480)

2023-11-06 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/71480 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] fixed crash due to invalid binary expression in checking atomic semantics (PR #71480)

2023-11-07 Thread Shilei Tian via cfe-commits
https://github.com/shiltian closed https://github.com/llvm/llvm-project/pull/71480 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] fixed crash due to invalid binary expression in checking atomic semantics (PR #71480)

2023-11-07 Thread Shilei Tian via cfe-commits
shiltian wrote: I have merged it given our front end expert @alexey-bataev has approved it. https://github.com/llvm/llvm-project/pull/71480 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-comm

[clang] [Clang][OpenMP] Return empty QualType when a negative array was created (PR #71552)

2023-11-07 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/71552 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] [OpenMP] Add support for Solaris/x86_64 (PR #70593)

2023-11-08 Thread Shilei Tian via cfe-commits
@@ -70,6 +72,15 @@ struct kmp_sys_timer { struct timespec start; }; +#ifdef KMP_OS_SOLARIS +// Convert timeval to timespec. +#define TIMEVAL_TO_TIMESPEC(tv, ts) \ shiltian wrote: It looks like this macro is not gu

[clang] [Clang][OpenMP] Return empty QualType when a negative array was created (PR #71552)

2023-11-09 Thread Shilei Tian via cfe-commits
shiltian wrote: The changes look good to me. @alexey-bataev WDYT? https://github.com/llvm/llvm-project/pull/71552 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] Return empty QualType when a negative array was created (PR #71552)

2023-11-09 Thread Shilei Tian via cfe-commits
https://github.com/shiltian closed https://github.com/llvm/llvm-project/pull/71552 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-11-12 Thread Shilei Tian via cfe-commits
shiltian wrote: Gentle ping https://github.com/llvm/llvm-project/pull/68373 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] [OpenMP] Directly use user's grid and block size in kernel language mode (PR #70612)

2023-11-12 Thread Shilei Tian via cfe-commits
shiltian wrote: Gentle ping https://github.com/llvm/llvm-project/pull/70612 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] Emit unsupported directive error (PR #70233)

2023-10-25 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/70233 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] Emit unsupported directive error (PR #70233)

2023-10-25 Thread Shilei Tian via cfe-commits
shiltian wrote: Can you add a small test to check the error message is correctly emitted? You can refer to those diagnosis tests under `clang/test/OpenMP`. https://github.com/llvm/llvm-project/pull/70233 ___ cfe-commits mailing list cfe-commits@lists.

[clang] [OpenMP] Unify the min/max thread/teams pathways (PR #70273)

2023-10-26 Thread Shilei Tian via cfe-commits
@@ -1,68 +1,20 @@ -// RUN: %clang_cc1 -verify -fopenmp -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - | FileCheck -allow-deprecated-dag-overlap %s -check-prefix=CHECK1 -// RUN: %clang_cc1 -fopenmp -fopenmp-targets=powerp

[clang] [OpenMP] Unify the min/max thread/teams pathways (PR #70273)

2023-10-27 Thread Shilei Tian via cfe-commits
@@ -1,68 +1,20 @@ -// RUN: %clang_cc1 -verify -fopenmp -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - | FileCheck -allow-deprecated-dag-overlap %s -check-prefix=CHECK1 -// RUN: %clang_cc1 -fopenmp -fopenmp-targets=powerp

[clang] [OpenMP] Unify the min/max thread/teams pathways (PR #70273)

2023-10-28 Thread Shilei Tian via cfe-commits
https://github.com/shiltian approved this pull request. LG https://github.com/llvm/llvm-project/pull/70273 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Add support for Solaris (PR #70593)

2023-10-29 Thread Shilei Tian via cfe-commits
shiltian wrote: I'm not familiar with Solaris but does it need dedicated implementation of the function invocation written in ASM? https://github.com/llvm/llvm-project/pull/70593 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.ll

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-10-29 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/68373 >From d0d8bcc9fa002304ef79ca48d736853ca621c0bd Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Sun, 29 Oct 2023 19:17:19 -0400 Subject: [PATCH] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for b

[clang] no default grid size (PR #70612)

2023-10-29 Thread Shilei Tian via cfe-commits
https://github.com/shiltian created https://github.com/llvm/llvm-project/pull/70612 - [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel - [OpenMP] Directly use user's grid and block size in kernel language mode >From 7b0eaa1606ad2e557105fed9509c135f857db375 Mon Sep 17

[clang] [OpenMP] Directly use user's grid and block size in kernel language mode (PR #70612)

2023-10-29 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/70612 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Directly use user's grid and block size in kernel language mode (PR #70612)

2023-10-29 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/70612 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Directly use user's grid and block size in kernel language mode (PR #70612)

2023-10-29 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/70612 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Directly use user's grid and block size in kernel language mode (PR #70612)

2023-10-29 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/70612 >From 7b0eaa1606ad2e557105fed9509c135f857db375 Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Sun, 29 Oct 2023 19:18:49 -0400 Subject: [PATCH 1/2] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` f

[clang] [openmp] Add memory diff dump for kernel record-replay (PR #70667)

2023-10-30 Thread Shilei Tian via cfe-commits
@@ -274,7 +317,7 @@ struct RecordReplayTy { void saveKernelOutputInfo(const char *Name) { SmallString<128> OutputFilename = { Name, (isRecording() ? ".original.output" : ".replay.output")}; -dumpDeviceMemory(OutputFilename); +dumpDeviceMemory(OutputFilenam

[clang-tools-extra] [openmp] Add memory diff dump for kernel record-replay (PR #70667)

2023-10-30 Thread Shilei Tian via cfe-commits
@@ -274,7 +317,7 @@ struct RecordReplayTy { void saveKernelOutputInfo(const char *Name) { SmallString<128> OutputFilename = { Name, (isRecording() ? ".original.output" : ".replay.output")}; -dumpDeviceMemory(OutputFilename); +dumpDeviceMemory(OutputFilenam

[clang] [OpenMP] Add support for Solaris/x86_64 (PR #70593)

2023-10-30 Thread Shilei Tian via cfe-commits
shiltian wrote: > > I'm not familiar with Solaris but does it need dedicated implementation of > > the function invocation written in ASM? > > Can you point out what you're referring to? Looking at the patches for adding > support for OpenBSD and other OS's I don't see any ASM additions. I wa

[clang-tools-extra] [openmp] Add memory diff dump for kernel record-replay (PR #70667)

2023-10-30 Thread Shilei Tian via cfe-commits
shiltian wrote: Could you fix the existing format issue in a separate RFC patch? https://github.com/llvm/llvm-project/pull/70667 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] Add memory diff dump for kernel record-replay (PR #70667)

2023-10-30 Thread Shilei Tian via cfe-commits
shiltian wrote: Could you fix the existing format issue in a separate RFC patch? https://github.com/llvm/llvm-project/pull/70667 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [StackProtector] Do not emit the stack protector on GPU architectures (PR #70799)

2023-10-31 Thread Shilei Tian via cfe-commits
https://github.com/shiltian approved this pull request. I think the changes make sense. https://github.com/llvm/llvm-project/pull/70799 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[llvm] [openmp] [clang] [OpenMP] Introduce the KernelLaunchEnvironment as implicit argument (PR #70401)

2023-10-31 Thread Shilei Tian via cfe-commits
shiltian wrote: Tests in `mlir` have to be updated as well. https://github.com/llvm/llvm-project/pull/70401 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] [OpenMP][FIX] Allocate per launch memory for GPU team reductions (PR #70752)

2023-11-01 Thread Shilei Tian via cfe-commits
https://github.com/shiltian approved this pull request. LG with some nits https://github.com/llvm/llvm-project/pull/70752 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] [OpenMP][FIX] Allocate per launch memory for GPU team reductions (PR #70752)

2023-11-01 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/70752 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] [OpenMP][FIX] Allocate per launch memory for GPU team reductions (PR #70752)

2023-11-01 Thread Shilei Tian via cfe-commits
@@ -194,6 +191,9 @@ int32_t __kmpc_nvptx_teams_reduce_nowait_v2( ThreadId = 0; } + uint32_t &IterCnt = state::getKernelLaunchEnvironment().ReductionIterCnt; + uint32_t &Cnt = state::getKernelLaunchEnvironment().ReductionCnt; shiltian wrote: These two

[openmp] [clang] [OpenMP][FIX] Allocate per launch memory for GPU team reductions (PR #70752)

2023-11-01 Thread Shilei Tian via cfe-commits
@@ -387,7 +387,7 @@ struct GenericKernelTy { KernelEnvironmentTy KernelEnvironment; /// The prototype kernel launch environment. - KernelLaunchEnvironmentTy KernelLaunchEnvironment; + KernelLaunchEnvironmentTy KernelLaunchEnvironment = {0, 0}; shiltian w

[clang] [OpenMP] Basic BumpAllocator for (AMD)GPUs (PR #69806)

2023-10-20 Thread Shilei Tian via cfe-commits
@@ -0,0 +1,80 @@ +//===-- State.cpp - OpenMP State & ICV interface - C++ -*-===// +// +// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions. +// See https://llvm.org/LICENSE.txt for license information. +// SPDX-License-Identifier: Apach

[clang] [OpenMP] Basic BumpAllocator for (AMD)GPUs (PR #69806)

2023-10-20 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/69806 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Introduce support for OMPX extensions and taskgraph frontend (PR #66919)

2023-10-24 Thread Shilei Tian via cfe-commits
shiltian wrote: Waiting for the evaluation, if possible https://github.com/llvm/llvm-project/pull/66919 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Introduce support for OMPX extensions and taskgraph frontend (PR #66919)

2023-10-24 Thread Shilei Tian via cfe-commits
shiltian wrote: I'd prefer to split it into at least one front end and one runtime, both with proper tests. https://github.com/llvm/llvm-project/pull/66919 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/

[clang] [OpenMP] Introduce support for OMPX extensions and taskgraph frontend (PR #66919)

2023-10-24 Thread Shilei Tian via cfe-commits
shiltian wrote: By "ompx infrastructure" do you mean the frond end code? https://github.com/llvm/llvm-project/pull/66919 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP] Introduce support for OMPX extensions and taskgraph frontend (PR #66919)

2023-10-24 Thread Shilei Tian via cfe-commits
shiltian wrote: It's fine that it will generate code that cannot be linked, as long as we don't have runtime use. We just need to check generated IR. https://github.com/llvm/llvm-project/pull/66919 ___ cfe-commits mailing list cfe-commits@lists.llvm.o

[clang] [Offloading][NFC] Move creation of offloading entries from OpenMP (PR #70116)

2023-10-24 Thread Shilei Tian via cfe-commits
shiltian wrote: Can this stuff really be generic? https://github.com/llvm/llvm-project/pull/70116 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] Emit unsupported directive error (PR #70233)

2023-11-24 Thread Shilei Tian via cfe-commits
https://github.com/shiltian approved this pull request. LG with a nit https://github.com/llvm/llvm-project/pull/70233 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] Emit unsupported directive error (PR #70233)

2023-11-24 Thread Shilei Tian via cfe-commits
@@ -0,0 +1,7 @@ +// RUN: %clang_cc1 -emit-llvm -fopenmp -disable-llvm-passes %s -verify=expected + +// expected-error@+2 {{cannot compile this OpenMP dispatch directive yet}} +void a(){ +#pragma omp dispatch +a(); +} shiltian wrote: Leave an empty line a

[clang] [Clang][OpenMP] Emit unsupported directive error (PR #70233)

2023-11-24 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/70233 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [Clang][OpenMP] Emit unsupported directive error (PR #70233)

2023-11-24 Thread Shilei Tian via cfe-commits
https://github.com/shiltian closed https://github.com/llvm/llvm-project/pull/70233 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-12-12 Thread Shilei Tian via cfe-commits
shiltian wrote: gentle ping https://github.com/llvm/llvm-project/pull/68373 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[lld] [flang] [clang-tools-extra] [lldb] [compiler-rt] [libcxx] [clang] [libc] [llvm] [openmp] Gcc 75 libomptarget type convert (PR #75562)

2023-12-15 Thread Shilei Tian via cfe-commits
shiltian wrote: FYI: #75419 https://github.com/llvm/llvm-project/pull/75562 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (PR #68373)

2023-12-18 Thread Shilei Tian via cfe-commits
https://github.com/shiltian closed https://github.com/llvm/llvm-project/pull/68373 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [openmp] [Clang][OpenMP] Fix mapping of structs to device (PR #75642)

2023-12-18 Thread Shilei Tian via cfe-commits
shiltian wrote: The newly added test `offloading/struct_mapping_with_pointers.cpp` fails on NVIDIA GPUs as well. ``` TEST 'libomptarget :: nvptx64-nvidia-cuda :: offloading/struct_mapping_with_pointers.cpp' FAILED Exit Code: 1 Command Output (stdout):

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-12 Thread Shilei Tian via cfe-commits
@@ -79,17 +79,17 @@ define amdgpu_ps void @test_llvm_amdgcn_fdot2_bf16_bf16_sis( ; GFX11: ; %bb.0: ; %entry ; GFX11-NEXT:v_mov_b32_e32 v2, s1 ; GFX11-NEXT:s_delay_alu instid0(VALU_DEP_1) -; GFX11-NEXT:v_dot2_bf16_bf16 v2, s0, 0x10001, v2 +; GFX11-NEXT:v_do

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-12 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/80908 >From 4196e998349d663a9a9922937cc4bedbec95fe5f Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Mon, 12 Feb 2024 13:48:39 -0500 Subject: [PATCH] [RFC][WIP][AMDGPU] Use `bf16` instead of `i16` for bfloat Current

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-12 Thread Shilei Tian via cfe-commits
@@ -2730,6 +2749,12 @@ std::optional getInlineEncodingV2I16(uint32_t Literal) { return getInlineEncodingV216(false, Literal); } +// Encoding of the literal as an inline constant for a V_PK_*_BF16 instruction +// or nullopt. +std::optional getInlineEncodingV2BF16(uint32_t Li

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-12 Thread Shilei Tian via cfe-commits
@@ -2660,15 +2660,34 @@ bool isInlinableLiteral16(int16_t Literal, bool HasInv2Pi) { return true; uint16_t Val = static_cast(Literal); - return Val == 0x3C00 || // 1.0 - Val == 0xBC00 || // -1.0 - Val == 0x3800 || // 0.5 - Val == 0xB800 || // -0

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-12 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/80908 >From df3dbb6b9c257157c4afb407e40447a25c27a2a8 Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Mon, 12 Feb 2024 18:03:57 -0500 Subject: [PATCH] [RFC][WIP][AMDGPU] Use `bf16` instead of `i16` for bfloat Current

[clang] [llvm] [LLVM] Add `__builtin_readsteadycounter` intrinsic and builtin for realtime clocks (PR #81331)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian approved this pull request. LG https://github.com/llvm/llvm-project/pull/81331 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/80908 >From c556e40c13adb9d253ef7c5ebb2b46cb12969d46 Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Tue, 13 Feb 2024 15:30:51 -0500 Subject: [PATCH] [RFC][WIP][AMDGPU] Use `bf16` instead of `i16` for bfloat Current

[clang] [Clang][CodeGen] Loose the cast check when emitting builtins (PR #81669)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian created https://github.com/llvm/llvm-project/pull/81669 This patch looses the cast check (`canLosslesslyBitCastTo`) and leaves it to the one inside `CreateBitCast`. It seems too conservative for the use case here. >From 813441fd3106a0069346aabd0dd828d8feb8ea53 Mon S

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
shiltian wrote: The patch is in a good shape now. I have made two other prime patches (#81674 and #81669). I'll rebase this one once they are landed. This patch only changes one bf16 instruction with the necessary infrastructure for others. I'll update all of them once this patch is landed. H

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
@@ -79,17 +79,17 @@ define amdgpu_ps void @test_llvm_amdgcn_fdot2_bf16_bf16_sis( ; GFX11: ; %bb.0: ; %entry ; GFX11-NEXT:v_mov_b32_e32 v2, s1 ; GFX11-NEXT:s_delay_alu instid0(VALU_DEP_1) -; GFX11-NEXT:v_dot2_bf16_bf16 v2, s0, 0x10001, v2 +; GFX11-NEXT:v_do

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/80908 >From bfd3170dc5e4d6e53fb98b46b37f2bf3c3ebf86d Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Tue, 13 Feb 2024 17:39:23 -0500 Subject: [PATCH] [RFC][WIP][AMDGPU] Use `bf16` instead of `i16` for bfloat Current

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
@@ -1,8 +1,7 @@ ; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py ; RUN: llc -mtriple=amdgcn -mcpu=gfx1100 -verify-machineinstrs < %s | FileCheck %s --check-prefixes=GFX11,SDAG-GFX11 -; RUN: llc -global-isel -mtriple=amdgcn -mcpu=gfx1100 -verify-mach

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/80908 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
@@ -2819,11 +2819,11 @@ def int_amdgcn_fdot2_f16_f16 : def int_amdgcn_fdot2_bf16_bf16 : ClangBuiltin<"__builtin_amdgcn_fdot2_bf16_bf16">, DefaultAttrsIntrinsic< -[llvm_i16_ty], // %r +[llvm_bfloat_ty], // %r shiltian wrote: The cast will be ins

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
@@ -488,6 +488,49 @@ static bool printImmediateFloat16(uint32_t Imm, const MCSubtargetInfo &STI, return true; } +static bool printImmediateBFloat16(uint32_t Imm, const MCSubtargetInfo &STI, + raw_ostream &O) { + if (Imm == 0x3F80) +O <

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
@@ -4185,9 +4185,17 @@ bool SIInstrInfo::isInlineConstant(const MachineOperand &MO, case AMDGPU::OPERAND_REG_INLINE_C_V2FP16: case AMDGPU::OPERAND_REG_INLINE_AC_V2FP16: return AMDGPU::isInlinableLiteralV2F16(Imm); + case AMDGPU::OPERAND_REG_IMM_V2BF16: + case AMDGPU:

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/80908 >From d72bf8bb9d1091ba76e17bf09b0aad9073e18caa Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Tue, 13 Feb 2024 19:02:41 -0500 Subject: [PATCH] [RFC][WIP][AMDGPU] Use `bf16` instead of `i16` for bfloat Current

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
https://github.com/shiltian updated https://github.com/llvm/llvm-project/pull/80908 >From 22fd628687652e7f9d8795c48e9f794fdd684c45 Mon Sep 17 00:00:00 2001 From: Shilei Tian Date: Tue, 13 Feb 2024 21:27:17 -0500 Subject: [PATCH] [RFC][WIP][AMDGPU] Use `bf16` instead of `i16` for bfloat Current

[clang] [llvm] [RFC][AMDGPU] Use `bf16` instead of `i16` for bfloat (PR #80908)

2024-02-13 Thread Shilei Tian via cfe-commits
@@ -4185,9 +4185,17 @@ bool SIInstrInfo::isInlineConstant(const MachineOperand &MO, case AMDGPU::OPERAND_REG_INLINE_C_V2FP16: case AMDGPU::OPERAND_REG_INLINE_AC_V2FP16: return AMDGPU::isInlinableLiteralV2F16(Imm); + case AMDGPU::OPERAND_REG_IMM_V2BF16: + case AMDGPU:

  1   2   3   4   5   6   >