Hi all,

I've recently started evaluating PoCL on the current Intel Xeon Phi generation (KNL). My motivation is the lack of OpenCL support from Intel for HPC platforms: the Intel OpenCL SDK runs there with a work-around, in AVX2 mode. My hope is to get AVX-512 support via PoCL/LLVM, i.e. performance competitive with OpenMP.
I am currently gathering benchmark results. For that I use a micro-benchmark that compares the performance of basic arithmetic operators and math functions between OpenMP and OpenCL on different target architectures, as well as some real-world scientific codes that use OpenCL.

I hope this is the right place to ask questions. The first one would be:

Q: Does it matter, performance-wise, which platform I build PoCL on versus which platform I run it on, all within the x86-64 world?

Background: I use PoCL on a Xeon (Haswell, AVX2) host and on a Xeon Phi (KNL, AVX-512), which share a home directory. So do I need two builds of PoCL to generate optimal code for my OpenCL kernels, or will the PoCL OpenCL compiler figure out at runtime whether to use AVX2 or AVX-512?

Thanks,
Matthias
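
P.S. To make "figure out at runtime" concrete, here is a minimal sketch of the kind of host check I have in mind. It assumes a Linux node where /proc/cpuinfo has a "flags" line using the usual spellings (avx2, avx512f). The helper names cpu_flags and widest_simd are hypothetical, and this is not how PoCL selects its target; it only illustrates that the two nodes sharing one build would report different vector extensions.

```python
from pathlib import Path

# Widest first; names follow the Linux /proc/cpuinfo flag spelling.
SIMD_PREFERENCE = ("avx512f", "avx2", "avx", "sse4_2")


def cpu_flags(cpuinfo_text):
    """Return the feature flags from /proc/cpuinfo-style text as a set."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            # The first "flags" line is enough; all cores report the same set
            # on the machines considered here (an assumption).
            return set(value.split())
    return set()


def widest_simd(flags):
    """Return the widest extension from SIMD_PREFERENCE present in flags."""
    for name in SIMD_PREFERENCE:
        if name in flags:
            return name
    return None


if __name__ == "__main__":
    path = Path("/proc/cpuinfo")
    text = path.read_text() if path.exists() else ""
    print(widest_simd(cpu_flags(text)))
```

Running this on the Haswell host and on the KNL node would show whether a single PoCL build sees different capabilities on each; whether PoCL then actually emits AVX-512 code on KNL is exactly the open question.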
