[Beignet] [PATCH 06/10] OCL20: Add functions for runtime to get the device enqueue info.

2016-03-19 Thread Yang Rong
Add two functions gbe_kernel_use_device_enqueue and gbe_program_get_device_enqueue_kernel_name. Signed-off-by: Yang Rong --- backend/src/backend/program.cpp | 17 - backend/src/backend/program.h | 6 ++ backend/src/gbe_bin_interpreter.cpp |

[Beignet] [PATCH 07/10] OCL20: add a cl_kernel pointer to gpgpu.

2016-03-19 Thread Yang Rong
When flushing the command queue, the runtime must check whether the command queue currently being flushed uses device enqueue, and that check needs the cl_kernel. So store the cl_kernel pointer in gpgpu, and add two functions, intel_gpgpu_set_kernel and intel_gpgpu_get_kernel, for it. Signed-off-by: Yang Rong

[Beignet] [PATCH] utest: do not check MV near image border

2016-03-19 Thread Guo Yejun
If the image width and height are not aligned, the VME hardware block could use data from outside the image. No clear rule defines the behavior in this case, so do not check the MVs near the border. Signed-off-by: Guo Yejun ---
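The idea of skipping MV checks near the border can be sketched in host-side C as below. This is only an illustration, not the code from the patch: the helper names `mv_near_border` and `count_mv_mismatches`, the macroblock-grid layout (two `short` components per block), and the `margin` parameter are all assumptions made for the example.

```c
#include <stdbool.h>

/* Hypothetical helper: true when macroblock (mb_x, mb_y) lies within
 * `margin` macroblocks of any edge of an mb_w x mb_h grid. */
bool mv_near_border(int mb_x, int mb_y, int mb_w, int mb_h, int margin)
{
    return mb_x < margin || mb_y < margin ||
           mb_x >= mb_w - margin || mb_y >= mb_h - margin;
}

/* Hypothetical checker: counts blocks whose motion vector differs from the
 * reference, ignoring blocks near the border. Each block stores two shorts
 * (x and y components), an assumed layout for this sketch. */
int count_mv_mismatches(const short *got, const short *ref,
                        int mb_w, int mb_h, int margin)
{
    int mismatches = 0;
    for (int y = 0; y < mb_h; ++y) {
        for (int x = 0; x < mb_w; ++x) {
            if (mv_near_border(x, y, mb_w, mb_h, margin))
                continue; /* hardware may have read outside the image here */
            int i = (y * mb_w + x) * 2;
            if (got[i] != ref[i] || got[i + 1] != ref[i + 1])
                ++mismatches;
        }
    }
    return mismatches;
}
```

With a margin of 1, a differing vector in a corner block is ignored, while a differing vector in an interior block is still counted.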

Re: [Beignet] [Patch v2 2/2] [OCL20] atomic_flag_test_and_set function fix.

2016-03-19 Thread Song, Ruiling
> -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > xionghu@intel.com > Sent: Friday, March 11, 2016 11:42 PM > To: beignet@lists.freedesktop.org > Cc: Luo, Xionghu > Subject: [Beignet] [Patch v2 2/2] [OCL20]

[Beignet] [PATCH 03/10] OCL20: add ir register enqueuebufptr for enqueue global buffer.

2016-03-19 Thread Yang Rong
Signed-off-by: Yang Rong --- backend/src/backend/program.h | 1 + backend/src/ir/profile.cpp| 3 ++- backend/src/ir/profile.hpp| 3 ++- 3 files changed, 5 insertions(+), 2 deletions(-) diff --git a/backend/src/backend/program.h b/backend/src/backend/program.h

Re: [Beignet] [PATCH] enable FP_CONTRACT on as default, and implemented with MAD

2016-03-19 Thread Yang, Rong R
Pushed, thanks. > -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > Guo, Yejun > Sent: Tuesday, March 8, 2016 14:52 > To: Song, Ruiling ; beignet@lists.freedesktop.org > Subject: Re: [Beignet] [PATCH] enable

[Beignet] [PATCH 09/10] OCL20: remove CL_QUEUE_ON_DEVICE check in function clCreateCommandQueueWithProperties.

2016-03-19 Thread Yang Rong
Signed-off-by: Yang Rong --- src/cl_api.c | 5 - 1 file changed, 5 deletions(-) diff --git a/src/cl_api.c b/src/cl_api.c index 5697e34..19c0ec7 100644 --- a/src/cl_api.c +++ b/src/cl_api.c @@ -488,11 +488,6 @@ clCreateCommandQueueWithProperties(cl_context

[Beignet] [Patch V3 1/2] [OCL20] fix compiler_generic_atomic utest bug.

2016-03-19 Thread xionghu . luo
From: Luo Xionghu The test should use the OpenCL 2.0 API to test generic atomics. atomic_fetch_add returns the old value of the source pointer, so the src data should be moved to dst with another instruction. --- kernels/compiler_generic_atomic.cl | 3 ++- 1 file changed, 2
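The return-value behavior described here can be shown with the host-side C11 `<stdatomic.h>` analog of the OpenCL 2.0 built-in. This is a minimal sketch rather than the kernel from the patch; the helper name `fetch_add_then_copy` is hypothetical.

```c
#include <stdatomic.h>

/* Hypothetical helper: adds `delta` to *src, then copies the updated
 * value of *src into *dst with a separate load. Returns the value
 * that atomic_fetch_add reported, which is the OLD value of *src. */
int fetch_add_then_copy(atomic_int *src, int *dst, int delta)
{
    int old = atomic_fetch_add(src, delta); /* value before the add */
    *dst = atomic_load(src);                /* separate instruction for the new value */
    return old;
}
```

In a single-threaded call with `*src == 5` and `delta == 3`, the function returns 5 and leaves 8 in `*dst`. Under concurrency the separate load may observe later updates from other threads.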

Re: [Beignet] [PATCH] utest: do not check MV near image border

2016-03-19 Thread yan . wang
Now this case can pass when the previous test_printf case has multiple tests. The VME engine seems to read data outside the specified image buffer, which is based on a drm bo. If the drm bo of the src/ref image object happens to be reused from a previous bo containing garbage, it will cause different MV results. Yan

[Beignet] [PATCH 1/2] Benchmark: Evaluate internal sine and cosine performance on intervals

2016-03-19 Thread Grigore Lupescu
From: Grigore Lupescu Signed-off-by: Grigore Lupescu --- kernels/bench_math.cl | 12 ++-- 1 file changed, 10 insertions(+), 2 deletions(-) diff --git a/kernels/bench_math.cl b/kernels/bench_math.cl index b53ba50..a6b6b94 100644 --- a/kernels/bench_math.cl +++ b/kernels/bench_math.cl

[Beignet] [PATCH 10/10] OCL20: add device enqueue test case.

2016-03-19 Thread Yang Rong
Signed-off-by: Yang Rong --- kernels/compiler_device_enqueue.cl | 19 +++ utests/CMakeLists.txt | 3 ++- utests/compiler_device_enqueue.cpp | 36 3 files changed, 57 insertions(+), 1 deletion(-) create

[Beignet] [PATCH 05/10] OCL20: fix a unpack long assert.

2016-03-19 Thread Yang Rong
Src may be scalar when unpacking, so only check dst. Signed-off-by: Yang Rong --- backend/src/backend/gen8_context.cpp | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/backend/src/backend/gen8_context.cpp b/backend/src/backend/gen8_context.cpp index

Re: [Beignet] [PATCH] change behavior of mul24/mad24 when out of range

2016-03-19 Thread Yang, Rong R
LGTM, pushed, thanks. > -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > Guo, Yejun > Sent: Tuesday, March 8, 2016 14:54 > To: Guo, Yejun ; beignet@lists.freedesktop.org > Subject: Re: [Beignet] [PATCH] change behavior

Re: [Beignet] [PATCH 1/2] Backend: Refine get_enqueued_local_size and get_local_size

2016-03-19 Thread Yang, Rong R
LGTM, pushed. > -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > Xiuli Pan > Sent: Wednesday, March 16, 2016 7:53 > To: beignet@lists.freedesktop.org > Cc: Pan, Xiuli > Subject: [Beignet] [PATCH 1/2] Backend: Refine

[Beignet] [PATCH] standalone utest for unified OpenCL implementation.

2016-03-19 Thread xionghu . luo
From: Luo Xionghu Use the variable NOT_BUILD_STAND_ALONE_UTEST to control the build type: for a beignet build, set it to 1; for a stand-alone build, it does not need to be set. Remove all clXXXIntel extension calls and the tests that depend on them, since we intend to provide the unit test

[Beignet] [PATCH 01/10] OCL20: add device enqueue helper functions in backend.

2016-03-19 Thread Yang Rong
These functions collect all of the device enqueue invoke functions, store them in the unit, and set those functions as OpenCL kernel functions. Because this changes the module's kernel functions, it must be called before link; otherwise, the built-in functions called in invoke functions may not be

[Beignet] [Patch V3 2/2] [OCL20] atomic_flag_test_and_set function fix.

2016-03-19 Thread xionghu . luo
From: Luo Xionghu should call atomic_compare_exchange_strong instead of the gen type call. v2: still use __gen_ocl_atomic_compare_exchange_strong32 to implement the atomic_flag_test_and_set function as the spec says: "Returns atomically, the value of the object
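The test-and-set semantics referred to here (set the flag and return its previous state) can be expressed with a strong compare-exchange. The sketch below uses host-side C11 `<stdatomic.h>` and an `atomic_int` as the flag; it is not the Beignet implementation, and the helper name `test_and_set_via_cas` is hypothetical.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical helper: emulates test-and-set on an integer flag.
 * If the flag is 0 (clear), the compare-exchange stores 1 and succeeds,
 * so the previous state was "clear" and we return false.
 * If the flag is already nonzero, the exchange fails, the flag stays set,
 * and we return true. */
bool test_and_set_via_cas(atomic_int *flag)
{
    int expected = 0;
    return !atomic_compare_exchange_strong(flag, &expected, 1);
}
```

Calling it twice on a flag that starts at 0 returns false the first time and true the second time, and the flag holds 1 afterwards.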