Re: [Beignet] [PATCH 1/3] fix bswap bug.

2015-09-07 Thread Yang, Rong R
> -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > xionghu@intel.com > Sent: Thursday, August 13, 2015 14:28 > To: beignet@lists.freedesktop.org > Cc: Luo, Xionghu > Subject: [Beignet] [PATCH 1/3] fix bswap bug. > > From: Luo Xionghu >

Re: [Beignet] [PATCH] GBE: Use addRemappedFile to avoid creating temporary cl source file.

2015-09-07 Thread Gong, Zhigang
> -Original Message- > From: Luo, Xionghu > Sent: Tuesday, September 8, 2015 2:09 PM > To: Gong, Zhigang; beignet@lists.freedesktop.org > Cc: Gong, Zhigang > Subject: RE: [Beignet] [PATCH] GBE: Use addRemappedFile to avoid creating > temporary cl source file. > > This patch LGTM except som

Re: [Beignet] [PATCH] GBE: Use addRemappedFile to avoid creating temporary cl source file.

2015-09-07 Thread Luo, Xionghu
This patch LGTM except some questions. How didn't decide the name "stringInput.cl"? And since this method works, we could also remap all the input headers in API clCompileProgram to avoid create temp files under /tmp, anyway, this could be processed in another patch. Luo Xionghu Best Regards -

Re: [Beignet] [PATCH 3/3] add optimization for local copy propagation

2015-09-07 Thread Zhigang Gong
> -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > Guo, Yejun > Sent: Monday, September 7, 2015 8:27 PM > To: Zhigang Gong; beignet@lists.freedesktop.org > Subject: Re: [Beignet] [PATCH 3/3] add optimization for local copy propagation > > Yes

Re: [Beignet] [PATCH 3/3] add optimization for local copy propagation

2015-09-07 Thread Guo, Yejun
Yes, there will be penalty for the case in your example. I read several documents for local copy propagation, and none mentioned this case. :( For the method to iterate new instructions/registers, it requires to add the 'new' flag during GenIR to SelectionIR period, since the current implementat

[Beignet] [PATCH v3 3/4] add basic utest for block_motion_estimate_intel

2015-09-07 Thread Chuanbo Weng
From: Guo Yejun if the CL device does not support this builtin kernel, the test returns PASS. Signed-off-by: Guo Yejun --- utests/CMakeLists.txt | 1 + .../builtin_kernel_block_motion_estimate_intel.cpp | 109 + utests/utest_helper.hpp

[Beignet] [PATCH v3 4/4] Add document of video motion estimation support.

2015-09-07 Thread Chuanbo Weng
Signed-off-by: Chuanbo Weng --- docs/Beignet.mdwn | 1 + docs/howto/video-motion-estimation-howto.mdwn | 79 +++ 2 files changed, 80 insertions(+) create mode 100644 docs/howto/video-motion-estimation-howto.mdwn diff --git a/docs/Beignet.mdwn

[Beignet] [PATCH v3 2/4] Add extensions intel_accelerator and basic intel_motion_estimation.

2015-09-07 Thread Chuanbo Weng
v2: 1. Just upload the first vme_state. 2. Remove duplicated code in check_opt1_extension. 3. Check image format before cl_gpgpu_bind_image_for_vme. 4. Fix error of getting mv. Because we suppose this kernel run in SIMD16 mode, so dword 0 of grf 1 should be __gen_ocl_region(8,vme_result.s0),

[Beignet] [PATCH v3 1/4] Add built-in function __gen_ocl_vme.

2015-09-07 Thread Chuanbo Weng
__gen_ocl_vme is used for hardware accelerated video motion estimation. It gets payload values as parameters and uses MOV to pass these payload values to VME SEND Message's payload grfs. The int8 return value is used to store SEND Message writeback. v2: Remove unnecessary 5 parameters(src_grf*) of

Re: [Beignet] [PATCH v2 1/4] Add built-in function __gen_ocl_vme.

2015-09-07 Thread Weng, Chuanbo
Please ignore this patchset, I will send out a new version soon. -Original Message- From: Weng, Chuanbo Sent: Monday, September 07, 2015 13:01 To: beignet@lists.freedesktop.org Cc: Weng, Chuanbo Subject: [PATCH v2 1/4] Add built-in function __gen_ocl_vme. __gen_ocl_vme is used for hardwa