> The GPU is an Intel HD Graphics 4000.
> 
> What worries me is not only the warmup, but the copy phase. In release it can 
> take a lot of time for a simple 1M items (169 seconds!). I suspect I should 
> try to do everything on the GPU. Not only that, but this call:
> 
>     viennacl::vector<float> gpuv;
>     gpuv.resize(size);
> 
> is taking 6 times what it takes on the CPU.
> 
> How can I avoid these overheads? I am quite new with OpenCL...
> 
> Thank you!
> 

Sorry, I forgot the log for the twice-computed norm:

Computing norm_1 on GPU 
Computing norm_1 on GPU 347

Computing norm_1 on GPU (again)
Computing norm_1 on GPU 2

Computing norm_1 on CPU 
Computing norm_1 on CPU 0


As you can see, yes, the OpenCL compilation takes a lot of time, but still 
higher than the CPU (with cache and all, I know).
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
ViennaCL-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/viennacl-devel

Reply via email to