Dear OpenCL users,

I am looking for a way to identify the fastest OpenCL device in a computer
so that I can select it for processing.

PyOpenCL exposes max_clock_frequency and max_compute_units for each
device. Nice!
Unfortunately, on a dual-Xeon5520 + Fermi machine, the product
max_clock_frequency*max_compute_units favours the CPU, although the GPU is
clearly faster!
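
For reference, the naive ranking described above can be sketched as follows. This is only an illustration: the helper names naive_score, rank_devices and all_devices are made up for this example, and only get_platforms(), get_devices(), max_clock_frequency and max_compute_units come from PyOpenCL.

```python
def naive_score(device):
    """Clock frequency (MHz, as OpenCL reports it) times compute units."""
    return device.max_clock_frequency * device.max_compute_units


def rank_devices(devices):
    """Return the devices sorted from highest to lowest naive score."""
    return sorted(devices, key=naive_score, reverse=True)


def all_devices():
    """List every device of every OpenCL platform (needs PyOpenCL)."""
    import pyopencl as cl  # imported here so the helpers above work without it
    return [dev for platform in cl.get_platforms()
            for dev in platform.get_devices()]

# Typical use on a machine with PyOpenCL installed:
#     for dev in rank_devices(all_devices()):
#         print(dev.name, naive_score(dev))
```

This score ignores how much work each compute unit does per clock cycle, which is presumably why it can put a CPU ahead of a much faster GPU.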

I have calculated the FLOPS per compute unit per Hz (i.e. floating-point
operations per compute unit per clock cycle) for a few devices and got:
NVidia Fermi (GTX580): 64 FLOPS/Unit/Hz
NVidia Tesla (GT285): 24 FLOPS/Unit/Hz
NVidia GT9600: 24 FLOPS/Unit/Hz
Intel CPU: 4 FLOPS/Unit/Hz (I usually get less)

According to what I have read on the web, Kepler cards should reach 384
FLOPS/Unit/Hz (unchecked).
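
With such a per-unit, per-cycle factor, a peak-throughput estimate could be computed roughly as below. This is a sketch under assumptions: the dictionary keys are labels invented for this example (OpenCL does not report them), the factors are simply the figures quoted above (the Kepler one unchecked), and max_clock_frequency is taken to be in MHz, as the OpenCL specification defines it.

```python
# FLOP per compute unit per clock cycle, copied from the figures above.
# The keys are hypothetical labels chosen for this sketch.
FLOP_PER_UNIT_PER_CYCLE = {
    "fermi": 64,
    "tesla": 24,
    "cpu": 4,
    "kepler": 384,  # unchecked figure
}


def estimated_gflops(device, family):
    """Estimate peak GFLOPS as factor * compute units * clock frequency."""
    cycles_per_second = device.max_clock_frequency * 1e6  # MHz -> Hz
    flops = (FLOP_PER_UNIT_PER_CYCLE[family]
             * device.max_compute_units
             * cycles_per_second)
    return flops / 1e9
```

Sorting devices by this estimate instead of the bare clock-times-units product would take the architecture into account, provided the family of each device is known.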

I have no figures for AMD cards and would be interested in getting some.
I would also like to be able to distinguish the various NVidia
generations within PyOpenCL (via compute_capability_major_nv and
compute_capability_minor_nv?).
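
One possible way to do this, sketched under assumptions: NVIDIA's compute capability major version is 1 for Tesla, 2 for Fermi and 3 for Kepler, and the attribute is only available on devices whose driver supports the NVIDIA attribute-query extension. The function name nvidia_generation and the returned labels are made up for this example, and the broad except clause is a guess at how a failed lookup on other vendors' devices should be handled.

```python
# Compute capability major version -> architecture label (labels are
# hypothetical names chosen for this sketch).
NVIDIA_GENERATIONS = {1: "tesla", 2: "fermi", 3: "kepler"}


def nvidia_generation(device):
    """Return an architecture label, or None if it cannot be determined."""
    try:
        major = device.compute_capability_major_nv
    except Exception:
        # Non-NVIDIA devices do not provide this attribute; treat any
        # failure to read it as "unknown generation".
        return None
    return NVIDIA_GENERATIONS.get(major)
```

A device for which this returns None (AMD, Intel, or an NVidia generation not in the table) would still need another way to obtain its per-cycle factor.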

Any ideas are welcome.

Cheers,

-- 
Jérôme Kieffer
Data analysis unit - ESRF

_______________________________________________
PyOpenCL mailing list
[email protected]
http://lists.tiker.net/listinfo/pyopencl
