Dear OpenCL users, I am looking for a way to find the best device in a computer in order to be able to select it for processing.
PyOpenCL offers me a max_clock_frequency and a max_compute_units for the device. Nice! Unfortunately on a dual-Xeon5520 + Fermi; the product max_clock_frequency*max_compute_units is in favour of the CPU but the GPU is clearly faster ! I have calculated the FLOPS per compute unit per Hz for a few devices and I got: NVidia Fermi (GTX580): 64 FLOPS/Unit/Hz NVidia Tesla (GT285): 24 FLOPS/Unit/Hz NVidia GT9600: 24 FLOPS/Unit/Hz Intel CPU: 4 FLOPS/Unit/MHz (I usually get less) According to some readings on the web for Kepler cards, it should be 384 FLOPS/Unit/MHz (unchecked) I have no figures for AMD cards, I would be interested in getting some of them; and would like to be able to discriminate the various NVidia generations within pyopencl (via compute_capability_major_nv & compute_capability_minor_nv?) Any ideas are welcome. Cheers, -- Jérôme Kieffer Data analysis unit - ESRF _______________________________________________ PyOpenCL mailing list [email protected] http://lists.tiker.net/listinfo/pyopencl
