Hi, I've done some analysis comparing the CPU (on a Nehalem) and GPU (on a Tesla) performance of PyOpenCL to parallel Cython using OpenMP. The performance of PyOpenCL on the CPU (Intel Nehalem with AMD OpenCL 1.1) was very poor, even slower than a single-threaded Cython program. I realize that my OpenCL implementation was fairly poor, but I expected performance to be a bit better than it was.
The analysis is available here: http://www.rcloud.me/2011/09/20/pyopencl-implementation/

I'm hoping that someone can give some insight into how to improve it or why it is so bad.

Also, I would like to run the analysis again with the Intel OpenCL driver, but I can't get PyOpenCL to recognize both the Intel and AMD platforms. When I run get_platforms, it only shows AMD. Here is my siteconf.py file:

rcloud@Vertex:~/sources/pyopencl-2011.1.2$ cat siteconf.py
BOOST_INC_DIR = []
BOOST_LIB_DIR = []
BOOST_COMPILER = 'gcc43'
BOOST_PYTHON_LIBNAME = ['boost_python-gcc43-mt']
USE_SHIPPED_BOOST = True
CL_TRACE = False
CL_ENABLE_GL = False
CL_ENABLE_DEVICE_FISSION = True
CL_INC_DIR = ['/home/rcloud/sources/amd/AMD-APP-SDK-v2.5-RC2-lnx64/include']
CL_LIB_DIR = ['/home/rcloud/sources/amd/AMD-APP-SDK-v2.5-RC2-lnx64/lib/x86_64', '/usr/lib64']
CL_LIBNAME = ['OpenCL']
CXXFLAGS = []
LDFLAGS = []

thanks in advance,

--
Robert L Cloud

,,Warum willst du dich von uns Allen
Und unsrer Meinung entfernen?"
Ich schreibe nicht, euch zu gefallen;
Ihr sollt was lernen.
--Goethe

http://www.robertlouiscloud.com
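For reference, a minimal sketch of how the visible platforms and their devices can be listed with get_platforms. It assumes PyOpenCL is installed and importable; the helper names summarize_platforms and list_opencl_platforms are made up for illustration and are not part of PyOpenCL.

```python
def summarize_platforms(platforms):
    """Return one (platform name, vendor, [device names]) tuple per platform.

    Works on any objects exposing .name, .vendor and .get_devices(),
    which is what pyopencl.Platform provides.
    """
    summary = []
    for platform in platforms:
        device_names = [device.name for device in platform.get_devices()]
        summary.append((platform.name, platform.vendor, device_names))
    return summary


def list_opencl_platforms():
    """Print every platform PyOpenCL can see, with its devices."""
    # Imported here so summarize_platforms stays usable without PyOpenCL.
    import pyopencl as cl

    for name, vendor, device_names in summarize_platforms(cl.get_platforms()):
        print("{0} ({1})".format(name, vendor))
        for device_name in device_names:
            print("    {0}".format(device_name))
```

Calling list_opencl_platforms() from a Python prompt prints one entry per platform; if only the AMD platform appears, the Intel one is not being exposed to PyOpenCL at all.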
