Hi, > I've already told this on IRC. The GEMV uses very conservative profiles > with very few threads. Now that I have ported a simple version of GEMM > (when only full matrices are used), I'll re-bind the generator into > pyviennacl and will try to get an auto-tuning up and runing in python. > Then, I'll update the profiles to something better :)
What about using a default local workgroup size of 128 and a default global workgroup size of 128*128? This has worked fairly well for years now and will get him fairly close to peak on current hardware? Best regards, Karli ------------------------------------------------------------------------------ Open source business process management suite built on Java and Eclipse Turn processes into business applications with Bonita BPM Community Edition Quickly connect people, data, and systems into organized workflows Winner of BOSSIE, CODIE, OW2 and Gartner awards http://p.sf.net/sfu/Bonitasoft _______________________________________________ ViennaCL-devel mailing list ViennaCL-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/viennacl-devel