Just thought I should point out that the FirePro V4800 is 3 years older than the GTX 780 Ti and has far fewer cores. Its bandwidth to global memory is 57.6 GB/s vs the 780 Ti's 336.5 GB/s. Comparing the two is pointless; the FirePro V4800 will always lose.
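To put those two numbers side by side, here is a rough back-of-the-envelope sketch. It only uses the peak bandwidth figures quoted above; the 256 MiB buffer size and the helper name min_transfer_ms are made up for illustration, and real kernels will not reach peak bandwidth on either card.

```python
# Peak global-memory bandwidths quoted above, in GB/s.
BANDWIDTH_GBPS = {"FirePro V4800": 57.6, "GTX 780 Ti": 336.5}


def min_transfer_ms(n_bytes, gbps):
    """Lower bound (in ms) on the time to move n_bytes at a peak bandwidth."""
    return n_bytes / (gbps * 1e9) * 1e3


# Hypothetical buffer size, chosen only for illustration.
buffer_bytes = 256 * 1024 * 1024

for name, gbps in BANDWIDTH_GBPS.items():
    print(f"{name}: at least {min_transfer_ms(buffer_bytes, gbps):.2f} ms")

# Ratio of the two peak bandwidths.
ratio = BANDWIDTH_GBPS["GTX 780 Ti"] / BANDWIDTH_GBPS["FirePro V4800"]
print(f"bandwidth ratio: {ratio:.1f}x")
```

So for a memory-bound kernel the 780 Ti has roughly a 5.8x head start before core counts even enter the picture.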
On 14 August 2015 at 19:12, CRV§ADER//KY <[email protected]> wrote:

> Look up opencl / cuda coalesced memory access on stack overflow, there's
> plenty of threads there
>
> On 14 Aug 2015 13:55, "Joe Haywood" <[email protected]> wrote:
>
>> Will you explain this to me a little more?
>> "One that jumps to the eye is that you're accessing 4 bytes of memory in
>> an arbitrary place, but every time you're really loading up, and then
>> writing back, a whole page! That's why it's so slow, even without atomic
>> operations. The solution is local memory."
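For what the quoted remark about coalescing means in practice, here is a toy model in plain Python (no device needed). It counts how many memory segments one group of work-items touches. The 128-byte segment size, the 4-byte element size, the group of 32 work-items and the function name segments_touched are all assumptions for illustration; the real transaction granularity depends on the device.

```python
def segments_touched(indices, elem_bytes=4, segment_bytes=128):
    """Count the distinct memory segments hit by one group of work-items.

    indices       -- element index read by each work-item
    elem_bytes    -- size of one element (assumed 4 bytes, e.g. a float)
    segment_bytes -- assumed size of one memory transaction
    """
    return len({(i * elem_bytes) // segment_bytes for i in indices})


group = range(32)  # hypothetical group of 32 work-items

# Coalesced: work-item gid reads a[gid], so neighbours read neighbours.
contiguous = [gid for gid in group]

# Scattered: work-item gid reads a[gid * 32], so every read lands
# in a different segment.
strided = [gid * 32 for gid in group]

print(segments_touched(contiguous))  # 1
print(segments_touched(strided))     # 32
```

In this model both patterns use the same 128 bytes of data, but the strided one pulls in 32 segments to get them, which is the kind of waste the quoted explanation is describing.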
_______________________________________________
PyOpenCL mailing list
[email protected]
http://lists.tiker.net/listinfo/pyopencl
