Just thought I should point that the FirePro V4800 is 3 years older than
the GTX 780 Ti and has far fewer cores. Its bandwidth to global memory is
57.6 GB/s vs the 780 Ti's 336.5 GB/s. Comparing the two is pointless, the
FirePro V4800 will always lose.

On 14 August 2015 at 19:12, CRV§ADER//KY <[email protected]> wrote:

> Look up opencl / cuda coalesced memory access on stack overflow, there's
> plenty of threads there
> On 14 Aug 2015 13:55, "Joe Haywood" <[email protected]> wrote:
>
>> Will you explain this to me a little more?
>> "One that jumps to the eye is that you're accessing 4 bytes of memory in
>> an arbitrary place, but every time you're really loading up, and then
>> writing back, a whole page! That's why it's so slow, even without atomic
>> operations. The solution is local memory."
>>
>> Sent from my Samsung Galaxy Tab® S
>>
>> Confidentiality Notice:
>> This e-mail, including any attachments is the property of Trinity Health
>> and is intended for the sole use of the intended recipient(s). It may
>> contain information that is privileged and confidential.  Any unauthorized
>> review, use, disclosure, or distribution is prohibited. If you are not the
>> intended recipient, please delete this message, and reply to the sender
>> regarding the error in a separate email.
>>
>
> _______________________________________________
> PyOpenCL mailing list
> [email protected]
> http://lists.tiker.net/listinfo/pyopencl
>
>
_______________________________________________
PyOpenCL mailing list
[email protected]
http://lists.tiker.net/listinfo/pyopencl

Reply via email to