Erik Trimble wrote:
On 7/18/2010 5:01 AM, "C. Bergström" wrote:
Rob Clark wrote:
If SunStudio had some GPU support and there were a few functions added for the hotpoints ...
GPUs are great for some things but it is difficult to imagine a GPU being of assistance in the zfs implementation due to way too much latency, optimization for floating point rather than integer, and due to creating a "hotpoint". For zfs it is much better to spread the load across multiple CPU cores using many threads as is done now.

Bob
--

If you wait long enough someone will 'build a Bridge to it' ...


Accelerating Distributed Storage Systems with CUDA - Paper & Code
http://www.nvidia.com/object/cuda_apps_flash_new.html#state=detailsOpen;aid=e223fbfc-f017-498c-8174-699747dbd88b
I believe I commented on this thread before and that paper mostly focuses on hashed and md5 checksums. The other problem is that while as long as the GPU is on the other side of the memory controller it's unlikely to be "cost" (performance/latency) effective to do the expensive round-trip for small chunks of data. Depending on the storage topology and stack of course... *If* you had some master node doing checksums or doing it at the application layer it could possible make sense, but it would need to be some seriously huge workload to justify it

GPUs sitting on the PCI-E bus are going to have this problem, and it's likely insurmountable.

HOWEVER, AMD *is* finally getting around to implementing a GPU in the same package as the CPU, so we'll shortly be able to see a combined CPU/GPU thing that sits in an AM3 socket (or, more likely, a C32 socket). There's even a possibility that AMD's talked about where an advanced GPU will live in a second CPU-style socket, with direct HT connections. This sort of design at least leads itself to being used as a co-processor, as it has direct low-latency connection to the memory contoller/bus.
lalala.. hear no evil.. speak no evil... Does it *really* sound so fun to write code generation for x86_64 *AND* ATI VLIW targets... Unless everyone wants to rewrite their code in highly explicit parallel programming models I think there's a huge amount of work before general applications can really benefit from this.. I'd be happy to see a fully automatic solution for optimally offloading general application code to the GPU by 2012..

(I also don't know if Fusion, which is what I think you're referring to, is really going to initially target the high performance/visualization market.. If this is the case it's much less likely to solve performance problems and be better suited for improving efficiency..)
_______________________________________________
zfs-code mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/zfs-code

Reply via email to