[
https://issues.apache.org/jira/browse/CASSANALYTICS-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18051567#comment-18051567
]
Stefan Miklosovic commented on CASSANALYTICS-114:
-------------------------------------------------
draft for inspiration https://github.com/apache/cassandra-analytics/pull/164
I might completely miss something though, I plan to return to this later.
> Investigate GPU accelerated processing, e.g. via Nvidia RAPIDS for Cassandra
> Analytics
> --------------------------------------------------------------------------------------
>
> Key: CASSANALYTICS-114
> URL: https://issues.apache.org/jira/browse/CASSANALYTICS-114
> Project: Apache Cassandra Analytics
> Issue Type: New Feature
> Components: Writer
> Reporter: Stefan Miklosovic
> Assignee: Stefan Miklosovic
> Priority: Normal
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Users who happen to have GPUs to spare can leverage its computing power via
> Nvidia's project called RAPIDS which is a plugin to Apache Spark which maps
> data to VRAM and its processing is accelerated via CUDA.
> It would be nice to explore this in context of Cassandra Analytics so we can
> speed up the processing significantly.
> My preliminary investigation led to the realisation that we would likely need
> to process rows in a columnar fashion - the current processing would stay, it
> just about the way we present the data to Spark API / internals.
> (1)
> https://www.nvidia.com/en-us/ai-data-science/spark-ebook/getting-started-spark-3/#p7-s1?ncid=no-ncid
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]