Hi Spark users,

We have been working on GPU acceleration for Apache Spark SQL / DataFrame using the RAPIDS Accelerator for Apache Spark <https://www.nvidia.com/en-us/deep-learning-ai/solutions/data-science/apache-spark-3/> and the open source project Alluxio <https://github.com/Alluxio/alluxio>, with no changes to application code. Our preliminary results suggest a 2x improvement in performance and a 70% improvement in ROI compared to a CPU-based cluster.
Feel free to read the developer blog <https://bit.ly/2QkXjxo> for more details on the benchmark. If you are interested in discussing it further with the authors, join our free online meetup <https://go.alluxio.io/community-alluxio-day-2021> next Tuesday morning (April 27, Pacific time).

Best,

- Bin Fan