Hi all, I'd like to leverage some of the fast Spark collection implementations in my own code.
Particularity for doing things like distinct counts in a mapPartitions loop. Are there any plans to make the org.apache.spark.util.collection implementations public? Is there any other library out there with similar performance? Cheers, NW