Hi all,

I'd like to leverage some of the fast Spark collection implementations in
my own code.

This is particularly for things like distinct counts inside a
mapPartitions loop.

Are there any plans to make the org.apache.spark.util.collection
implementations public? Is there any other library out there with similar
performance?

Cheers,
NW
