Hi,
Is there any plan to add the countByValue function to Spark SQL Dataframe ?
Even
https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/ml/feature/StringIndexer.scala#L78
is using the RDD part right now, but for ML purposes, being able to get the
most frequent categorical value on multiple columns would be very useful.


Regards,


-- 
*Olivier Girardot* | AssociƩ
o.girar...@lateral-thoughts.com
+33 6 24 09 17 94

Reply via email to