It looks like the next release will add a nice org.apache.spark.mllib.feature package, but what is the recommended way to normalize features in the current release (1.0.2)? I'm hoping for a general pointer here.
At the moment I have an RDD[LabeledPoint], and I can get a MultivariateStatisticalSummary for the mean and variance. Is that about the right way to proceed? I'm also not seeing an easy way to subtract vectors. Do I need to do this element-wise? Thanks.
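
To make the question concrete, here is a minimal sketch of the element-wise approach, assuming the MultivariateStatisticalSummary has already been computed over the feature vectors. The object name FeatureScaling and both method names are hypothetical, not part of MLlib. As far as I can tell the 1.0.x Vector type does not expose arithmetic, so the sketch drops down to toArray and works on plain arrays.

```scala
import org.apache.spark.rdd.RDD
import org.apache.spark.mllib.linalg.Vectors
import org.apache.spark.mllib.regression.LabeledPoint
import org.apache.spark.mllib.stat.MultivariateStatisticalSummary

// Hypothetical helper object; none of these names come from MLlib itself.
object FeatureScaling {

  // Element-wise (x - mean) / std on plain arrays.
  // Columns with zero standard deviation are mapped to 0.0 to avoid dividing by zero.
  def standardizeArray(x: Array[Double],
                       mean: Array[Double],
                       std: Array[Double]): Array[Double] = {
    require(x.length == mean.length && x.length == std.length, "dimension mismatch")
    val out = new Array[Double](x.length)
    var i = 0
    while (i < x.length) {
      out(i) = if (std(i) > 0.0) (x(i) - mean(i)) / std(i) else 0.0
      i += 1
    }
    out
  }

  // Applies the array helper to every LabeledPoint, keeping the label unchanged.
  // The summary is assumed to describe the same feature vectors as `data`.
  def standardizePoints(data: RDD[LabeledPoint],
                        summary: MultivariateStatisticalSummary): RDD[LabeledPoint] = {
    // Pull the statistics out as local arrays so only they are captured by the closure.
    val mean = summary.mean.toArray
    val std = summary.variance.toArray.map(v => math.sqrt(v))
    data.map { p =>
      LabeledPoint(p.label, Vectors.dense(standardizeArray(p.features.toArray, mean, std)))
    }
  }
}
```

Usage would be something like FeatureScaling.standardizePoints(points, summary), where points is the RDD[LabeledPoint] and summary is the statistics object mentioned above. Note that toArray densifies sparse vectors, and subtracting the mean removes sparsity anyway, so for very sparse data it may be preferable to divide by the standard deviation only.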