Hi All, We're working on a Spark application that could make use of a computing quantiles in a streaming fashion. Something in the vein of what DataFu has for Pig http://linkedin.github.io/datafu/docs/current/datafu/pig/stats/StreamingQuantile.html .
Does anything like this exist in the Spark ecosystem? If not, would there be a good place to contribute this if we write it? thanks, Sandy