Hi,

Is there something in Spark like the summary function in R? The summary calculation that comes with Spark (MultivariateStatisticalSummary) operates only on numeric types. I am interested in getting results for string types as well, such as the four most frequently occurring strings (a group-by kind of operation), the number of unique values, etc.

Is there any preexisting code for this? If not, please suggest the best way to deal with string types.

Thanks,
Kundan
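
P.S. To make the request concrete, here is a minimal sketch in plain Python (not Spark) of the kind of summary I have in mind for a string column. The function name `summarize_strings` and the `top_n` parameter are made up for illustration; I assume a Spark version would do the same counting as a group-by over the column.

```python
from collections import Counter


def summarize_strings(values, top_n=4):
    """Summarize a column of strings (hypothetical helper, not a Spark API).

    Returns the total number of values, the number of distinct values,
    and the top_n most frequent values with their counts.
    """
    counts = Counter(values)  # equivalent to a group-by-value count
    return {
        "count": len(values),
        "distinct": len(counts),
        "top": counts.most_common(top_n),
    }


if __name__ == "__main__":
    column = ["red", "blue", "red", "green", "red", "blue"]
    print(summarize_strings(column))
```

For the sample column above, the most frequent value is "red" with a count of 3, and there are 3 distinct values.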