am having problems trying to apply the split-apply-combine strategy
for dataframes using SparkR.
I have a largish dataframe and I would like to achieve something similar
to what
ddply(df, .(id), foo)
would do, only that using SparkR as computing engine. My df has a few
million records
Shivaram
On Thu, Aug 14, 2014 at 3:53 PM, Carlos J. Gil Bellosta
gilbello...@gmail.com wrote:
Hello,
I am having problems trying to apply the split-apply-combine strategy
for dataframes using SparkR.
I have a largish dataframe and I would like to achieve something similar
to what
ddply(df