Thanks for the link to the bug.
Unfortunately, using accumulators like this is getting spread around as a
recommended practice despite the bug.
From: Daniel Siegmann [mailto:daniel.siegm...@velos.io]
Sent: Monday, November 17, 2014 8:32 AM
To: Segerlind, Nathan L
Cc: user
Subject: Re
Hi All.
I am trying to get my head around why using accumulators and accumulables seems
to be the most recommended method for accumulating running sums, averages,
variances and the like, whereas the aggregate method seems to me to be the
right one. I have no performance measurements as of yet,
Howdy.
Is it possible to initiate Spark jobs from Oozie (presumably as a java action)?
If so, are there known limitations to this? And would anybody have a pointer
to an example?
Thanks,
Nate