I am running on a 15-node cluster and am trying to tune partitioning so the work is balanced across all nodes. I am currently using an Accumulator to track work by MAC address, but I would prefer to use identifiers the Spark environment already knows about: Executor ID and Function ID show up in the Spark UI, and Task ID and Attempt ID (assuming these work like their Hadoop counterparts) would also be useful. Does anyone know how code running inside a function can access these values? I think I have asked this list about Task ID and Attempt ID a couple of times before without getting a reply.
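To make the question concrete, here is roughly the kind of thing I am hoping works. This is only a sketch: I am assuming TaskContext.get() is callable from inside a task closure (I believe it is in Spark 1.3+), and that SparkEnv.get.executorId is a legitimate way to read the executor ID on the worker side. The object/app name here is made up for illustration.

```scala
import org.apache.spark.{SparkConf, SparkContext, SparkEnv, TaskContext}

// Hypothetical example app: log the task/stage/attempt/executor IDs
// from inside each partition's task.
object TaskIdProbe {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("task-id-probe"))

    // 15 partitions to mirror the 15-node cluster
    sc.parallelize(1 to 1000, 15).foreachPartition { _ =>
      val tc = TaskContext.get() // non-null only inside a running task
      println(s"stage=${tc.stageId()} " +
        s"partition=${tc.partitionId()} " +
        s"taskAttemptId=${tc.taskAttemptId()} " + // unique per attempt
        s"attemptNumber=${tc.attemptNumber()} " + // 0 on first try, 1 on retry, ...
        s"executor=${SparkEnv.get.executorId}")
    }
    sc.stop()
  }
}
```

If that is the right approach, counting records per partitionId/executorId pair would let me see the imbalance directly instead of going through the MAC-address Accumulator.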
Incidentally, the data I have collected so far suggests that my execution is not at all balanced across the nodes.