Depends on what you mean by "output data". Do you mean:
* the data that is sent back to the driver? That is "Result Size".
* the shuffle output? That is in "Shuffle Write Metrics".
* the data written to a Hadoop output format? That is in "Output Metrics".
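If it helps, here is a rough sketch of pulling those three numbers out of the JSON event log for each task. It assumes the log has one JSON object per line, that per-task metrics sit in "SparkListenerTaskEnd" events under "Task Metrics", and that the byte counts are named "Shuffle Bytes Written" and "Bytes Written". Those nested key names are my assumption about the log layout and may differ between Spark versions, so check them against your own log. The function name task_output_sizes is just something made up for this example.

```python
import json


def task_output_sizes(lines):
    """Collect per-task size metrics from JSON event-log lines.

    Assumes one JSON object per line. The key names below are
    assumptions about the event log layout; verify them against
    the log produced by your Spark version.
    """
    rows = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        event = json.loads(line)
        if event.get("Event") != "SparkListenerTaskEnd":
            continue  # only task-end events carry per-task metrics
        metrics = event.get("Task Metrics") or {}
        # These sub-sections are absent when a task did no shuffle
        # write or no Hadoop output, so fall back to empty dicts.
        shuffle = metrics.get("Shuffle Write Metrics") or {}
        output = metrics.get("Output Metrics") or {}
        rows.append({
            "task_id": (event.get("Task Info") or {}).get("Task ID"),
            "result_size": metrics.get("Result Size"),
            "shuffle_bytes_written": shuffle.get("Shuffle Bytes Written"),
            "output_bytes_written": output.get("Bytes Written"),
        })
    return rows
```

You would call it with an open file handle on the event log, e.g. task_output_sizes(open(path)), where path points at your log file. A value of None in a row means that section was not present for that task.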
On Thu, May 14, 2015 at 2:22 PM, yanwei <echo....@gmail.com> wrote:
> I am trying to extract the *output data size* information for *each task*.
> What *field(s)* should I look for, given the json-format log?
>
> Also, what does "Result Size" stand for?
>
> Thanks a lot in advance!
> -Yanwei