depends what you mean by "output data".  Do you mean:

* the data that is sent back to the driver? that is "result size"
* the shuffle output?  that is in "Shuffle Write Metrics"
* the data written to a hadoop output format?  that is in "Output Metrics"

On Thu, May 14, 2015 at 2:22 PM, yanwei <echo....@gmail.com> wrote:

> I am trying to extract the *output data size* information for *each task*.
> What *field(s)* should I look for, given the json-format log?
>
> Also, what does "Result Size" stand for?
>
> Thanks a lot in advance!
> -Yanwei
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/spark-log-field-clarification-tp22892.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
>
>

Reply via email to