I would like to track metrics of the Pig job across runs. For example:
- overall runtime
- records read
- records written

This is available in the Pig client log and needs to be parsed out:

> Successfully read 14171 records (401 bytes) from: "..."
> Successfully stored 8728906 records (642415596 bytes) in: "..."
> 2019-11-16 20:51:43,448 [MainThread] INFO  org.apache.pig.Main - Pig
> script completed in 6 minutes, 46 seconds and 854 milliseconds (406854 ms)


Is there a way to get this data in more machine-friendly format, like
CSV/JSON/etc?

Reply via email to