[
https://issues.apache.org/jira/browse/PIG-2855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dmitriy V. Ryaboy updated PIG-2855:
-----------------------------------
Release Note:
New Feature: Timing your UDFs
The first step to improving performance and efficiency is measuring where the
time is going. Pig provides a light-weight method for approximately measuring
how much time is spent in different user-defined functions (UDFs) and Loaders.
Simply set the pig.udf.profile property to true. This will cause new counters
to be tracked for all Map-Reduce jobs generated by your script:
approx_microsecs measures the approximate amount of time spent in a UDF, and
approx_invocations measures the approximate number of times the UDF was
invoked. Note that this may produce a large number of counters (two per UDF).
Excessive amounts of counters can lead to poor JobTracker performance, so use
this feature carefully, and preferably on a test cluster.
> Provide a method to measure time spent in UDFs
> ----------------------------------------------
>
> Key: PIG-2855
> URL: https://issues.apache.org/jira/browse/PIG-2855
> Project: Pig
> Issue Type: New Feature
> Reporter: Dmitriy V. Ryaboy
> Assignee: Dmitriy V. Ryaboy
> Fix For: 0.11
>
> Attachments: PIG-2855.2.patch, PIG-2855.patch
>
>
> When debugging slow jobs, it is often useful to know whether time is being
> spent in UDFs, and in which UDFs. This is easy to measure from within the
> framework, we should let users optionally track these metrics.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira