[ 
https://issues.apache.org/jira/browse/PIG-2855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy V. Ryaboy updated PIG-2855:
-----------------------------------

    Release Note: 
New Feature: Timing your UDFs

The first step to improving performance and efficiency is measuring where the 
time is going. Pig provides a light-weight method for approximately measuring 
how much time is spent in different user-defined functions (UDFs) and Loaders. 
Simply set the pig.udf.profile property to true. This will cause new counters 
to be tracked for all Map-Reduce jobs generated by your script: 
approx_microsecs measures the approximate amount of time spent in a UDF, and 
approx_invocations measures the approximate number of times the UDF was 
invoked. Note that this may produce a large number of counters (two per UDF). 
Excessive amounts of counters can lead to poor JobTracker performance, so use 
this feature carefully, and preferably on a test cluster.

    
> Provide a method to measure time spent in UDFs
> ----------------------------------------------
>
>                 Key: PIG-2855
>                 URL: https://issues.apache.org/jira/browse/PIG-2855
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Dmitriy V. Ryaboy
>            Assignee: Dmitriy V. Ryaboy
>             Fix For: 0.11
>
>         Attachments: PIG-2855.2.patch, PIG-2855.patch
>
>
> When debugging slow jobs, it is often useful to know whether time is being 
> spent in UDFs, and in which UDFs. This is easy to measure from within the 
> framework, we should let users optionally track these metrics.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to