[jira] Commented: (HIVE-1638) convert commonly used udfs to generic udfs

Siying Dong (JIRA) Wed, 29 Sep 2010 12:21:59 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-1638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12916244#action_12916244
 ]


Siying Dong commented on HIVE-1638:
-----------------------------------

I should have made it clearer.

"CPU Cycle (MapRed Framework)" is the "CPU_MILLISECONDS" reported in 
"Map-Reduce Framework" section of job page.

"Total CPU Time (hmon)" is not the query execution time. It is the average time 
the query takes the cluster. It aggregates resource usage this job takes on 
each machine and normalized by the total resource of the cluster (this one 
specifically should be number of cores). It also includes reducer's costs. It's 
trend should be very similar to the first one, just another source.



> convert commonly used udfs to generic udfs
> ------------------------------------------
>
>                 Key: HIVE-1638
>                 URL: https://issues.apache.org/jira/browse/HIVE-1638
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Siying Dong
>         Attachments: HIVE-1638.1.patch
>
>
> Copying a mail from Joy:
> i did a little bit of profiling of a simple hive group by query today. i was 
> surprised to see that one of the most expensive functions were in converting 
> the equals udf (i had some simple string filters) to generic udfs. 
> (primitiveobjectinspectorconverter.textconverter)
> am i correct in thinking that the fix is to simply port some of the most 
> popular udfs (string equality/comparison etc.) to generic udsf?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-1638) convert commonly used udfs to generic udfs

Reply via email to