[ 
https://issues.apache.org/jira/browse/HIVE-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuntao Jia updated HIVE-396:
----------------------------

    Attachment: hive_benchmark_2009-07-21.tar.gz

Updated the benchmark script to make it more automated. It now writes all the 
timings to a CSV file that looks like this:

Timings, grep select, rankings select, uservisits aggregation, uservisits-rankings join
Trial 1
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
Trial 2
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
Trial 3
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1

The first line lists the queries; the query timings from each trial follow. 
Each trial has three lines giving the query timings for Hive, PIG and Hadoop, 
respectively. The numbers here are for illustration purposes only.
The file can be opened directly in Excel, where a performance graph can easily 
be generated from it.
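
For anyone who prefers to post-process the file outside Excel, the layout 
described above can be read back with a short script. The following is only a 
minimal sketch in Python, not part of the attached benchmark script: the names 
parse_timings and mean_by_system are hypothetical, and it assumes that the 
header sits on a single line in the actual file and that the trailing commas 
on some rows can be ignored.

```python
import csv
import io
from statistics import mean

# Illustrative data copied from the example above (two trials only).
SAMPLE = """\
Timings, grep select, rankings select, uservisits aggregation, uservisits-rankings join
Trial 1
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
Trial 2
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
"""


def parse_timings(text):
    """Return (queries, trials); trials maps trial -> system -> query -> timing."""
    rows = []
    for row in csv.reader(io.StringIO(text)):
        cells = [cell.strip() for cell in row]
        if any(cells):  # skip blank lines
            rows.append(cells)
    if not rows:
        raise ValueError("empty timing file")

    # Header row: a "Timings" label followed by the query names.
    queries = [cell for cell in rows[0][1:] if cell]

    trials = {}
    current = None
    for cells in rows[1:]:
        # A "Trial N" line starts a new group of per-system rows.
        if cells[0].lower().startswith("trial") and not any(cells[1:]):
            current = cells[0]
            trials[current] = {}
            continue
        if current is None:
            raise ValueError("timing row found before any 'Trial' line")
        # Empty cells come from a trailing comma and are dropped.
        values = [float(cell) for cell in cells[1:] if cell]
        if len(values) != len(queries):
            raise ValueError("unexpected number of timings for " + cells[0])
        trials[current][cells[0]] = dict(zip(queries, values))
    return queries, trials


def mean_by_system(queries, trials):
    """Average each system's timing per query across all trials."""
    systems = []
    for per_system in trials.values():
        for name in per_system:
            if name not in systems:
                systems.append(name)
    return {
        system: {
            query: mean(t[system][query] for t in trials.values() if system in t)
            for query in queries
        }
        for system in systems
    }
```

Calling parse_timings(SAMPLE) and passing the result to mean_by_system gives 
one averaged timing per system and query, which can then be plotted with any 
charting tool.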

> Hive performance benchmarks
> ---------------------------
>
>                 Key: HIVE-396
>                 URL: https://issues.apache.org/jira/browse/HIVE-396
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Zheng Shao
>            Assignee: Yuntao Jia
>         Attachments: hive_benchmark_2009-06-18.pdf, 
> hive_benchmark_2009-06-18.tar.gz, hive_benchmark_2009-07-12.pdf, 
> hive_benchmark_2009-07-21.tar.gz
>
>
> We need some performance benchmarks to measure and track the performance 
> improvements of Hive.
> Some references:
> PIG performance benchmarks PIG-200
> PigMix: http://wiki.apache.org/pig/PigMix

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
