[ 
https://issues.apache.org/jira/browse/KNOX-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16650272#comment-16650272
 ] 

Kevin Risden edited comment on KNOX-1524 at 10/15/18 2:27 PM:
--------------------------------------------------------------

h2. Result Interpretation

There is a JVM startup time of about 2 seconds as shown by the HDFS native 
results. HiveServer2 adds quite a bit of overhead compared to reading the file 
directly from HDFS. Doubling the time from HDFS native to return the results 
with HiveServer2. HiveServer2 Binary mode is more efficient than HTTP mode. The 
Knox overhead is lower than the overhead of HiveServer2 HTTP vs binary mode. 
There are probably some improvements that can be made to HiveServer2 to improve 
the performance. 


was (Author: risdenk):
h2. Result Interpretation

There is a JVM startup time of about 2 seconds as shown by the HDFS native 
results. HiveServer2 adds quite a bit of overhead compared to reading the file 
directly from HDFS. Doubling the time from HDFS native to return the results 
with HiveServer2. HiveServer2 Binary mode is more efficient than HTTP mode. The 
Knox overhead is about the same as the binary vs http mode overhead. There are 
probably some improvements that can be made to HiveServer2 to improve the 
performance. 

> Hive "select *" performance evaluation
> --------------------------------------
>
>                 Key: KNOX-1524
>                 URL: https://issues.apache.org/jira/browse/KNOX-1524
>             Project: Apache Knox
>          Issue Type: Task
>            Reporter: Kevin Risden
>            Assignee: Kevin Risden
>            Priority: Major
>             Fix For: 1.2.0
>
>
> While looking at WebHDFS performance in KNOX-1221, I decided to look a bit 
> more into performance for common use cases. Hive performance is another area 
> that could use some research.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to