[ 
https://issues.apache.org/jira/browse/HIVE-20486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16663962#comment-16663962
 ] 

slim bouguerra commented on HIVE-20486:
---------------------------------------

looked at the suggested option and the code base 
https://github.com/apache/hive/blob/37c7fd7833eba087eadd8048dbc63b403b272104/ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/Vectorizer.java#L1462

i see that it is only possible to enable this for some pre selected Input 
formats.
Thus i have created a new vectorized Kafka Reader.



> Kafka: Use Row SerDe + vectorization
> ------------------------------------
>
>                 Key: HIVE-20486
>                 URL: https://issues.apache.org/jira/browse/HIVE-20486
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Gopal V
>            Priority: Major
>
> KafkaHandler returns unvectorized rows which causes the operators downstream 
> to be slower and sub-optimal.
> Hive has a vectorization shim which allows Kafka streams without complex 
> projections to be wrapped into a vectorized reader via 
> {{hive.vectorized.use.row.serde.deserialize}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to