[ 
https://issues.apache.org/jira/browse/HIVE-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13741362#comment-13741362
 ] 

Leo Romanoff commented on HIVE-1511:
------------------------------------

[~appodictic] Regarding the speed: I already mentioned these ideas on Kryo 
mailing list, but may be it was overlooked. Kryo is very fast when used 
properly.

Here are some possible improvements:

1) Do not create new Kryo instances every time you need to (de)serialize. It is 
a pretty costly operation. Reuse Kryo instances instead. Use object pooling for 
Kryo instances. But be careful with multi-threading. Kryo instances cannot be 
used by different threads at _the same time_. 

2) Preregister classes (use Kryo.register) to be serialized if possible. It 
makes serialization faster and serialized representations shorter.
                
> Hive plan serialization is slow
> -------------------------------
>
>                 Key: HIVE-1511
>                 URL: https://issues.apache.org/jira/browse/HIVE-1511
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 0.7.0
>            Reporter: Ning Zhang
>            Assignee: Mohammad Kamrul Islam
>         Attachments: HIVE-1511.patch, HIVE-1511-wip2.patch, 
> HIVE-1511-wip3.patch, HIVE-1511-wip.patch
>
>
> As reported by Edward Capriolo:
> For reference I did this as a test case....
> SELECT * FROM src where
> key=0 OR key=0 OR key=0 OR  key=0 OR key=0 OR key=0 OR key=0 OR key=0
> OR key=0 OR key=0 OR key=0 OR
> key=0 OR key=0 OR key=0 OR  key=0 OR key=0 OR key=0 OR key=0 OR key=0
> OR key=0 OR key=0 OR key=0 OR
> ...(100 more of these)
> No OOM but I gave up after the test case did not go anywhere for about
> 2 minutes.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to