[
https://issues.apache.org/jira/browse/HIVE-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683783#action_12683783
]
He Yongqiang commented on HIVE-352:
-----------------------------------
Thanks, Joydeep and Zheng. The advises are really helpful.
I have written a draft document according to suggestions from Zheng and Joydeep.
Here is the link: http://docs.google.com/Doc?id=dc9jpfdr_3ft7w3hc4
I agree with you guys, we can start from B2, and then B1. And finally find out
should we need to add the VFile in.
BTW, yestoday i also took a look on MapFile, which i found VFile has a same
with MapFile in that VFlie sometimes also need an index file. The main
difference is that VFile does not need a key part and sometimes even the
value's length part. Because a VFile stores one column, each column has a type,
and if the data type of that column is fix lengthed, it only needs to store the
raw value bytes.
> Make Hive support column based storage
> --------------------------------------
>
> Key: HIVE-352
> URL: https://issues.apache.org/jira/browse/HIVE-352
> Project: Hadoop Hive
> Issue Type: New Feature
> Reporter: He Yongqiang
>
> column based storage has been proven a better storage layout for OLAP.
> Hive does a great job on raw row oriented storage. In this issue, we will
> enhance hive to support column based storage.
> Acctually we have done some work on column based storage on top of hdfs, i
> think it will need some review and refactoring to port it to Hive.
> Any thoughts?
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.