[ 
https://issues.apache.org/jira/browse/HIVE-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683783#action_12683783
 ] 

He Yongqiang commented on HIVE-352:
-----------------------------------

Thanks, Joydeep and Zheng. The advises are really helpful.
I have written a draft document according to suggestions from Zheng and Joydeep.
Here is the link: http://docs.google.com/Doc?id=dc9jpfdr_3ft7w3hc4

I agree with you guys, we can start from B2, and then B1. And finally find out 
should we need to add the VFile in.
BTW, yestoday i also took a look on MapFile, which i found VFile has a same 
with MapFile in that VFlie sometimes also need an index file. The main 
difference is that VFile does not need a key part and sometimes even the 
value's length part. Because a VFile stores one column, each column has a type, 
and if the data type of that column is fix lengthed, it only needs to store the 
raw value bytes.

> Make Hive support column based storage
> --------------------------------------
>
>                 Key: HIVE-352
>                 URL: https://issues.apache.org/jira/browse/HIVE-352
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: He Yongqiang
>
> column based storage has been proven a better storage layout for OLAP. 
> Hive does a great job on raw row oriented storage. In this issue, we will 
> enhance hive to support column based storage. 
> Acctually we have done some work on column based storage on top of hdfs, i 
> think it will need some review and refactoring to port it to Hive.
> Any thoughts?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to