[ 
https://issues.apache.org/jira/browse/HIVE-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12704807#action_12704807
 ] 

Zheng Shao commented on HIVE-352:
---------------------------------

hive-352-2009-5-1-3.patch

Can you remove the extra message "FileSplit's start is 0, its length is 299"?
Or use LOG.info/LOG.debug.
{code}
hive> select * from zshao_rc;
OK
FileSplit's start is 0, its length is 299
123     456     NULL
Time taken: 0.09 seconds
{code}

Can you find the error messsage in the code, and fix it?
You probably just need to add your ColumnarSerDe to the internal SerDe list.
{code}
hive> alter table zshao_rc replace columns(a int);
Replace columns is not supported for this table. SerDe may be incompatible.
FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.DDLTask
{code}

Can you allow extra columns in the metadata? Just assign NULLs to the columns 
in the metadata but NOT in the data.
{code}
hive> alter table zshao_rc add columns(a int);
Column 'a' exists
FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.DDLTask
hive> alter table zshao_rc add columns(d int);
hive> select * from zshao_rc;
FileSplit's start is 0, its length is 299
Failed with exception This BytesRefArrayWritable only has 3 valid values.
{code}


> Make Hive support column based storage
> --------------------------------------
>
>                 Key: HIVE-352
>                 URL: https://issues.apache.org/jira/browse/HIVE-352
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: He Yongqiang
>            Assignee: He Yongqiang
>         Attachments: 4-22 performace2.txt, 4-22 performance.txt, 4-22 
> progress.txt, hive-352-2009-4-15.patch, hive-352-2009-4-16.patch, 
> hive-352-2009-4-17.patch, hive-352-2009-4-19.patch, 
> hive-352-2009-4-22-2.patch, hive-352-2009-4-22.patch, 
> hive-352-2009-4-23.patch, hive-352-2009-4-27.patch, 
> hive-352-2009-4-30-2.patch, hive-352-2009-4-30-3.patch, 
> hive-352-2009-4-30-4.patch, hive-352-2009-5-1-3.patch, 
> hive-352-2009-5-1.patch, HIve-352-draft-2009-03-28.patch, 
> Hive-352-draft-2009-03-30.patch
>
>
> column based storage has been proven a better storage layout for OLAP. 
> Hive does a great job on raw row oriented storage. In this issue, we will 
> enhance hive to support column based storage. 
> Acctually we have done some work on column based storage on top of hdfs, i 
> think it will need some review and refactoring to port it to Hive.
> Any thoughts?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to