Hi Vijay,

DynamicSerDe is deprecated.

Please use the following SerDe instead:
https://issues.apache.org/jira/browse/HIVE-662

Can you point us to where you see this example? We should update it with
RegexSerDe.

Zheng

On Mon, Oct 12, 2009 at 4:46 PM, Vijay <[email protected]> wrote:

> Hi,
>
> I have this very strange problem of queries hanging when I use the
> org.apache.hadoop.hive.serde2.dynamic_type.DynamicSerDe with the
> serialization format
> org.apache.hadoop.hive.serde2.thrift.TCTLSeparatedProtocol. I'm basically
> loading a very standard apache log file (pretty small) using this example
> from one of the examples on the net.
>
> CREATE TABLE apachelog (
> ipaddress STRING, identd STRING, user STRING,finishtime STRING,
> requestline string, returncode INT, size INT)
> ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.dynamic_type.DynamicSerDe'
> WITH SERDEPROPERTIES (
>
> 'serialization.format'='org.apache.hadoop.hive.serde2.thrift.TCTLSeparatedProtocol',
> 'quote.delim'='("|\\[|\\])',
> 'field.delim'=' ',
> 'serialization.null.format'='-')
> STORED AS TEXTFILE;
>
> After that, I simply load a small log file. Everything is good except, when
> I run any simple query like SELECT count(1) from apachelog, the job "hangs."
> It basically sits there forever and the job gets killed after 10 minutes. I
> must be missing something very basic.
>
> I can get the same setup working well using the RegexSerDe but I'd rather
> use the DynamicSerDe like above. I'm using hive-0.4 branch but I'm pretty
> sure I saw the same behavior using trunk as well. There is not a thing I can
> find in the /tmp/<>/hive.log file.
>
> Thanks for your help!
> Vijay
>



-- 
Yours,
Zheng

Reply via email to