[ 
https://issues.apache.org/jira/browse/HIVE-333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joydeep Sen Sarma reassigned HIVE-333:
--------------------------------------

    Assignee: Joydeep Sen Sarma

> Add TFileTransport deserializer
> -------------------------------
>
>                 Key: HIVE-333
>                 URL: https://issues.apache.org/jira/browse/HIVE-333
>             Project: Hadoop Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>         Environment: Linux
>            Reporter: Steve Corona
>            Assignee: Joydeep Sen Sarma
>
> I've been googling around all night and havn't really found what I am looking 
> for. Basically, I want to transfer some data from my web servers to hive  in 
> a format that's a little more verbose than plain CSV files. It seems like 
> JSON or thrift would be perfect for this. I am planning on sending this 
> serialized json or thrift data through scribe and loading it into Hive.. I 
> just can't figure out how to tell hive that the input data is a bunch of 
> serialized thrift records (all of the records are the "struct" type)  in a 
> TFileTransport. Hopefully this makes sense...
> Reply from Joydeep Sen Sarma (jssa...@facebook.com)
> Unfortunately the open source code base does not have the loaders we run to 
> convert thrift records in a tfiletransport into a sequencefile that 
> hadoop/hive can work with. One option is that we add this to Hive code base 
> (should be straightforward).
> No process required. Please file a jira - I will try to upload a patch this 
> weekend (just cut'n'paste for most part). Would appreciate some help in 
> finessing it out .. (the internal code is hardwired to some assumptions etc. )

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to