TIMESTAMP type
Hello, I inspected the grammar Hive.g and decided to create a new type for TIMESTAMP. TIMESTAMP is not a primitive type or list type or map type. It is a timestamp type of the form TIMESTAMP(MMDDHHMMSS) which is different from other types. Please let me know if there is any other suggestions. Thanks, shyam_sar...@yahoo.com
[jira] Created: (HIVE-333) Add TFileTransport deserializer
Add TFileTransport deserializer --- Key: HIVE-333 URL: https://issues.apache.org/jira/browse/HIVE-333 Project: Hadoop Hive Issue Type: New Feature Components: Serializers/Deserializers Environment: Linux Reporter: Steve Corona I've been googling around all night and havn't really found what I am looking for. Basically, I want to transfer some data from my web servers to hive in a format that's a little more verbose than plain CSV files. It seems like JSON or thrift would be perfect for this. I am planning on sending this serialized json or thrift data through scribe and loading it into Hive.. I just can't figure out how to tell hive that the input data is a bunch of serialized thrift records (all of the records are the struct type) in a TFileTransport. Hopefully this makes sense... Reply from Joydeep Sen Sarma (jssa...@facebook.com) Unfortunately the open source code base does not have the loaders we run to convert thrift records in a tfiletransport into a sequencefile that hadoop/hive can work with. One option is that we add this to Hive code base (should be straightforward). No process required. Please file a jira - I will try to upload a patch this weekend (just cut'n'paste for most part). Would appreciate some help in finessing it out .. (the internal code is hardwired to some assumptions etc. ) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Resolved: (HIVE-323) row counts for one query are being in printed subsequent queries
[ https://issues.apache.org/jira/browse/HIVE-323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joydeep Sen Sarma resolved HIVE-323. Resolution: Duplicate row counts for one query are being in printed subsequent queries Key: HIVE-323 URL: https://issues.apache.org/jira/browse/HIVE-323 Project: Hadoop Hive Issue Type: Bug Components: Query Processor Reporter: Joydeep Sen Sarma when executing multiple queries from the cli - i am seeing the row count state being maintained/printed across queries: q1 N1 rows q2 N1 rows inserted N2 rows inserted -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
Re: TIMESTAMP type
HI Shyam, Conforming to the standard (like the MySQL 6.0 TIMESTAMP specification) is definitely the ultimate goal we want to achieve. Go ahead with the design and let us know when you have something ready for review. My previous point was basically that it might be much easier to add support for TIMESTAMP using UDFs. Of course that is more like a short-term plan. In the longer term we definitely need something like what you suggested. Zheng On Sat, Mar 7, 2009 at 8:30 PM, Shyam Sarkar shyam_sar...@yahoo.com wrote: Hi Zheng, From my earlier discussion with the group I got the suggestion that we should implement MySQL 6.0 TIMESTAMP specifications:: http://dev.mysql.com/doc/refman/6.0/en/timestamp.html Please suggest if all the properties described for MySQL 6.0 can be done by UDF or not. Are you suggesting that UDF should be explicitely called all the time to decode TIMESTAMP string? This may require may different UDFs to be defined and called. Also this may be non-standard way to implement TIMESTAMP. Please suggest. Thanks, SHyam --- On Sat, 3/7/09, Zheng Shao zsh...@gmail.com wrote: From: Zheng Shao zsh...@gmail.com Subject: Re: TIMESTAMP type To: hive-dev@hadoop.apache.org, shyam_sar...@yahoo.com Date: Saturday, March 7, 2009, 4:29 PM The problem is that different people may have different formats to store TIMESTAMP. The easiest way to go is to declare the field as STRING, and then use UDF to manipulate the content. Zheng On Sat, Mar 7, 2009 at 6:08 AM, Shyam Sarkar shyam_sar...@yahoo.com wrote: Hello, I inspected the grammar Hive.g and decided to create a new type for TIMESTAMP. TIMESTAMP is not a primitive type or list type or map type. It is a timestamp type of the form TIMESTAMP(MMDDHHMMSS) which is different from other types. Please let me know if there is any other suggestions. Thanks, shyam_sar...@yahoo.com -- Yours, Zheng -- Yours, Zheng
JIRA_HIVE-286.2.patch_UNIT_TEST_SUCCEEDED
SUCCESS: BUILD AND UNIT TEST using PATCH HIVE-286.2.patch PASSED!!