TIMESTAMP type

2009-03-07 Thread Shyam Sarkar

Hello,

I inspected the grammar Hive.g and decided to create a new type for TIMESTAMP. 
TIMESTAMP is not a primitive type or list type or map type. It is a timestamp 
type of the form TIMESTAMP(MMDDHHMMSS) which is different from other types.

Please let me know if there are any other suggestions.

Thanks,
shyam_sar...@yahoo.com





  


[jira] Created: (HIVE-333) Add TFileTransport deserializer

2009-03-07 Thread Steve Corona (JIRA)
Add TFileTransport deserializer
---

 Key: HIVE-333
 URL: https://issues.apache.org/jira/browse/HIVE-333
 Project: Hadoop Hive
  Issue Type: New Feature
  Components: Serializers/Deserializers
 Environment: Linux
Reporter: Steve Corona


I've been googling around all night and haven't really found what I am looking 
for. Basically, I want to transfer some data from my web servers to Hive in a 
format that's a little more verbose than plain CSV files. It seems like JSON or 
Thrift would be perfect for this. I am planning on sending this serialized JSON 
or Thrift data through Scribe and loading it into Hive. I just can't figure 
out how to tell Hive that the input data is a bunch of serialized Thrift 
records (all of the records are the struct type) in a TFileTransport. 
Hopefully this makes sense...

Reply from Joydeep Sen Sarma (jssa...@facebook.com)

Unfortunately the open source code base does not have the loaders we run to 
convert Thrift records in a TFileTransport into a SequenceFile that Hadoop/Hive 
can work with. One option is that we add this to the Hive code base (should be 
straightforward).

No process required. Please file a JIRA - I will try to upload a patch this 
weekend (just cut'n'paste for the most part). Would appreciate some help in 
finessing it (the internal code is hardwired to some assumptions, etc.).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Resolved: (HIVE-323) row counts for one query are being in printed subsequent queries

2009-03-07 Thread Joydeep Sen Sarma (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joydeep Sen Sarma resolved HIVE-323.


Resolution: Duplicate

 row counts for one query are being in printed subsequent queries
 

 Key: HIVE-323
 URL: https://issues.apache.org/jira/browse/HIVE-323
 Project: Hadoop Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Joydeep Sen Sarma

 When executing multiple queries from the CLI, I am seeing the row count 
 state being maintained/printed across queries:
 q1
 N1 rows
 q2
 N1 rows inserted
 N2 rows inserted




Re: TIMESTAMP type

2009-03-07 Thread Zheng Shao
Hi Shyam,

Conforming to the standard (like the MySQL 6.0 TIMESTAMP specification) is
definitely the ultimate goal we want to achieve.
Go ahead with the design and let us know when you have something ready for
review.

My previous point was basically that it might be much easier to add support
for TIMESTAMP using UDFs. Of course that is more like a short-term plan.
In the longer term we definitely need something like what you suggested.


Zheng

On Sat, Mar 7, 2009 at 8:30 PM, Shyam Sarkar shyam_sar...@yahoo.com wrote:


 Hi Zheng,

 From my earlier discussion with the group I got the suggestion that we
 should implement the MySQL 6.0 TIMESTAMP specification:

 http://dev.mysql.com/doc/refman/6.0/en/timestamp.html

 Please suggest if all the properties described for MySQL 6.0 can be
 done by UDF or not.

 Are you suggesting that a UDF should be explicitly called all the time
 to decode the TIMESTAMP string? This may require many different UDFs to be
 defined and called. Also, this may be a non-standard way to implement
 TIMESTAMP.

 Please suggest.

 Thanks,
 Shyam



 --- On Sat, 3/7/09, Zheng Shao zsh...@gmail.com wrote:

  From: Zheng Shao zsh...@gmail.com
  Subject: Re: TIMESTAMP type
  To: hive-dev@hadoop.apache.org, shyam_sar...@yahoo.com
  Date: Saturday, March 7, 2009, 4:29 PM
  The problem is that different people may have different formats to store
  TIMESTAMP.
 
  The easiest way to go is to declare the field as STRING, and then use UDF to
  manipulate the content.
 
  Zheng
 
  On Sat, Mar 7, 2009 at 6:08 AM, Shyam Sarkar shyam_sar...@yahoo.com wrote:
 
   Hello,
  
   I inspected the grammar Hive.g and decided to create a new type for
   TIMESTAMP. TIMESTAMP is not a primitive type or list type or map type. It is
   a timestamp type of the form TIMESTAMP(MMDDHHMMSS) which is different
   from other types.
  
   Please let me know if there are any other suggestions.
  
   Thanks,
   shyam_sar...@yahoo.com
  
  
  
  
  
  
  
 
 
  --
  Yours,
  Zheng






-- 
Yours,
Zheng
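
For readers following the thread, the short-term approach discussed above
(declare the field as STRING and decode it with a UDF) might look roughly like
the sketch below. This is plain Java showing only the decoding logic such a UDF
could wrap; it does not use Hive's UDF API. The class name TimestampStrings,
the method toEpochSeconds, the "yyyy-MM-dd HH:mm:ss" pattern, and the choice to
interpret values as UTC are all assumptions made for illustration and are not
taken from the thread.

```java
import java.time.LocalDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;

// Hypothetical helper: the decoding step a string-based timestamp UDF might perform.
class TimestampStrings {
    // Assumed storage format. The thread points out that people store
    // timestamps in different formats, so a real UDF would likely accept
    // the pattern as an argument instead of hard-coding it.
    private static final DateTimeFormatter FORMAT =
        DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");

    // Returns seconds since the Unix epoch, treating the string as UTC
    // (an assumption), or null when the input is null or does not match
    // the assumed format.
    static Long toEpochSeconds(String value) {
        if (value == null) {
            return null;
        }
        try {
            return LocalDateTime.parse(value, FORMAT).toEpochSecond(ZoneOffset.UTC);
        } catch (DateTimeParseException e) {
            return null;
        }
    }

    public static void main(String[] args) {
        System.out.println(toEpochSeconds("2009-03-07 20:30:00"));
        System.out.println(toEpochSeconds("not a timestamp"));
    }
}
```

The sketch also illustrates Shyam's concern: every query has to call the
decoding function explicitly, and each storage format needs its own pattern or
its own function, which is the cost a native TIMESTAMP type would remove.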


JIRA_HIVE-286.2.patch_UNIT_TEST_SUCCEEDED

2009-03-07 Thread Murli Varadachari

SUCCESS: BUILD AND UNIT TEST using PATCH HIVE-286.2.patch PASSED!!