[
https://issues.apache.org/jira/browse/PIG-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587189#comment-13587189
]
Guido Serra aka Zeph commented on PIG-1832:
-------------------------------------------
even... they just updated ( PIG-2341 ) the documentation:
- http://pig.apache.org/docs/r0.11.0/func.html#HBaseStorage
I'd say, that just having the double usage of "-timestamp=", at LOAD and on
STORE, is all we need
right now (as of version 0.11), this option is being taken into consideration
only at LOAD time
p.s. there is a scenario though, which I'm covering with a python/jython custom
script, that puzzles me... what if only a cell (row/column intersection)
changes? HBase by design stores a new entry at a given timestamp for all the
family:columns provided, even if they are identical ... shall we compute the
difference within the HBaseStorage, or shall the user handle it?
> Support timestamp in HBaseStorage
> ---------------------------------
>
> Key: PIG-1832
> URL: https://issues.apache.org/jira/browse/PIG-1832
> Project: Pig
> Issue Type: Improvement
> Environment: Java 6, Mac OS X 10.6
> Reporter: Eric Yang
>
> When storing data into HBase using
> org.apache.pig.backend.hadoop.hbase.HBaseStorage, HBase timestamp field is
> stored with insertion time of the mapreduce job. It would be nice to have a
> way to populate timestamp from user data.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira