[ 
https://issues.apache.org/jira/browse/FLINK-10245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16615552#comment-16615552
 ] 

Hequn Cheng commented on FLINK-10245:
-------------------------------------

[~dangdangdang] Yes, you are right. We can't simply deduplicate rows by 
rowkey if nullable fields are allowed. In that case, we should merge the rows 
in the buffer: when merging two rows, take the new field if it is not null, 
otherwise keep the old field. 
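
To make the merge rule concrete, here is a minimal sketch. It assumes rows are 
plain Object[] arrays of equal length; the class name RowMerger and the method 
merge are hypothetical and are not part of Flink or the proposed sink.

```java
import java.util.Arrays;

public class RowMerger {

    /**
     * Merges two buffered rows that share the same rowkey.
     * For each field, the newer value wins unless it is null,
     * in which case the older value is kept.
     * (Hypothetical helper, not a Flink API.)
     */
    static Object[] merge(Object[] oldRow, Object[] newRow) {
        if (oldRow.length != newRow.length) {
            throw new IllegalArgumentException("rows must have the same arity");
        }
        Object[] merged = new Object[oldRow.length];
        for (int i = 0; i < merged.length; i++) {
            merged[i] = newRow[i] != null ? newRow[i] : oldRow[i];
        }
        return merged;
    }

    public static void main(String[] args) {
        Object[] older = {"row1", "a", null};
        Object[] newer = {"row1", null, "b"};
        // prints [row1, a, b]
        System.out.println(Arrays.toString(merge(older, newer)));
    }
}
```

Note that with this rule a null in the newer row can never overwrite an 
existing value.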

However, this causes problems for an UpsertTableSink, which contains a key 
definition. In that case, we should stick to the latest value even if it is 
null. We may need to delete before writing to guarantee this semantic, since 
HBase doesn't support writing null values. 
We can discuss the TableSink further in the TableSink 
task (FLINK-10277). 

What do you think?

> Add DataStream HBase Sink
> -------------------------
>
>                 Key: FLINK-10245
>                 URL: https://issues.apache.org/jira/browse/FLINK-10245
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Streaming Connectors
>            Reporter: Shimin Yang
>            Assignee: Shimin Yang
>            Priority: Major
>              Labels: pull-request-available
>
> Design documentation: 
> [https://docs.google.com/document/d/1of0cYd73CtKGPt-UL3WVFTTBsVEre-TNRzoAt5u2PdQ/edit?usp=sharing]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
