Sankar Hariappan created HIVE-17367:
---------------------------------------

             Summary: IMPORT should overwrite the table if the dump has same 
state as table.
                 Key: HIVE-17367
                 URL: https://issues.apache.org/jira/browse/HIVE-17367
             Project: Hive
          Issue Type: Bug
          Components: HiveServer2, Import/Export, repl
    Affects Versions: 3.0.0
            Reporter: Sankar Hariappan
            Assignee: Sankar Hariappan
             Fix For: 3.0.0


Repl v1 creates a set of EXPORT/IMPORT commands to replicate modified data 
(based on events) across clusters.
For instance, suppose an insert generates 2 events:
ALTER_TABLE (ID: 10)
INSERT (ID: 11)
Each event generates its own set of EXPORT and IMPORT commands.
The ALTER_TABLE event generates a metadata-only export/import.
The INSERT event generates a metadata+data export/import.
Because Hive always dumps the latest copy of the table during export, it stamps 
the dump with the latest notification event ID as its state. So, in this 
example, the metadata import for the ALTER_TABLE event sets the current state 
of the table to 11.
Now, when we try to import the data dumped for the INSERT event, the import is 
a no-op because the table's current state (11) is equal to the dump state (11). 
As a result, the data is never replicated to the target cluster.
So, it is necessary to allow the table/partition to be overwritten if its 
current state equals the dump state.
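
The scenario above can be sketched as a small toy model. This is not Hive 
code: the function names, the event tuples and the starting table state (9) 
are assumptions made only for illustration. It contrasts a strict check 
(dump state must be newer than the table state) with the proposed check 
(dump state may be equal to the table state).

```python
# Toy model of the replication state check. NOT Hive code: every name
# below is hypothetical and exists only to illustrate the issue.

def should_import(table_state, dump_state, allow_equal):
    """Decide whether a dump may overwrite the target table."""
    if allow_equal:
        # Proposed behaviour: a dump with the same state still applies.
        return dump_state >= table_state
    # Strict behaviour: only a strictly newer dump applies.
    return dump_state > table_state


def replay(events, dump_state, allow_equal, initial_state=9):
    """Replay the per-event imports in order.

    events: list of (name, event_id, carries_data) tuples.
    dump_state: state stamped on every dump (the latest event ID,
                because export always dumps the latest table copy).
    initial_state: assumed table state on the target before replay.
    Returns (final_table_state, data_replicated).
    """
    table_state = initial_state
    data_replicated = False
    for _name, _event_id, carries_data in events:
        if should_import(table_state, dump_state, allow_equal):
            table_state = dump_state
            if carries_data:
                data_replicated = True
    return table_state, data_replicated


# The two events from the description; both dumps carry state 11.
EVENTS = [
    ("ALTER_TABLE", 10, False),  # metadata-only export/import
    ("INSERT", 11, True),        # metadata+data export/import
]

strict = replay(EVENTS, dump_state=11, allow_equal=False)
proposed = replay(EVENTS, dump_state=11, allow_equal=True)
```

Under these assumptions, the strict check leaves the table at state 11 with 
no data replicated, while allowing equality lets the INSERT import apply.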



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
