Zsolt Miskolczi created HIVE-28578:
--------------------------------------

             Summary: Concurrency issue in updateTableColumnStatistics
                 Key: HIVE-28578
                 URL: https://issues.apache.org/jira/browse/HIVE-28578
             Project: Hive
          Issue Type: Bug
      Security Level: Public (Viewable by anyone)
          Components: Standalone Metastore
            Reporter: Zsolt Miskolczi


Summary: 
updateTableColumnStatistics can throw SQLIntegrityConstraintViolationException 
during replication if HA is on and two different HMS instance gets the same 
call but with different engine. 
 
Workaround: 
Update table column statistics in single threaded. 
 
Details: 
updateTableColumnStatistics has a relative long running transaction. In that 
transaction, it validates the actual parameters, queries the metastore db 
against the TABLE_PARAMS that are already stored and makes a decision based on 
that. After this, it uses data nucleus to persist the new statistics. 
>From the two HMS instances, one can save the column statistics. And the other 
>cannot as the first instance already saved them. 
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to