Zsolt Miskolczi created HIVE-28578:
--------------------------------------
Summary: Concurrency issue in updateTableColumnStatistics
Key: HIVE-28578
URL: https://issues.apache.org/jira/browse/HIVE-28578
Project: Hive
Issue Type: Bug
Security Level: Public (Viewable by anyone)
Components: Standalone Metastore
Reporter: Zsolt Miskolczi
Summary:
updateTableColumnStatistics can throw SQLIntegrityConstraintViolationException
during replication if HA is on and two different HMS instance gets the same
call but with different engine.
Workaround:
Update table column statistics in single threaded.
Details:
updateTableColumnStatistics has a relative long running transaction. In that
transaction, it validates the actual parameters, queries the metastore db
against the TABLE_PARAMS that are already stored and makes a decision based on
that. After this, it uses data nucleus to persist the new statistics.
>From the two HMS instances, one can save the column statistics. And the other
>cannot as the first instance already saved them.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)