Zhihua Deng created HIVE-29165:
----------------------------------

             Summary: PartColNameInfo could introduce high hash collision due 
to the wide table
                 Key: HIVE-29165
                 URL: https://issues.apache.org/jira/browse/HIVE-29165
             Project: Hive
          Issue Type: Improvement
          Components: Standalone Metastore
            Reporter: Zhihua Deng
            Assignee: Zhihua Deng


In {{{}DirectSqlUpdatePart{}}}, the {{PartColNameInfo}} acts as a map key, 
referring to those statistics to be updated or inserted. If the current table 
has lots of columns, say 1000, then for each {{{}PartColNameInfo{}}}, it’s 
assumed to hash into the same map buckets more than 1000 times. If the table 
has thousands of partitions, locating or inserting the {{PartColNameInfo}} 
could be very slow.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to