[ 
https://issues.apache.org/jira/browse/HIVE-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14703879#comment-14703879
 ] 

Eugene Koifman commented on HIVE-9938:
--------------------------------------

The infrastructure for this is in place.  TxnHandler.isRetryable() needs to 
have a clause added to check for this message/condition.
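
As a rough illustration of what such a clause might look like (a sketch 
only, not the actual TxnHandler code): the class and method below are 
hypothetical stand-ins, and the check assumes the failure surfaces either as 
a SQLException with SQLSTATE 40001 (PostgreSQL's serialization_failure code) 
or as the "could not serialize access" message text somewhere in the cause 
chain.

```java
import java.sql.SQLException;

// Hypothetical stand-in for the kind of check isRetryable() could perform;
// this is not Hive's TxnHandler implementation.
public class RetryableCheckSketch {
    // SQLSTATE that PostgreSQL reports for serialization failures.
    static final String SERIALIZATION_FAILURE = "40001";
    // Message fragment taken from the error quoted in this issue.
    static final String SERIALIZE_MESSAGE = "could not serialize access";
    // Bound on how far the cause chain is walked (assumed value).
    static final int MAX_DEPTH = 16;

    static boolean isRetryable(Throwable t) {
        Throwable cur = t;
        for (int depth = 0; cur != null && depth < MAX_DEPTH; depth++) {
            // Prefer the SQLSTATE when the driver exception is available.
            if (cur instanceof SQLException
                    && SERIALIZATION_FAILURE.equals(((SQLException) cur).getSQLState())) {
                return true;
            }
            // Fall back to the message text, e.g. when the error was wrapped.
            String msg = cur.getMessage();
            if (msg != null && msg.contains(SERIALIZE_MESSAGE)) {
                return true;
            }
            cur = cur.getCause();
        }
        return false;
    }
}
```

Matching on SQLSTATE is less brittle than matching on message text, but the 
message check covers the case where the original exception has already been 
flattened into a MetaException string, as in the log quoted below.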

> Add retry logic to DbTxnMgr instead of aborting transactions.
> -------------------------------------------------------------
>
>                 Key: HIVE-9938
>                 URL: https://issues.apache.org/jira/browse/HIVE-9938
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 0.14.0
>            Reporter: bharath v
>
> Sometimes parallel updates using DbTxnManager result in the following error trace:
> {noformat}
> 5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver>
> 5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore
> org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore
>     at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100)
>     at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194)
> {noformat}
> Looking at the Postgres logs internally, we see:
> {noformat}
> 2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update
> {noformat}
> Ideally, we should add retry logic so that the failed transaction is retried instead of aborted.


