Adar Dembo created KUDU-2718:
--------------------------------

             Summary: master_failover-itest when HMS is enabled is flaky
                 Key: KUDU-2718
                 URL: https://issues.apache.org/jira/browse/KUDU-2718
             Project: Kudu
          Issue Type: Bug
          Components: test
    Affects Versions: 1.9.0
            Reporter: Adar Dembo
         Attachments: master_failover-itest.1.txt

This was a failure in 
HmsConfigurations/MasterFailoverTest.TestDeleteTableSync/1, where GetParam() = 
2, but it's likely possible in every multi-master test with HMS integration 
enabled.

It looks like there was a leader master election at the time that the client 
tried to create the table being tested. The master managed to create the table 
in HMS, but then there was a failure replicating in Raft because another master 
was elected leader. So the client retried the request on a different master, 
but the HMS piece of CreateTable failed because the HMS already knew about the 
table.

Thing is, there's code to roll back the HMS table creation if this happens, so 
I don't see why the retried CreateTable failed at the HMS with "table already 
exists". Perhaps this is a case where even though we succeeded in dropping the 
table from HMS, it doesn't reflect that immediately?

I'm attaching the full log.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to