Shubham Sharma created HAWQ-1549:
------------------------------------

             Summary:  Re-syncing standby fails even when stop mode is fast
                 Key: HAWQ-1549
                 URL: https://issues.apache.org/jira/browse/HAWQ-1549
             Project: Apache HAWQ
          Issue Type: Bug
          Components: Command Line Tools, Standby master
            Reporter: Shubham Sharma
            Assignee: Radar Lei


Recently observed a behaviour while re-syncing standby from hawq command line.

Here are the reproduction steps -

1 - Open a client connection to hawq using psql
2 - From a different terminal run command - hawq init standby -n -v -M fast
3 - Standby resync fails with error

{code}
20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-There are other 
connections to this instance, shutdown mode smart aborted

20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-Either remove 
connections, or use 'hawq stop master -M fast' or 'hawq stop master -M 
immediate'

20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-See hawq stop --help 
for all options

20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[ERROR]:-Active connections. 
Aborting shutdown...

20171113:03:49:21:158143 hawq_init:hdp3:gpadmin-[ERROR]:-Stop hawq cluster 
failed, exit
{code}

4 - When -M (stop mode) is passed it should terminate existing client 
connections. 

The source of this issue appears to be tools/bin/hawq_ctl method 
_resync_standby. When this is called the command formation does not include 
stop_mode options as passed to the arguments.

{code}
 def _resync_standby(self):
        logger.info("Re-sync standby")
        cmd = "%s; hawq stop master -a;" % source_hawq_env
        check_return_code(local_ssh(cmd, logger), logger, "Stop hawq cluster 
failed, exit")
        ......
        ......
{code}

I can start this and submit a PR when changes are done.





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to