Re: [Slony1-general] Testing failover

Filip Rembiałkowski Tue, 29 Sep 2009 09:29:12 -0700

2009/9/29 Lawrence Giam <[email protected]>

>  Hi All,
>
>
>
> I am testing failover with the following setup:
>
> N1 - Master
>
> N2 - Slave
>
>
>
> To test failover, I issue this command: slonik_failover 1 2 | slonik
>
> -------------------------------------------
>
> INFO: calling failedNode(1,2) on node 1
>
> <stdin>:4: NOTICE:  failedNode: set 1 has no other direct receivers - move
> now
>
> INFO: Waiting for slon engines to restart
>
> INFO: Node with highest sync for set 1 is 2
>
> <stdin>:10: Replication sets originating on 1 failed over to 2
>
> -------------------------------------------
>
>
>
> Now I try to remove N1 from the setup with this command:
> slonik_drop_node 1 | slonik
>
> <stdin>:4: Error: Node ID and event node cannot be identical
>
>
>
> I also try this: slonik_drop_node 2 | slonik
>
> <stdin>:10: dropped node 2 cluster
>
>
>


yikes ;-)


> The problem is that when I check the slon log on node2, it reports:
>
> ----------------------
>
> 2009-09-29 13:49:19 SGT DEBUG1 slon: restart of worker
>
> 2009-09-29 13:49:19 SGT CONFIG main: slon version 1.2.16 starting up
>
> 2009-09-29 13:49:19 SGT DEBUG2 slon: watchdog process started
>
> 2009-09-29 13:49:19 SGT DEBUG2 slon: watchdog ready - pid = 28749
>
> 2009-09-29 13:49:19 SGT DEBUG2 slon: worker process created - pid = 29314
>
> 2009-09-29 13:49:19 SGT ERROR  cannot get sl_local_node_id - ERROR:
> schema "_testrepl" does not exist
>
> 2009-09-29 13:49:19 SGT FATAL  main: Node is not initialized properly -
> sleep 10s
>
> 2009-09-29 13:49:29 SGT DEBUG2 slon_retry() from pid=29314
>
> 2009-09-29 13:49:29 SGT DEBUG1 slon: retry requested
>
> 2009-09-29 13:49:29 SGT DEBUG2 slon: notify worker process to shutdown
>
> 2009-09-29 13:49:29 SGT DEBUG2 slon: child terminated status: 0; pid:
> 29314, current worker pid: 29314
>
> ----------------------
>
>
>
> Also checking in PgAdmin, it reports the cluster database on node2 was
> removed. Did I issue the wrong command? How do I remove node1 from the
> cluster after handing the origin to node2 with the failover command?
>
>
>

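simple - you have dropped N2 (see "yikes" above), which had just become the
origin, so you have effectively trashed the cluster. Dropping a node removes
its replication schema, which is why slon on node2 now complains that schema
"_testrepl" does not exist.

Your first attempt failed because the generated script used node 1 as both
the node to drop and the event node. After a failover, the drop of node 1
has to be submitted with the surviving node (2) as the event node.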
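
For what it's worth, a rough sketch of dropping the failed node by hand,
in the same "generate a script and pipe it to slonik" style as the altperl
tools. The function name drop_failed_node_script and the CLUSTER/CONNINFO
variables are made up for illustration, the conninfo value is a placeholder,
and the cluster name testrepl is only guessed from the "_testrepl" schema in
your log. The preamble may also need an admin conninfo line for node 1,
depending on your setup, so treat this as a starting point and not a tested
recipe:

```shell
#!/bin/sh
# Sketch only: print a slonik script that drops a failed node, submitting
# the event on a surviving node. Nothing here contacts a database.
drop_failed_node_script() {
    failed_node=$1   # node to remove, e.g. the old origin (1)
    event_node=$2    # surviving node that processes the event (2)

    # slonik rejects identical IDs ("Node ID and event node cannot be
    # identical"), so refuse to generate such a script at all.
    if [ "$failed_node" = "$event_node" ]; then
        echo "failed node and event node must differ" >&2
        return 1
    fi

    # CLUSTER and CONNINFO are hypothetical overrides; the defaults are
    # placeholders and must be replaced with real values.
    cat <<EOF
cluster name = ${CLUSTER:-testrepl};
node ${event_node} admin conninfo = '${CONNINFO:-dbname=testdb host=node2 user=slony}';
drop node (id = ${failed_node}, event node = ${event_node});
EOF
}
```

Check the generated text first (drop_failed_node_script 1 2), and only then
run it for real with: drop_failed_node_script 1 2 | slonik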


-- 
Filip Rembiałkowski
JID,mailto:[email protected]
http://filip.rembialkowski.net/

_______________________________________________
Slony1-general mailing list
[email protected]
http://lists.slony.info/mailman/listinfo/slony1-general
