At Tue, 14 Jun 2011 22:06:59 +0200,
krimson wrote:
>
> I need to give a little more info on the cluster join fail problem:
>
> node 1:
> # sheep -f /data/sheep
> sheep: jrnl_recover(2221) Openning the directory
> /data/sheep/journal/00000003/.
> sheep: set_addr(1595) addr = 172.16.1.1, port = 7000
> sheep: main(144) Sheepdog daemon (version 0.2.3) started
> sheep: get_cluster_status(408) sheepdog is waiting with newer epoch, 1 3
> 172.16.1.2:7000 (when I start sheep on node 2)
>
> node 2:
> # sheep -f /data/sheep
> sheep: jrnl_recover(2221) Openning the directory
> /data/sheep/journal/00000001/.
> sheep: jrnl_recover(2226) start jrnl_recovery.
> sheep: jrnl_recover(2267) end jrnl_recovery.
> sheep: set_addr(1595) addr = 172.16.1.2, port = 7000
> sheep: main(144) Sheepdog daemon (version 0.2.3) started
> sheep: send_join_request(1048) 33624236 17428
> sheep: update_cluster_info(568) failed to join sheepdog, 66

Could you apply the patch I sent a few minutes ago and send me the
output of the following commands? It would really help my debugging.

  $ collie cluster info -a 172.16.1.1
  $ collie cluster info -a 172.16.1.2

The patch is also in my development repository:

  git://github.com/kazum/sheepdog.git

Thanks,

Kazutaka
--
sheepdog mailing list
[email protected]
http://lists.wpkg.org/mailman/listinfo/sheepdog
