Hi,

Here's (part of) the output after killing sheep on node 1 and restarting the sheep daemon:

root@kvm01:~# sheep -f /data/sheep/
sheep: jrnl_recover(2221) Openning the directory /data/sheep//journal/00000003/.
sheep: set_addr(1595) addr = 172.16.1.1, port = 7000
sheep: main(144) Sheepdog daemon (version 0.2.3) started
sheep: send_join_request(1048) 16847020 5144
sheep: update_cluster_info(599) system status = 0, epoch = 4
sheep: get_vdi_bitmap_from_all(518) get the vdi bitmap 1 172.16.1.2
sheep: recover_one(1340) 0 1151, 721ef10000037a
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef10000037a, No such file or directory
sheep: recover_one(1340) 1 1151, 721ef10000020a
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef10000020a, No such file or directory
sheep: recover_one(1340) 2 1151, 721ef100000207
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000207, No such file or directory
sheep: recover_one(1340) 3 1151, 721ef10000029c
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef10000029c, No such file or directory
sheep: recover_one(1340) 4 1151, 721ef100000292
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000292, No such file or directory
sheep: recover_one(1340) 5 1151, 721ef1000003c2
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef1000003c2, No such file or directory
sheep: recover_one(1340) 6 1151, 721ef1000000a6
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef1000000a6, No such file or directory
sheep: recover_one(1340) 7 1151, 721ef100000128
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000128, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef100000128
sheep: recover_one(1340) 8 1151, 721ef100000339
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000339, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef100000339
sheep: recover_one(1340) 9 1151, 721ef1000001b0
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef1000001b0, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef1000001b0
sheep: recover_one(1340) 10 1151, 721ef100000269
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000269, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef100000269
sheep: recover_one(1340) 11 1151, 721ef100000214
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000214, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef100000214
sheep: recover_one(1340) 12 1151, 721ef1000002f1
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef1000002f1, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef1000002f1
sheep: recover_one(1340) 13 1151, 721ef1000001d3
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef1000001d3, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef1000001d3
sheep: recover_one(1340) 14 1151, 721ef100000175
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000175, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef100000175
sheep: recover_one(1340) 15 1151, 721ef100000433
sheep: ob_open(491) failed to open /data/sheep//obj/00000005/00721ef100000433, No such file or directory
sheep: __recover_one(1181) cannot find target node, 721ef100000433

It keeps printing these messages forever, I think.

On node 2, collie vdi list currently hangs. When I restart sheep on node 2 it works again; the same goes for node 1.

This looks like a different situation than yesterday. I'll see if I can reproduce that.
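
To check whether the same objects keep coming up as the messages repeat, the output can be tallied per object ID. What follows is only a rough sketch: it is not part of sheepdog, the function name summarize_recovery_log is hypothetical, and the two regular expressions are assumptions inferred from the log lines above rather than from any documented log format.

```python
import re
from collections import Counter

# Assumed patterns, inferred from the pasted log lines (not a documented format).
OPEN_FAIL = re.compile(
    r"ob_open\(\d+\) failed to open\s+\S*/([0-9a-f]+), No such file or directory"
)
NO_TARGET = re.compile(
    r"__recover_one\(\d+\) cannot find target node, ([0-9a-f]+)"
)


def summarize_recovery_log(log_text):
    """Return two Counters keyed by object ID (hex, no leading zeros).

    The first counts "failed to open" messages, the second counts
    "cannot find target node" messages.
    """
    def normalize(hex_id):
        # File names are zero-padded (00721ef1...), log IDs are not (721ef1...).
        return format(int(hex_id, 16), "x")

    open_fail = Counter(normalize(m) for m in OPEN_FAIL.findall(log_text))
    no_target = Counter(normalize(m) for m in NO_TARGET.findall(log_text))
    return open_fail, no_target


# Three lines copied from the output above, used as sample input.
sample = (
    "sheep: recover_one(1340) 7 1151, 721ef100000128\n"
    "sheep: ob_open(491) failed to open "
    "/data/sheep//obj/00000005/00721ef100000128, No such file or directory\n"
    "sheep: __recover_one(1181) cannot find target node, 721ef100000128\n"
)

open_fail, no_target = summarize_recovery_log(sample)
for oid in sorted(open_fail):
    print(oid, "open failures:", open_fail[oid],
          "no target node:", no_target.get(oid, 0))
```

If the counts for a given object ID keep growing while sheep runs, that would suggest recovery is retrying the same objects rather than making progress.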
-- sheepdog mailing list [email protected] http://lists.wpkg.org/mailman/listinfo/sheepdog
