Re: Slave cannot be registered while masters keep switching to another one.

2015-01-28 Thread xiaokun
hi, I changed the quorum to 1. Slave can be displayed now!

Thanks!

2015-01-28 16:19 GMT+08:00 xiaokun xiaokun...@gmail.com:

 Thanks for your reply. I will try to modify quorum to 1.
 Here is log from server side. Attachment is added.
 I0128 03:15:36.608562 15350 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:37.552141 15346 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:38.479542 15345 network.hpp:424] ZooKeeper group memberships
 changed
 I0128 03:15:38.479799 15345 group.cpp:659] Trying to get
 '/mesos/log_replicas/002270' in ZooKeeper
 I0128 03:15:38.480613 15345 group.cpp:659] Trying to get
 '/mesos/log_replicas/002271' in ZooKeeper
 I0128 03:15:38.481050 15345 group.cpp:659] Trying to get
 '/mesos/log_replicas/002272' in ZooKeeper
 I0128 03:15:38.481679 15345 network.hpp:466] ZooKeeper group PIDs: {
 log-replica(1)@10.27.17.135:5050, log-replica(1)@10.27.16.214:5050 }
 I0128 03:15:38.621351 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:39.544558 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:40.072347 15343 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:41.025926 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:41.695303 15349 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:42.493906 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:43.086762 15343 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:43.831442 15346 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:44.787384 15343 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:45.527914 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:46.005728 15349 detector.cpp:138] Detected a new leader:
 (id='2272')
 I0128 03:15:46.005892 15349 group.cpp:659] Trying to get
 '/mesos/info_002272' in ZooKeeper
 I0128 03:15:46.006530 15349 detector.cpp:433] A new leading master (UPID=
 master@10.27.16.214:5050) is detected
 I0128 03:15:46.006624 15349 master.cpp:1263] The newly elected leader is
 master@10.27.16.214:5050 with id 20150128-031430-3591379722-5050-15326
 I0128 03:15:46.006664 15349 master.cpp:1276] Elected as the leading master!



Re: Slave cannot be registered while masters keep switching to another one.

2015-01-28 Thread xiaokun
Thanks for your reply. I will try to modify quorum to 1.
Here is log from server side. Attachment is added.
I0128 03:15:36.608562 15350 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:37.552141 15346 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:38.479542 15345 network.hpp:424] ZooKeeper group memberships
changed
I0128 03:15:38.479799 15345 group.cpp:659] Trying to get
'/mesos/log_replicas/002270' in ZooKeeper
I0128 03:15:38.480613 15345 group.cpp:659] Trying to get
'/mesos/log_replicas/002271' in ZooKeeper
I0128 03:15:38.481050 15345 group.cpp:659] Trying to get
'/mesos/log_replicas/002272' in ZooKeeper
I0128 03:15:38.481679 15345 network.hpp:466] ZooKeeper group PIDs: {
log-replica(1)@10.27.17.135:5050, log-replica(1)@10.27.16.214:5050 }
I0128 03:15:38.621351 15345 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:39.544558 15345 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:40.072347 15343 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:41.025926 15345 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:41.695303 15349 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:42.493906 15345 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:43.086762 15343 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:43.831442 15346 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:44.787384 15343 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:45.527914 15345 replica.cpp:638] Replica in VOTING status
received a broadcasted recover request
I0128 03:15:46.005728 15349 detector.cpp:138] Detected a new leader:
(id='2272')
I0128 03:15:46.005892 15349 group.cpp:659] Trying to get
'/mesos/info_002272' in ZooKeeper
I0128 03:15:46.006530 15349 detector.cpp:433] A new leading master (UPID=
master@10.27.16.214:5050) is detected
I0128 03:15:46.006624 15349 master.cpp:1263] The newly elected leader is
master@10.27.16.214:5050 with id 20150128-031430-3591379722-5050-15326
I0128 03:15:46.006664 15349 master.cpp:1276] Elected as the leading master!
Log file created at: 2015/01/28 03:14:30
Running on machine: ubuntu-1404-ci1
Log line format: [IWEF]mmdd hh:mm:ss.uu threadid file:line] msg
I0128 03:14:30.330451 15326 logging.cpp:172] INFO level logging started!
I0128 03:14:30.330847 15326 main.cpp:167] Build: 2015-01-09 02:25:56 by root
I0128 03:14:30.330864 15326 main.cpp:169] Version: 0.21.1
I0128 03:14:30.330873 15326 main.cpp:172] Git tag: 0.21.1
I0128 03:14:30.330881 15326 main.cpp:176] Git SHA: 
2ae1ba91e64f92ec71d327e10e6ba9e8ad5477e8
I0128 03:14:30.344133 15326 leveldb.cpp:176] Opened db in 13.010195ms
I0128 03:14:30.356261 15326 leveldb.cpp:183] Compacted db in 12.042571ms
I0128 03:14:30.356359 15326 leveldb.cpp:198] Created db iterator in 18305ns
I0128 03:14:30.356395 15326 leveldb.cpp:204] Seeked to beginning of db in 
16176ns
I0128 03:14:30.356683 15326 leveldb.cpp:273] Iterated through 3 keys in the db 
in 273935ns
I0128 03:14:30.356844 15326 replica.cpp:741] Replica recovered with log 
positions 3 - 4 with 0 holes and 0 unlearned
I0128 03:14:30.358438 15344 log.cpp:238] Attempting to join replica to 
ZooKeeper group
I0128 03:14:30.359638 15344 recover.cpp:437] Starting replica recovery
I0128 03:14:30.360090 15326 main.cpp:292] Starting Mesos master
I0128 03:14:30.360147 15344 recover.cpp:463] Replica is in VOTING status
I0128 03:14:30.360304 15344 recover.cpp:452] Recover process terminated
I0128 03:14:30.361130 15326 master.cpp:318] Master 
20150128-031430-3591379722-5050-15326 (10.27.16.214) started on 
10.27.16.214:5050
I0128 03:14:30.361229 15326 master.cpp:366] Master allowing unauthenticated 
frameworks to register
I0128 03:14:30.361258 15326 master.cpp:371] Master allowing unauthenticated 
slaves to register
I0128 03:14:30.364174 15346 master.cpp:1202] Successfully attached file 
'/var/log/mesos/mesos-master.INFO'
I0128 03:14:30.364243 15346 contender.cpp:131] Joining the ZK group
I0128 03:14:30.539114 15349 replica.cpp:638] Replica in VOTING status received 
a broadcasted recover request
I0128 03:14:30.568620 15347 group.cpp:313] Group process 
(group(1)@10.27.16.214:5050) connected to ZooKeeper
I0128 03:14:30.568691 15347 group.cpp:790] Syncing group operations: queue size 
(joins, cancels, datas) = (0, 0, 0)
I0128 03:14:30.568738 15347 group.cpp:385] Trying to create path 
'/mesos/log_replicas' in ZooKeeper
I0128 03:14:30.568804 15349 group.cpp:313] Group process 
(group(3)@10.27.16.214:5050) connected to ZooKeeper
I0128 03:14:30.568841 15349 group.cpp:790] Syncing group operations: queue size 
(joins, cancels, datas

Re: Slave cannot be registered while masters keep switching to another one.

2015-01-28 Thread Dick Davies
Be careful, there's now nothing stopping those 2 masters from forming
2 clusters.
Add a third asap.



On 28 January 2015 at 08:25, xiaokun xiaokun...@gmail.com wrote:
 hi, I changed the quorum to 1. Slave can be displayed now!

 Thanks!

 2015-01-28 16:19 GMT+08:00 xiaokun xiaokun...@gmail.com:

 Thanks for your reply. I will try to modify quorum to 1.
 Here is log from server side. Attachment is added.
 I0128 03:15:36.608562 15350 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:37.552141 15346 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:38.479542 15345 network.hpp:424] ZooKeeper group memberships
 changed
 I0128 03:15:38.479799 15345 group.cpp:659] Trying to get
 '/mesos/log_replicas/002270' in ZooKeeper
 I0128 03:15:38.480613 15345 group.cpp:659] Trying to get
 '/mesos/log_replicas/002271' in ZooKeeper
 I0128 03:15:38.481050 15345 group.cpp:659] Trying to get
 '/mesos/log_replicas/002272' in ZooKeeper
 I0128 03:15:38.481679 15345 network.hpp:466] ZooKeeper group PIDs: {
 log-replica(1)@10.27.17.135:5050, log-replica(1)@10.27.16.214:5050 }
 I0128 03:15:38.621351 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:39.544558 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:40.072347 15343 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:41.025926 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:41.695303 15349 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:42.493906 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:43.086762 15343 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:43.831442 15346 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:44.787384 15343 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:45.527914 15345 replica.cpp:638] Replica in VOTING status
 received a broadcasted recover request
 I0128 03:15:46.005728 15349 detector.cpp:138] Detected a new leader:
 (id='2272')
 I0128 03:15:46.005892 15349 group.cpp:659] Trying to get
 '/mesos/info_002272' in ZooKeeper
 I0128 03:15:46.006530 15349 detector.cpp:433] A new leading master
 (UPID=master@10.27.16.214:5050) is detected
 I0128 03:15:46.006624 15349 master.cpp:1263] The newly elected leader is
 master@10.27.16.214:5050 with id 20150128-031430-3591379722-5050-15326
 I0128 03:15:46.006664 15349 master.cpp:1276] Elected as the leading
 master!