Been testing with the 1.7.4 snapshot, and I still can't get an orientdb 
cluster to properly align itself after a node is removed and added back. 
 I've tried on windows, on ubuntu, and ubuntu as root just to be sure. 
 I've tried adjust parameters in hazelcast.xml and 
default-distributed-db-config.json, still nothing.  If i completely erase 
the db and let it recover, it generally will work.  However, it will not 
properly sync if a record has been altered while the other node was down.

Here is the log file of what happens (this was 1.7.4 snapshot, on ubuntu as 
root):

2014-06-13 12:02:07:505 INFO Loading configuration from: 
/home/test/orientdb/orientdb2/config/orientdb-dserver-config.xml... 
[OServerConfigurationLoaderXml]
2014-06-13 12:02:07:913 INFO OrientDB Server v1.7-SNAPSHOT (build 
UNKNOWN@r; 2014-06-12 18:25:56+0200) is starting up... [OServer]
2014-06-13 12:02:07:926 INFO Databases directory: 
/home/test/orientdb/orientdb2/databases [OServer]
2014-06-13 12:02:08:001 INFO Port 0.0.0.0:2424 busy, trying the next 
available... [OServerNetworkListener]
2014-06-13 12:02:08:002 INFO Listening binary connections on 0.0.0.0:2425 
(protocol v.21, socket=default) [OServerNetworkListener]
2014-06-13 12:02:08:002 INFO Port 0.0.0.0:2480 busy, trying the next 
available... [OServerNetworkListener]
2014-06-13 12:02:08:003 INFO Listening http connections on 0.0.0.0:2481 
(protocol v.10, socket=default) [OServerNetworkListener]
2014-06-13 12:02:08:015 INFO Installing dynamic plugin 'studio-1.7.zip'... 
[OServerPluginManager]
2014-06-13 12:02:08:146 INFO Installing GREMLIN language v.2.5.0 - 
graph.pool.max=50 [OGraphServerHandler]
2014-06-13 12:02:08:195 INFO Starting distributed server 
'node1402673455127'... [OHazelcastPlugin]
2014-06-13 12:02:08:245 INFO Configuring Hazelcast from 
'/home/test/orientdb/orientdb2/config/hazelcast.xml'. [FileSystemXmlConfig]
2014-06-13 12:02:08:591 INFO null [orientdb] [3.2.1] Prefer IPv4 stack is 
true. [DefaultAddressPicker]
2014-06-13 12:02:08:623 INFO null [orientdb] [3.2.1] Picked 
Address[192.168.1.10]:2435, using socket 
ServerSocket[addr=/0:0:0:0:0:0:0:0,localport=2435], bind any local is true 
[DefaultAddressPicker]
2014-06-13 12:02:08:775 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 
Hazelcast Community Edition 3.2.1 (20140428) starting at 
Address[192.168.1.10]:2435 [system]
2014-06-13 12:02:08:775 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 
Copyright (C) 2008-2014 Hazelcast.com [system]
2014-06-13 12:02:08:784 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 
Creating MulticastJoiner [Node]
2014-06-13 12:02:08:810 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 
Address[192.168.1.10]:2435 is STARTING [LifecycleService]
2014-06-13 12:02:09:010 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 
Connecting to /192.168.1.10:2434, timeout: 0, bind-any: true 
[SocketConnector]
2014-06-13 12:02:09:028 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 49736 
accepted socket connection from /192.168.1.10:2434 [TcpIpConnectionManager]
2014-06-13 12:02:14:550 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 

Members [2] {
Member [192.168.1.10]:2434
Member [192.168.1.10]:2435 this
}
 [ClusterService]
2014-06-13 12:02:16:100 INFO [192.168.1.10]:2435 [orientdb] [3.2.1] 
Address[192.168.1.10]:2435 is STARTED [LifecycleService]
2014-06-13 12:02:16:117 INFO [node1402673455127] found no previous messages 
in queue orientdb.node.node1402673455127.response 
[OHazelcastDistributedMessageService]
2014-06-13 12:02:16:296 WARN [node1402673455127] opening database 
'testdb'... [OHazelcastPlugin]
2014-06-13 12:02:16:302 INFO [node1402673455127] loaded database 
configuration from active cluster [OHazelcastPlugin]
2014-06-13 12:02:16:354 INFO updated distributed configuration for 
database: testdb:
----------
{
  "version":2,
  "autoDeploy":true,
  "hotAlignment":true,
  "readQuorum":1,
  "writeQuorum":2,
  "failureAvailableNodesLessQuorum":false,
  "readYourWrites":true,"clusters":{
    "internal":null,
    "index":null,
    "*":{
  "servers":["<NEW_NODE>","node1402673438702","node1402673455127"]
}
    }
}
---------- [OHazelcastPlugin]
2014-06-13 12:02:16:375 WARN [node1402673455127] found 1 previous messages 
in queue orientdb.node.node1402673455127.testdb.request, aligning the 
database... [OHazelcastDistributedMessageService]
2014-06-13 12:02:18:854 WARN Storage testdb was not closed properly. Will 
try to restore from write ahead log. [OLocalPaginatedStorage]
2014-06-13 12:02:18:854 SEVE Restore is not possible because write ahead 
log is empty. [OLocalPaginatedStorage]
2014-06-13 12:02:18:927 INFO Storage data restore was completed 
[OLocalPaginatedStorage]
2014-06-13 12:02:22:321 WARN segment file 'database.ocf' was not closed 
correctly last time [OSingleFileSegment]
2014-06-13 12:02:22:334 WARN Can not restore 1 WAL master record for 
storage testdb [OWriteAheadLog][node1402673455127]<-[node1402673438702] 
error on reading distributed request: record_update(#9:4 v.6)
Error on creation of shared resource
-> 
com.orientechnologies.common.concur.resource.OSharedContainerImpl.getResource(OSharedContainerImpl.java:55)
-> 
com.orientechnologies.orient.server.distributed.ODistributedStorage.getResource(ODistributedStorage.java:516)
-> 
com.orientechnologies.orient.core.metadata.OMetadataDefault.init(OMetadataDefault.java:110)
-> 
com.orientechnologies.orient.core.metadata.OMetadataDefault.load(OMetadataDefault.java:68)
-> 
com.orientechnologies.orient.core.db.record.ODatabaseRecordAbstract.open(ODatabaseRecordAbstract.java:291)
-> 
com.orientechnologies.orient.core.db.ODatabaseWrapperAbstract.open(ODatabaseWrapperAbstract.java:49)
-> 
com.orientechnologies.orient.server.OServer.openDatabase(OServer.java:557)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase.initDatabaseInstance(OHazelcastDistributedDatabase.java:281)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase.onMessage(OHazelcastDistributedDatabase.java:471)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase$1.run(OHazelcastDistributedDatabase.java:243)
-> java.lang.Thread.run(Thread.java:745)
The record with id '#0:1' not found
-> 
com.orientechnologies.common.concur.resource.OSharedContainerImpl.getResource(OSharedContainerImpl.java:55)
-> 
com.orientechnologies.orient.server.distributed.ODistributedStorage.getResource(ODistributedStorage.java:516)
-> 
com.orientechnologies.orient.core.metadata.OMetadataDefault.init(OMetadataDefault.java:110)
-> 
com.orientechnologies.orient.core.metadata.OMetadataDefault.load(OMetadataDefault.java:68)
-> 
com.orientechnologies.orient.core.db.record.ODatabaseRecordAbstract.open(ODatabaseRecordAbstract.java:291)
-> 
com.orientechnologies.orient.core.db.ODatabaseWrapperAbstract.open(ODatabaseWrapperAbstract.java:49)
-> 
com.orientechnologies.orient.server.OServer.openDatabase(OServer.java:557)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase.initDatabaseInstance(OHazelcastDistributedDatabase.java:281)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase.onMessage(OHazelcastDistributedDatabase.java:471)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase$1.run(OHazelcastDistributedDatabase.java:243)
-> java.lang.Thread.run(Thread.java:745)
Storage testdb is not opened.
-> 
com.orientechnologies.common.concur.resource.OSharedContainerImpl.getResource(OSharedContainerImpl.java:55)
-> 
com.orientechnologies.orient.server.distributed.ODistributedStorage.getResource(ODistributedStorage.java:516)
-> 
com.orientechnologies.orient.core.metadata.OMetadataDefault.init(OMetadataDefault.java:110)
-> 
com.orientechnologies.orient.core.metadata.OMetadataDefault.load(OMetadataDefault.java:68)
-> 
com.orientechnologies.orient.core.db.record.ODatabaseRecordAbstract.open(ODatabaseRecordAbstract.java:291)
-> 
com.orientechnologies.orient.core.db.ODatabaseWrapperAbstract.open(ODatabaseWrapperAbstract.java:49)
-> 
com.orientechnologies.orient.server.OServer.openDatabase(OServer.java:557)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase.initDatabaseInstance(OHazelcastDistributedDatabase.java:281)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase.onMessage(OHazelcastDistributedDatabase.java:471)
-> 
com.orientechnologies.orient.server.hazelcast.OHazelcastDistributedDatabase$1.run(OHazelcastDistributedDatabase.java:243)
-> java.lang.Thread.run(Thread.java:745)
2014-06-13 12:02:22:850 INFO [node1402673455127] executed all pending tasks 
in queue, set restoringMessages=false and database 'testdb' as online... 
[OHazelcastDistributedDatabase$1]
2014-06-13 12:02:43:795 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 1/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:44:096 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 2/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:44:397 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 3/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:44:699 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 4/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:45:001 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 5/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:45:307 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 6/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:45:608 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 7/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:45:909 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 8/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:46:210 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 9/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:46:511 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 10/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:46:811 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 11/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:47:112 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 12/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:47:412 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 13/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:47:713 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 14/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:48:016 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 15/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:48:318 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 16/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:48:619 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 17/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:48:920 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 18/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:49:221 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 19/20 [ONetworkProtocolHttpDb]
2014-06-13 12:02:49:522 INFO Node is not online yet (status=STARTING), 
blocking the command until it's online 20/20 [ONetworkProtocolHttpDb]

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"OrientDB" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to