Alfredo Deza writes:

> Have you ensured that either there is no firewall up or that the ports
> that the monitors need to communicate between each other are open?


Yes, I am sure - the nodes are connected over one single switch, and 
no firewall is active.

 
> If that is not the problem then the next thing I would do is to
> increase the verbosity for the monitors, restart them
> and look at the logs.

My current configuration is: 
(ceph.conf)

i.e. I tried whether or not I used the wrong network intercaces. 

mon_host = 10.112.3.1,10.112.3.2,10.112.3.3
mon_initial_members = hvrrzceph1, hvrrzceph2, hvrrzceph3

and the networks are: 

hvrrzceph1:~/my-cluster # grep hvrrzceph1 /etc/hosts
10.1.1.239      hvrrzceph1-admin
10.111.3.1      hvrrzceph1-storage
10.112.3.1      hvrrzceph1
 (two more nodes in the same way)

Someone ist listening on port 6789:
hvrrzceph1:~/my-cluster # grep 6789 /etc/services
smc-https          6789/tcp     # SMC-HTTPS  [Ratnadeep_Bhattachar]
smc-https          6789/udp     # SMC-HTTPS  [Ratnadeep_Bhattachar]
hvrrzceph1:~/my-cluster # netstat -a | grep smc-https
tcp        0      0 hvrrzceph1-st:smc-https *:*           LISTEN
hvrrzceph1:~/my-cluster #

The monitor log says:

2014-04-04 15:11:00.673595 7fbb264f2780  0 ceph version 0.72.2 
(a913ded2ff138aef
b8cb84d347d72164099cfd60), process ceph-mon, pid 9354
2014-04-04 15:11:01.017924 7f57b6e0c780  0 ceph version 0.72.2 
(a913ded2ff138aef
b8cb84d347d72164099cfd60), process ceph-mon, pid 9455
2014-04-04 15:11:01.027519 7f57b6e0c780  0 mon.hvrrzceph1 does not 
exist in monm
ap, will attempt to join an existing cluster
2014-04-04 15:11:01.027928 7f57b6e0c780  0 using public_addr 
10.111.3.1:0/0 -> 1
0.111.3.1:6789/0
2014-04-04 15:11:01.030407 7f57b6e0c780  1 mon.hvrrzceph1@-1(probing) 
e0 preinit
 fsid 8dba6b51-9380-4d32-9393-520dc141a8b6
2014-04-04 15:11:01.030645 7f57b6e0c780  1 mon.hvrrzceph1@-1(probing) 
e0  initia
l_members hvrrzceph1,hvrrzceph2,hvrrzceph3, filtering seed monmap
2014-04-04 15:11:01.031918 7f57b6e0c780  0 mon.hvrrzceph1@-1(probing) 
e0  my ran
k is now 0 (was -1)
2014-04-04 15:11:01.032909 7f57b04cf700  0 -- 10.111.3.1:6789/0 >> 
0.0.0.0:0/2 p
ipe(0x15fda00 sd=21 :0 s=1 pgs=0 cs=0 l=0 c=0x15bec60).fault
2014-04-04 15:11:01.033772 7f57b03ce700  0 -- 10.111.3.1:6789/0 >> 
10.112.3.1:67
89/0 pipe(0x1607780 sd=24 :0 s=1 pgs=0 cs=0 l=0 c=0x15beb00).fault
2014-04-04 15:11:01.034079 7f57b6e0a700  0 -- 10.111.3.1:6789/0 >> 
0.0.0.0:0/1 p
ipe(0x15fd280 sd=23 :0 s=1 pgs=0 cs=0 l=0 c=0x15be420).fault
2014-04-04 15:11:01.034627 7f57b01cc700  0 -- 10.111.3.1:6789/0 >> 
10.112.3.3:67
89/0 pipe(0x1607c80 sd=25 :0 s=1 pgs=0 cs=0 l=0 c=0x15be840).fault

[etc. for some time; and then]

2014-04-04 15:21:01.033997 7f57b1cd2700  0 
mon.hvrrzceph1@0(probing).data_health
(0) update_stats avail 70% total 15365520 used 3804740 avail 10780236
2014-04-04 15:22:01.034316 7f57b1cd2700  0 
mon.hvrrzceph1@0(probing).data_health
(0) update_stats avail 70% total 15365520 used 3804740 avail 10780236
2014-04-04 15:23:01.034627 7f57b1cd2700  0 
mon.hvrrzceph1@0(probing).data_health
(0) update_stats avail 70% total 15365520 used 3804740 avail 10780236
2014-04-04 15:24:01.034917 7f57b1cd2700  0 
mon.hvrrzceph1@0(probing).data_health
(0) update_stats avail 70% total 15365520 used 3804740 avail 10780236

Diedrich

-- 
Diedrich Ehlerding, Fujitsu Technology Solutions GmbH,
FTS CE SC PS&IS W, Hildesheimer Str 25, D-30880 Laatzen
Fon +49 511 8489-1806, Fax -251806, Mobil +49 173 2464758
Firmenangaben: http://de.ts.fujitsu.com/imprint.html

_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to