Boa tarde a todos 

Esse é meu primeiro Post, aqui na lista.

Seguinte tenho dois servidores GW com Heartbeat instalado. 
O que ocorre é o seguinte, ele cai e sobe em determinados dias,
causando uma queda temporaria na internet. E voltando novamente.

A configuração primario é Debian 4 Etch, e da segunda um Conectiva 10,
a ligação de uma maquina para a outra é feita via Serial.
 Esse é a parte do meu Log

Oct 22 21:10:53 gateway heartbeat[19084]: WARN: Late heartbeat: Node
gateway2.domain.com.br: interval 12530 ms
Oct 22 22:20:37 gateway heartbeat[19084]: WARN: node
gateway2.domain.com.br: is dead
Oct 22 22:20:37 gateway heartbeat[19084]: WARN: No STONITH device
configured.
Oct 22 22:20:37 gateway heartbeat[19084]: WARN: Shared disks are not
protected.
Oct 22 22:20:37 gateway heartbeat[19084]: info: Resources being
acquired from gateway2.domain.com.br.
Oct 22 22:20:37 gateway heartbeat[19084]: info: Link
gateway2.domain.com.br:/dev/ttyS0 dead.
Oct 22 22:20:38 gateway heartbeat: info: Running /etc/ha.d/rc.d/status
status
Oct 22 22:20:38 gateway heartbeat: info: /usr/lib/heartbeat/mach_down:
nice_failback: foreign resources acquired
Oct 22 22:20:42 gateway heartbeat[19084]: WARN: Cluster node
gateway2.domain.com.br returning after partition.
Oct 22 22:20:42 gateway heartbeat[19084]: WARN: Deadtime value may be
too small.
Oct 22 22:20:42 gateway heartbeat[19084]: info: See documentation for
information on tuning deadtime.
Oct 22 22:20:42 gateway heartbeat[19084]: info: Link
gateway2.domain.com.br:/dev/ttyS0 up.
Oct 22 22:20:42 gateway heartbeat[19084]: WARN: Late heartbeat: Node
gateway2.domain.com.br: interval 35790 ms
Oct 22 22:20:42 gateway heartbeat[19084]: info: Status update for node
gateway2.domain.com.br: status active
Oct 22 22:20:42 gateway heartbeat[19084]: info: mach_down takeover
complete.
Oct 22 22:20:42 gateway heartbeat: info: mach_down takeover complete
for node gateway2.domain.com.br.
Oct 22 22:20:42 gateway heartbeat[14883]: info: Local Resource
acquisition completed.
Oct 22 22:20:42 gateway heartbeat: info: Running /etc/ha.d/rc.d/status
status
Oct 22 22:20:44 gateway heartbeat[19084]: info: Heartbeat shutdown in
progress. (19084)
Oct 22 22:20:44 gateway heartbeat[16667]: info: Giving up all HA
resources.
Oct 22 22:20:44 gateway heartbeat: info: Releasing resource group:
gateway.domain.com.br 200.xxx.xxx.xxx/30/eth0 200.xxx.xxx.x6/30/eth1
200.xxx.xxx.x7/29/eth2 firewall shaper
Oct 22 22:20:44 gateway heartbeat: info: Running /etc/init.d/shaper  stop
Oct 22 22:20:46 gateway heartbeat: info: Running /etc/init.d/firewall
 stop
Oct 22 22:20:46 gateway heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 200.xxx.xxx.x7/29/eth2 stop
Oct 22 22:20:47 gateway heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 200.xxx.xxx.x6/30/eth1 stop
Oct 22 22:20:47 gateway heartbeat: info: /sbin/route -n del -host
200.xxx.xxx.x6
Oct 22 22:20:47 gateway heartbeat: info: /sbin/ifconfig eth1:0 down
Oct 22 22:20:47 gateway heartbeat: info: IP Address 200.xxx.xxx.x6
released
Oct 22 22:20:47 gateway heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 200.xxx.xxx.xxx/30/eth0 stop
Oct 22 22:20:47 gateway heartbeat[16667]: info: All HA resources
relinquished.
Oct 22 22:20:47 gateway heartbeat[19084]: WARN: 1 lost packet(s) for
[gateway2.domain.com.br] [239455:239457]
Oct 22 22:20:47 gateway heartbeat[19084]: info: No pkts missing from
gateway2.domain.com.br!
Oct 22 22:20:48 gateway heartbeat[19084]: info: killing HBFIFO process
19086 with signal 15
Oct 22 22:20:48 gateway heartbeat[19084]: info: killing HBWRITE
process 19087 with signal 15
Oct 22 22:20:48 gateway heartbeat[19084]: info: killing HBREAD process
19088 with signal 15
Oct 22 22:20:48 gateway heartbeat[19084]: info: Core process 19088
exited. 3 remaining
Oct 22 22:20:48 gateway heartbeat[19084]: info: Core process 19086
exited. 2 remaining
Oct 22 22:20:48 gateway heartbeat[19084]: info: Core process 19087
exited. 1 remaining
Oct 22 22:20:48 gateway heartbeat[19084]: info: Heartbeat shutdown
complete.
Oct 22 22:20:48 gateway heartbeat[19084]: info: Heartbeat restart
triggered.
Oct 22 22:20:48 gateway heartbeat[19084]: info: Restarting heartbeat.
Oct 22 22:20:48 gateway heartbeat[19084]: info: Performing heartbeat
restart exec.
Oct 22 22:21:19 gateway heartbeat[19084]: info: **************************
Oct 22 22:21:19 gateway heartbeat[19084]: info: Configuration
validated. Starting heartbeat 1.2.5
Oct 22 22:21:19 gateway heartbeat[19947]: info: heartbeat: version 1.2.5
Oct 22 22:21:19 gateway heartbeat[19947]: info: Heartbeat generation: 23
Oct 22 22:21:20 gateway heartbeat[19947]: info: Starting serial
heartbeat on tty /dev/ttyS0 (19200 baud)
Oct 22 22:21:20 gateway heartbeat[19947]: info: pid 19947 locked in
memory.
Oct 22 22:21:20 gateway heartbeat[19947]: info: Local status now set
to: 'up'
Oct 22 22:21:21 gateway heartbeat[19949]: info: pid 19949 locked in
memory.
Oct 22 22:21:21 gateway heartbeat[19950]: info: pid 19950 locked in
memory.
Oct 22 22:21:21 gateway heartbeat[19951]: info: pid 19951 locked in
memory.
Oct 22 22:21:21 gateway heartbeat[19947]: WARN: string2msg_ll: node
[gateway2.domain.com.br] failed authentication
Oct 22 22:21:22 gateway heartbeat[19947]: info: Link
gateway2.domain.com.br:/dev/ttyS0 up.
Oct 22 22:21:22 gateway heartbeat[19947]: info: Status update for node
gateway2.domain.com.br: status active
Oct 22 22:21:22 gateway heartbeat[19947]: info: Local status now set
to: 'active'
Oct 22 22:21:22 gateway heartbeat: info: Running /etc/ha.d/rc.d/status
status
Oct 22 22:21:22 gateway heartbeat[19947]: info: remote resource
transition completed.
Oct 22 22:21:22 gateway heartbeat[19947]: info: remote resource
transition completed.
Oct 22 22:21:22 gateway heartbeat[19947]: info: Local Resource
acquisition completed. (none)
Oct 22 22:21:23 gateway heartbeat[19947]: info: gateway2.domain.com.br
wants to go standby [foreign]
Oct 22 22:21:35 gateway heartbeat[19947]: info: standby: acquire
[foreign] resources from gateway2.domain.com.br
Oct 22 22:21:35 gateway heartbeat[19956]: info: acquire local HA
resources (standby).
Oct 22 22:21:35 gateway heartbeat: info: Acquiring resource group:
gateway.domain.com.br 200.xxx.xxx.xxx/30/eth0 200.xxx.xxx.x6/30/eth1
200.xxx.xxx.x7/29/eth2 firewall shaper
Oct 22 22:21:35 gateway heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 200.xxx.xxx.xxx/30/eth0 start
Oct 22 22:21:35 gateway heartbeat: info: /sbin/ifconfig eth0:0
200.xxx.xxx.xxx netmask 255.255.255.252      broadcast 200.208.220.131
Oct 22 22:21:35 gateway heartbeat: info: Sending Gratuitous Arp for
200.xxx.xxx.xxx on eth0:0 [eth0]
Oct 22 22:21:35 gateway heartbeat: /usr/lib/heartbeat/send_arp -i 1010
-r 5 -p /var/lib/heartbeat/rsctmp/send_arp/send_arp-200.xxx.xxx.xxx
eth0 200.xxx.xxx.xxx auto 200.xxx.xxx.xxx ffffffffffff
Oct 22 22:21:35 gateway heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 200.xxx.xxx.x6/30/eth1 start
Oct 22 22:21:35 gateway heartbeat: info: /sbin/ifconfig eth1:0
200.xxx.xxx.x6 netmask 255.255.255.252 broadcast 200.208.223.67
Oct 22 22:21:35 gateway heartbeat: info: Sending Gratuitous Arp for
200.xxx.xxx.x6 on eth1:0 [eth1]
Oct 22 22:21:35 gateway heartbeat: /usr/lib/heartbeat/send_arp -i 1010
-r 5 -p /var/lib/heartbeat/rsctmp/send_arp/send_arp-200.xxx.xxx.x6
eth1 200.xxx.xxx.x6 auto 200.xxx.xxx.x6 ffffffffffff
Oct 22 22:21:36 gateway heartbeat: info: Running
/etc/ha.d/resource.d/IPaddr 200.xxx.xxx.x7/29/eth2 start
Oct 22 22:21:36 gateway heartbeat: info: /sbin/ifconfig eth2:0
200.xxx.xxx.x7 netmask 255.255.255.248      broadcast 200.208.220.151
Oct 22 22:21:36 gateway heartbeat: info: Sending Gratuitous Arp for
200.xxx.xxx.x7 on eth2:0 [eth2]
Oct 22 22:21:36 gateway heartbeat: /usr/lib/heartbeat/send_arp -i 1010
-r 5 -p /var/lib/heartbeat/rsctmp/send_arp/send_arp-200.xxx.xxx.x7
eth2 200.xxx.xxx.x7 auto 200.xxx.xxx.x7 ffffffffffff
Oct 22 22:21:36 gateway heartbeat: info: Running /etc/init.d/firewall
 start
Oct 22 22:21:36 gateway heartbeat: info: Running /etc/init.d/shaper  start
Oct 22 22:21:41 gateway heartbeat[19956]: info: local HA resource
acquisition completed (standby).
Oct 22 22:21:41 gateway heartbeat[19947]: info: Standby resource
acquisition done [foreign].
Oct 22 22:21:41 gateway heartbeat[19947]: info: Initial resource
acquisition complete (auto_failback)
Oct 22 22:21:41 gateway heartbeat[19947]: info: remote resource
transition completed.


_______________________________________________
Linux-HA mailing list
[email protected]
http://listas.linuxchix.org.br/mailman/listinfo/linux-ha

Responder a