Sorry Randy, that was not a case of saying "dunno". HAD not heartbeating GAB is usually indicative of a system load issue or something blocking the ability of HAD to open necessary lock files. These are general statements as this can happen on any environment and should be easy to track down.
Specific questions, or more difficult to solve issues need to be opened as a support case. This is a general discussion forum, not a support avenue for VCS. Since the support guys have access to explorer output, core files, and far more day to day experience, they can answer far better. ________________________________ From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Randy Slead Sent: Tuesday, November 13, 2007 2:43 PM To: veritas-ha@mailman.eng.auburn.edu Subject: Re: [Veritas-ha] SF/HA 5.0 on Solaris 9: "HAD Self Check" error I have seen this on all version of VCS (4/5) even at 10% system utilization. And Symantec going "I dunno", is not helpful. Jim Senicka <[EMAIL PROTECTED]> wrote: HAD is not talking to GAB. Excessive system utilization, or a blocked /var file system or some such issue. ________________________________ From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Marianne Van Den Berg Sent: Tuesday, November 13, 2007 1:17 PM To: veritas-ha@mailman.eng.auburn.edu Subject: [Veritas-ha] SF/HA 5.0 on Solaris 9: "HAD Self Check" error Hi all Brand new installation - 2-node cluster, Solaris 9 with latest O/S patches, SF/HA 5.0 with MP1. IPMultiNICB config'ed as parallel sg (using mpathd) and ClusterService group. Getting these errors about 3 minutes after hastart. Any ideas?? /var/adm/messages: Nov 13 15:59:11 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 7 sec Nov 13 15:59:12 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 8 sec Nov 13 15:59:13 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 9 sec Nov 13 15:59:14 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 10 sec Nov 13 15:59:15 drp-db-1 Had[140]: [ID 702911 daemon.alert] VCS WARNING V-16-1-51047 HAD Self Check: Excessive delay in the HAD heartbeat to GAB (10 seconds) Nov 13 15:59:15 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 11 sec Nov 13 15:59:16 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 12 sec Nov 13 15:59:17 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 13 sec Nov 13 15:59:18 drp-db-1 gab: [ID 272231 kern.notice] GAB WARNING V-15-1-20057 Port h process 140 inactive 14 sec Nov 13 15:59:19 drp-db-1 gab: [ID 191522 kern.notice] GAB WARNING V-15-1-20058 Port h process 140: heartbeat failed, killing process Nov 13 15:59:19 drp-db-1 gab: [ID 975177 kern.notice] GAB INFO V-15-1-20059 Port h heartbeat interval 15000 msec. Statistics: Nov 13 15:59:19 drp-db-1 gab: [ID 217350 kern.notice] GAB INFO V-15-1-20129 Port h: heartbeats in 0 ~ 3000 msec: 3869 Nov 13 15:59:19 drp-db-1 gab: [ID 217350 kern.notice] GAB INFO V-15-1-20129 Port h: heartbeats in 3000 ~ 6000 msec: 0 Nov 13 15:59:19 drp-db-1 gab: [ID 217350 kern.notice] GAB INFO V-15-1-20129 Port h: heartbeats in 6000 ~ 9000 msec: 0 Nov 13 15:59:19 drp-db-1 gab: [ID 217350 kern.notice] GAB INFO V-15-1-20129 Port h: heartbeats in 9000 ~ 12000 msec: 0 Nov 13 15:59:19 drp-db-1 gab: [ID 217350 kern.notice] GAB INFO V-15-1-20129 Port h: heartbeats in 12000 ~ 15000 msec: 0 Nov 13 15:59:19 drp-db-1 gab: [ID 259915 kern.notice] GAB INFO V-15-1-20094 number of processes: 158 Nov 13 15:59:19 drp-db-1 gab: [ID 631272 kern.notice] GAB INFO V-15-1-20095 load average in 1 min: 0. 6 Nov 13 15:59:19 drp-db-1 gab: [ID 587815 kern.notice] GAB INFO V-15-1-20096 load average in 5 min: 0. 8 Nov 13 15:59:19 drp-db-1 gab: [ID 980060 kern.notice] GAB INFO V-15-1-20097 load average in 15 min: 0.10 Nov 13 15:59:19 drp-db-1 gab: [ID 559196 kern.notice] GAB INFO V-15-1-20098 pagein rate: 0 Nov 13 15:59:19 drp-db-1 gab: [ID 582491 kern.notice] GAB INFO V-15-1-20099 pageout rate: 0 Nov 13 15:59:19 drp-db-1 gab: [ID 940236 kern.notice] GAB INFO V-15-1-20041 Port h: client process failure: killing process Nov 13 15:59:19 drp-db-1 Had[140]: [ID 702911 daemon.alert] VCS WARNING V-16-1-53034 HAD Signal SIGABRT received Nov 13 15:59:19 drp-db-1 Had[140]: [ID 702911 daemon.alert] VCS NOTICE V-16-1-53038 Beginning execution of the diagnostics script Nov 13 15:59:21 drp-db-1 Had[140]: [ID 702911 daemon.alert] VCS NOTICE V-16-1-53039 Completed execution of the diagnostics script Nov 13 15:59:22 drp-db-1 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port h closed Nov 13 15:59:22 drp-db-1 syslog[29181]: [ID 702911 daemon.notice] VCS ERROR V-16-1-11103 VCS exited. It will restart had restarts, but the same thing happens again after a couple of minutes. Regards Marianne _______________________________________________ Veritas-ha maillist - Veritas-ha@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-ha Thanks, /\ () /\ Randy Slead /~~\ o/ _ /\ /~~\/\ Sr Solaris Sys Admin /\/ \ /\/ / / \/ \ (Minor Pubah) / \ / / / / \ [EMAIL PROTECTED] / \ - / / \ ________________________________ Never miss a thing. Make Yahoo your homepage. <http://us.rd.yahoo.com/evt=51438/*http://www.yahoo.com/r/hs>
_______________________________________________ Veritas-ha maillist - Veritas-ha@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-ha