Hello again, On Sat, 2007-04-14 at 08:56 -0600, Alan Robertson wrote: > Mark Frasa wrote: > > Hello again, > > > > On Fri, 2007-04-13 at 07:10 -0600, Alan Robertson wrote: > >> Mark Frasa wrote: > >>> On Wed, 2007-04-11 at 07:36 -0600, Alan Robertson wrote: > >>> > >>> Hello, > >>> > >>>> Mark Frasa wrote: > >>>>> Hello, > >>>>> > >>>>> thanks for the tip, however cl_status rscstatus doesn't output a thing: > >>>>> > >>>>> box:~# cl_status rcstatus > >>>>> box:~# > >>>>> > >>>>> The only thing my syslog shows: > >>>>> > >>>>> Apr 11 08:56:11 dns01 cl_status[3505]: info: start:optind: 1 > >>>>> argv[optind+1]: (null) > >>>>> > >>>>> And my messages: > >>>>> > >>>>> Apr 11 08:57:02 dns01 cl_status[3507]: info: start:optind: 1 > >>>>> argv[optind+1]: (null) > >>>>> Apr 11 08:57:02 dns01 cl_status[3507]: info: End: optind: 1 argv[optind > >>>>> +1]: (null) > >>>>> > >>>>> Is there any reason why this appears? Hearthbeat is running on the > >>>>> localhost, but not on the remote host when i do this. > >>>> The logs are the author never removing his debug info. > >>>> > >>>> Just to double-check... This is an R1-style (haresources) installation > >>>> not one with the cib.xml file (R2/CRM-style)? > >>>> > >>>> > >>> Yep, its the simple setup :-) > >> Here's the code in question: > >> tatic int > >> rscstatus(ll_cluster_t *hb, int argc, char ** argv, const char * optstr) > >> { > >> const char * rstatus; > >> > >> if ( general_simple_opt_deal(argc, argv, optstr) < 0 ) { > >> /* There are option errors */ > >> return PARAMETER_ERROR; > >> }; > >> > >> rstatus = hb->llc_ops->get_resources(hb); > >> if ( rstatus == NULL ) { > >> cl_log(LOG_ERR > >> , "Cannot get client %s's resource status" > >> , argv[optind+2]); > >> cl_log(LOG_ERR, "REASON: %s", hb->llc_ops->errmsg(hb)); > >> return UNKNOWN_ERROR; > >> } > >> if (FOR_HUMAN_READ == TRUE) { > >> printf("This node is holding %s resources.\n", rstatus); > >> } else { > >> printf("%s\n", rstatus); > >> } > >> return 0; > >> } > >> > >> general_simple_opt_deal prints an error if it thinks there's an error > >> that's occurred. (But, reading the code and how it's used, its attempt > >> to return an error will be ignored because it returns a positive value.) > >> > >> All the other cases print something as well... > >> > >> You might try invoking it with rscstatus -m and see if that prints "This > >> node is holding"... > >> > >> I just reread the code in the client side and server side for this > >> request. It's not at all obvious how an empty string can be returned. > >> And, if an error occurs, it's not obvious how it doesn't get printed > >> somewhere... > >> > >> 1) Check the error exit code > >> 2) try it with -m and see if something comes out > >> 3) raise debug and see if the messages tell you something > >> kill -SIGUSR1 $hbpid; kill -SIGUSR1 $hbpid; > >> cl_status rscstatus -m; echo $? > >> kill -SIGUSR2 $hbpid; kill -SIGUSR2 $hbpid; > >> > > > > When i enable the debugging i get the message: > > > > # cl_status rscstatus > > # Apr 13 15:44:27 dns01 cl_status[4001]: info: start:optind: 1 > > argv[optind+1]: (null) > > Apr 13 15:44:27 dns01 cl_status[4001]: ERROR: Cannot get client HZ=100's > > status > > Apr 13 15:44:27 dns01 cl_status[4001]: ERROR: REASON: Can't send message > > to IPC Channel: Resource temporarily unavailable > > Apr 13 15:44:27 dns01 cl_status[4001]: info: End: optind: 1 argv[optind > > +1]: (null) > > Apr 13 15:44:27 dns01 cl_status[4001]: ERROR: Cannot sign off from > > heartbeat. > > Apr 13 15:44:27 dns01 cl_status[4001]: ERROR: REASON: Can't send message > > to IPC Channel: Resource temporarily unavailable can't send message to > > IPC: Resource temporarily unavailable > > Apr 13 15:44:27 dns01 cl_status[4001]: ERROR: return value:13 > > > > This is only when i do cl_status rscstatus -m > > > > Though i am unsure what this error means, can you clarify it? > > It means the code you're running doesn't look much like the code I have > in source control... Nor any version that I've seen... > > Your code appears to be calling the clientstatus sub-command, not the > rscstatus sub-command -- at least if those messages are exactly correct > and nothing has been omitted. But, it looks like maybe a word has been > omitted from the message, or the old versions of this code were > different (which they don't seem to be). > > What version is this, and where did you get the binaries for it? >
The version is: heartbeat 1.2.3-9sarge6 Subsystem for High-Availability Linux And as you can see, i retrieved the version via apt-get it is the default sarge version. Cheers, Mark. _______________________________________________ Linux-HA mailing list [EMAIL PROTECTED] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems