I will try the tcpdump. I've been able to get it to work most of the
time by doing stupid stuff with the fence_sanbox2 python script
(repeating sequences of issue/expect, increasing the python
delaybeforesend value), but I don't believe the script code is wrong.
There's something else going on; perhaps the switch is in a funny
state. Now, how would I detect *that*?
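
For what it's worth, the repeat-the-issue/expect workaround amounts to
roughly the sketch below. This is not the actual fence_sanbox2 code;
send_and_expect and its parameters are made-up names for illustration,
and it only assumes a pexpect-style connection object whose expect()
raises an exception when it times out.

```python
import time


def send_and_expect(conn, command, pattern, retries=3, pause=0.5):
    """Send `command` and wait for `pattern`, retrying on failure.

    `conn` is assumed to be any object with pexpect-style send() and
    expect() methods, where expect() raises on timeout. Returns the
    number of attempts it took to see the pattern.
    """
    last_error = None
    for attempt in range(1, retries + 1):
        try:
            conn.send(command + "\r\n")
            conn.expect(pattern)
            return attempt
        except Exception as err:   # e.g. a timeout while waiting for the prompt
            last_error = err
            time.sleep(pause)      # give the switch a moment before retrying
    raise RuntimeError("no match for %r after %d tries: %s"
                       % (pattern, retries, last_error))
```

With a real pexpect session this would be called with whatever command
and prompt pattern the script already uses, and the delaybeforesend
attribute on the spawn object can be raised separately. Needing more
than one attempt would itself be a hint that the switch, not the
script, is misbehaving.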

Thanks,
--Larry

On 03/24/2013 12:00 PM, James Washer wrote:
> You might want to grab a tcpdump of the connection. Perhaps you'll be
> able to see a bit more of the conversation.
>
> On Sat, Mar 23, 2013 at 5:55 AM, Laurence Schuler
> <laurence.schu...@nasa.gov <mailto:laurence.schu...@nasa.gov>> wrote:
>
>     I have a two-node cluster that has been running fine for a couple
>     of months (with few or no reboots, though). We recently updated it
>     to the latest CentOS 6 software, but now the cluster will not
>     start. It keeps throwing errors during startup when attempting to
>     unfence the disks. I have hard reset the fiber switch and reset
>     both hosts, but when I run fence_sanbox2, I am unable to enable,
>     disable or even get the status of the switch ports. This is the
>     error I get.
>
>     > [root@web1 lschule3]# /usr/sbin/fence_sanbox2 -a 192.168.1.190 -l
>     > admin -S FCpass.sh -o enable -n 5 -v
>     > telnet> set binary
>     > Negotiating binary mode with remote host.
>     > telnet> open 192.168.1.190 -23
>     > Trying 192.168.1.190...
>     > Connected to 192.168.1.190.
>     > Escape character is '^]'.
>     >
>     > Firmware V8.0.13.8.0
>     >
>     > r3fc1 login:
>     >
>     >
>     >   Establishing connection...   Please wait.
>     >
>     >        *****************************************************
>     >        *                                                   *
>     >        *       Command Line Interface SHell  (CLISH)       *
>     >        *                                                   *
>     >        *****************************************************
>     >
>     >        SystemDescription   SANbox 5800 FC Switch
>     >        HostName            r3fc1
>     >        EthIPv4NetworkAddr  192.168.1.190
>     >        EthIPv6NetworkAddr  fe80::2c0:00:00:90b
>     >        MACAddress          00:c0:dd:77:10:0b
>     >        WorldWideName       10:00:00:c0:dd:24:09:0b
>     >        SerialNumber        1236H00833
>     >        SymbolicName        r3fc1
>     >        ActiveSWVersion     V8.0.13.8.0
>     >        ActiveTimestamp     Mon Apr  2 18:32:33 2012
>     >        POSTStatus          Passed
>     >        LicensedPorts       12
>     >        SwitchMode          Full Fabric
>     >
>     >   The alarm log is empty.
>     >
>     > r3fc1 #> r3fc1 #> Failed: Unable to switch to admin section
>     > [root@web1 lschule3]#
>
>     I can manually telnet into the FC switch and execute the
>     appropriate commands to enable/disable ports, but the
>     fence_sanbox2 script cannot. The fence_sanbox2 code has not
>     changed; however, python has been upgraded from 2.6.6-29 to
>     2.6.6-36.
>
>     Has anyone else seen this? Know of a fix? Am I doing/not doing
>     something stupid? I seem to recall running this command before,
>     during setup, and it worked just fine then.
>
>     Thanks for any help!
>
>     --
>     Laurence Schuler (Larry)           laurence.schu...@nasa.gov
>     Systems Support                    ADNET Systems, Inc
>     Scientific Visualization Studio    http://svs.gsfc.nasa.gov
>
>     --
>     Linux-cluster mailing list
>     Linux-cluster@redhat.com <mailto:Linux-cluster@redhat.com>
>     https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>
>
> -- 
>
>
>  - jim

-- 
Linux-cluster mailing list
Linux-cluster@redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
