RE: [openib-general] question on opensm error

2005-02-17 Thread shaharf
Hi, There is a sys fail red light on the CPU on the 96-port switch that the opensm host attaches to. What's weird is none of the ib admin tools found anything. ibnetdiscover happily walked the whole subnet. The only problem was that opensm would not run, but the errors were unclear.

Re: [openib-general] question on opensm error

2005-02-16 Thread Hal Rosenstock
On Wed, 2005-02-16 at 11:45, Ronald G. Minnich wrote: On Tue, 16 Feb 2005, Hal Rosenstock wrote: On Tue, 2005-02-15 at 22:22, Ronald G. Minnich wrote: On Tue, 15 Feb 2005, Hal Rosenstock wrote: I presume your subnet has 179 HCAs ? Do you know ? no errors. It's just that

Re: [openib-general] question on opensm error

2005-02-15 Thread Hal Rosenstock
Hi Ron, On Mon, 2005-02-14 at 15:59, Ronald G. Minnich wrote: formerly working opensm starts to get these: So the OpenSM was up and running and these messages appeared in the log. Did anything change in the subnet ? [1108414727:000284173][411FF970] - umad_receiver: send completed with

Re: [openib-general] question on opensm error

2005-02-15 Thread Hal Rosenstock
Hi Ron, On Mon, 2005-02-14 at 15:59, Ronald G. Minnich wrote: formerly working opensm starts to get these: So the OpenSM was up and running and these messages appeared in the log. Did anything change in the subnet ? [1108414727:000284173][411FF970] - umad_receiver: send completed with

Re: [openib-general] question on opensm error

2005-02-15 Thread Ronald G. Minnich
On Tue, 15 Feb 2005, Hal Rosenstock wrote: ibstatus/ibstat can show the local port logical and physical port state. bluesteel:~ # ibstat CA 'mthca0': CA type: MT23108 Number of ports: 2 Firmware version: 3.3.2 Hardware version: a1 Node GUID: