Sure, here is the output: Note this is with the switch we swapped in, so the port numbers don't match the ibchecknet output in the original message.
# ibstat CA 'mlx4_0' CA type: MT26428 Number of ports: 2 Firmware version: 2.6.0 Hardware version: a0 Node GUID: 0x0002c90300032de0 System image GUID: 0x0002c90300032de3 Port 1: State: Active Physical state: LinkUp Rate: 40 Base lid: 6 LMC: 0 SM lid: 6 Capability mask: 0x0251086a Port GUID: 0x0002c90300032de1 Port 2: State: Down Physical state: Polling Rate: 10 Base lid: 0 LMC: 0 SM lid: 0 Capability mask: 0x02510868 Port GUID: 0x0002c90300032de2 CA 'mthca0' CA type: MT25204 Number of ports: 1 Firmware version: 1.2.0 Hardware version: a0 Node GUID: 0x003048c64c0c0000 System image GUID: 0x003048c64c0c0003 Port 1: State: Down Physical state: Polling Rate: 10 Base lid: 0 LMC: 0 SM lid: 0 Capability mask: 0x02510a68 Port GUID: 0x003048c64c0c0001 # iblinkinfo Switch 0x0002c9020041a7a0 Infiniscale-IV Mellanox Technologies: 1 1[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 5 1[ ] " HCA-1" ( ) 1 2[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 6 1[ ] "linux70 HCA-1" ( ) 1 3[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 7 1[ ] "linux71 HCA-1" ( ) 1 4[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 5[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 6[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 7[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 8[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 9[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 10[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 11[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 12[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 13[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 14[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 15[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 16[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 17[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 18[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 19[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 20[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 21[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 22[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 23[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 24[ ] ==( 4X 5.0 Gbps Active/ LinkUp)==> 9 1[ ] " HCA-1" ( ) 1 25[ ] ==( 4X 5.0 Gbps Active/ LinkUp)==> 8 1[ ] " HCA-1" ( ) 1 26[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 27[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 28[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 29[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 30[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 31[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 32[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 33[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 34[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 35[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) 1 36[ ] ==( 4X 2.5 Gbps Down/ Polling)==> [ ] "" ( ) On Thu, Sep 2, 2010 at 12:03 PM, Ira Weiny <wei...@llnl.gov> wrote: > On Thu, 2 Sep 2010 06:56:50 -0700 > Chuck Hartley <hartlc...@gmail.com> wrote: > >> We swapped in a different switch and see the same errors. The opensm >> logfile does not show any errors: > > Could you run "ibstat" on the node with OpenSM running? > > And "iblinkinfo" on the same node? > > Send that output. > > Ira > >> >> ------------------------------------------------- >> OpenSM 3.3.5 >> Command Line Arguments: >> Daemon mode >> Log File: /var/log/opensm.log >> ------------------------------------------------- >> OpenSM 3.3.5 >> >> Sep 02 05:56:29 933684 [B53B8700] 0x80 -> OpenSM 3.3.5 >> Entering DISCOVERING state >> >> Sep 02 05:56:29 934931 [B53B8700] 0x02 -> osm_vendor_init: 1000 >> pending umads specified >> Sep 02 05:56:29 935079 [B53B8700] 0x80 -> Entering DISCOVERING state >> Using default GUID 0x2c90300032de1 >> Entering MASTER state >> >> Sep 02 05:56:29 953763 [B53B8700] 0x02 -> osm_vendor_bind: Binding to >> port 0x2c90300032de1 >> Sep 02 05:56:29 990146 [B53B8700] 0x02 -> osm_vendor_bind: Binding to >> port 0x2c90300032de1 >> Sep 02 05:56:29 990240 [B53B8700] 0x02 -> osm_opensm_bind: Setting >> IS_SM on port 0x0002c90300032de1 >> Sep 02 05:56:30 009040 [AF1DB710] 0x80 -> Entering MASTER state >> SUBNET UP >> >> Sep 02 05:56:30 009885 [AF1DB710] 0x02 -> osm_ucast_mgr_process: >> minhop tables configured on all switches >> Sep 02 05:56:30 014593 [AF1DB710] 0x80 -> SUBNET UP >> >> >> On Thu, Sep 2, 2010 at 8:56 AM, Hal Rosenstock <hal.rosenst...@gmail.com> >> wrote: >> > On Thu, Sep 2, 2010 at 8:34 AM, Chuck Hartley <hartlc...@gmail.com> wrote: >> >> Hello, >> >> >> >> We installed 1.5.1 and are having problems getting the IB fabric >> >> working. ibv_devinfo shows the HCAs ports are ok and ibdiagnet reports >> >> no errors. However, ibchecknet shows that the switch ports are not >> >> being configured. We have never seen this before and are at a loss as >> >> to where the problem might be - would someone please point us in the >> >> right direction to look? Could it be a problem with the switch >> >> itself? Output from ibchecknet below. >> >> >> >> >> >> # ibchecknet >> >> Error check on lid 3 (Infiniscale-IV Mellanox Technologies) port all: >> >> FAILED >> >> ibwarn: [26732] dump_perfcounters: PortXmitWait not indicated so >> >> ignore this counter >> >> #warn: Lid is not configured lid 3 port 7 >> >> #warn: SM Lid is not configured >> > >> > Is there an SM running on your subnet ? If so, I think that the lack >> > of an SM could account for all of the issues mentioned here. >> > >> > -- Hal >> > >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in >> the body of a message to majord...@vger.kernel.org >> More majordomo info at http://*vger.kernel.org/majordomo-info.html >> > > > -- > Ira Weiny > Math Programmer/Computer Scientist > Lawrence Livermore National Lab > 925-423-8008 > wei...@llnl.gov > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html