I am relatively new to openib so here goes:

I am attempting to configure our small cluster to use bproc and openib. Note I am using gen1 on kernel 2.6.6 patched with the clustermatic stuff, (should I be using gen2, is it stable for general use?).

I have successfully gotten things going on the head node including opensm. I have successfully gotten the slave nodes to run the patched kernel, load the appropriate modules as well as the various user level libraries but I am having an issue on the slave nodes:

If I run:
$bpsh 13 /usr/mellanox/bin/vstat
1 HCA found:
        hca_id=InfiniHost0
Error: Could not retrieve handle to the HCA InfiniHost0 (VAPI_EGEN)


On the head node I get: $/usr/mellanox/bin/vstat 1 HCA found: hca_id=InfiniHost0 vendor_id=0x02C9 vendor_part_id=0x5A44 hw_ver=0xA1 fw_ver=0x300020000 num_phys_ports=2 port=1 port_state=PORT_DOWN sm_lid=0x0000 port_lid=0x0353 port_lmc=0x00 max_mtu=2048

                port=2
                port_state=PORT_ACTIVE
                sm_lid=0x0354
                port_lid=0x0354
                port_lmc=0x00
                max_mtu=2048

I can run ifconfig on the slave I see ib0 properly:
$bpsh 13 ifconfig ib0
ib0       Link encap:Ethernet  HWaddr 00:00:00:00:00:00
          BROADCAST MULTICAST  MTU:2044  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)

Thanks,

Galen

_______________________________________________
openib-general mailing list
openib-general@openib.org
http://openib.org/mailman/listinfo/openib-general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to