Hi, I've been testing the current opensm development head (commit 83b67527d16 from git://git.openfabrics.org/~alexnetes/opensm), and I've been getting some messages that are new since version 3.3.7:
Apr 22 12:08:09 646534 [411CD940] 0x01 -> log_rcv_cb_error: ERR 3111: Received MAD with error status = 0x1C SubnGetResp(SwitchInfo), attr_mod 0x0, TID 0x4802 Initial path: 0,1,1,4 Return path: 0,20,1,7 I get one of these messages for each switch in my fabric, on every heavy sweep. It appears these are caused by my switches incorrectly reporting the capability IB_PORT_CAP_HAS_MCAST_FDB_TOP; i.e. this patch stops the messages: diff --git a/opensm/osm_mcast_mgr.c b/opensm/osm_mcast_mgr.c index ea52bfe..63d2968 100644 --- a/opensm/osm_mcast_mgr.c +++ b/opensm/osm_mcast_mgr.c @@ -1041,7 +1041,7 @@ static void mcast_mgr_set_mfttop(IN osm_sm_t * sm, IN osm_switch_t * p_sw) p_path = osm_physp_get_dr_path_ptr(p_physp); p_tbl = osm_switch_get_mcast_tbl_ptr(p_sw); - if (p_physp->port_info.capability_mask & IB_PORT_CAP_HAS_MCAST_FDB_TOP) { + if (0 && p_physp->port_info.capability_mask & IB_PORT_CAP_HAS_MCAST_FDB_TOP) { /* Set the top of the multicast forwarding table. */ IB_PORT_CAP_HAS_MCAST_FDB_TOP is bit 30 of the port capability mask, which in at least IBA v1.2.1 was a reserved bit but apparently is not anymore. Should I file a bug report with my switch vendor about setting a port capability bit for a capability they don't support, or is there something else going on that I haven't figured out yet? FWIW I think my switches have a base SP0; maybe it's got something to do with that? Thanks -- Jim -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html