Re: [ofa-general] [BUG report / PATCH] fix race in the core multicast management

Sean Hefty Thu, 20 Sep 2007 09:30:07 -0700

I see, however, there is no second join here, its a leave and join wherethe group refcount climbs to 2 since the the join code inc it on itssynchronous part which is executed before the thread handles theprocessing of the leave request.

The refcount is only used to ensure that the group structure continuesto exist. The code must be able to handle multiple users callingjoin/free at the same time, including a single user calling free beforeits previous call to join has completed. All MADs sent for the samemulticast group must also be serialized to prevent join and leaverequests for the same group from reaching the SA out of order.

If you walk through ib_sa_free_multicast(), the group membership isdecremented. A reference is held on the group because a work item hasjust been queued on the group for processing. We cannot remove thisreference unless we avoid queuing the work item. And the work item isqueued to ensure that the leave request to the SA is serialized withpossible future join requests.

I am not sure this is what we want from the core design. Say theconsumer has some flexibility in the join request (eg through future apichange), such that they can join a group, leave it, then join again thisgroup with different "attributes". Then if the join crosses the leave ina way that causes the core code not to issue sa leave/join queries, itsa bug from the perspective of the user.

If the attributes from a subsequent join differ from an existing join,the subsequent join operation will fail. The only way I can think of tomake this situation work is to add an asynchronousib_sa_leave_multicast() routine that provides a callback after the leavecompletes, in addition to the existing free call.

This could be a fairly difficult case to make work anyway, since itrequires destroying the group at the SA before it can be re-created withthe different attributes. It requires coordination across the groupthat's beyond the control of the local multicast module. (A singlegroup creator could handle this fairly easily.)

OK, on this specific host system there was no port down event! so theonly event that the multicast and ipoib code got was port active. Thisis why the patch I sent solves (hides) the problem, it causes themulticast code to transition the group into the error state, so theipoib join that follows causes an sa join query to be actually sent.

There were two port active events delivered back to back with no otherevents in between? If so, is this something that can or should occur?The patch itself looks fine to me; I'm trying to determine if there areother refcount problems in the multicast module. I'm not convinced thatthere are at this point.

I don't think there's a problem in ipoib, it just does not rely onmulticast error notifications but rather on port events. Do you thinkits less robust, and if yes, why?

As long as the multicast module gets the event notification first, whichI believe is the case, then I don't think there's any problems.


- Sean
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [ofa-general] [BUG report / PATCH] fix race in the core multicast management

Reply via email to