Anders Persson wrote:
Hi Folks,
A design document for socket filters is now available at:
http://cr.opensolaris.org/~anders/sockfilter/sockfilter-design.pdf
A few questions and comments on the design and design document.
Section 2.1 says you manage the filters using SMF. How does that
interact with zones? For instance, if a shared-IP or exclusive-IP zone
disables one of the service instances, what happens?
Or do the filter service instances only exist in the global zone
somehow? (Having them exist in non-global zones but have no effect on
the filters used would seem confusing.)
Section 3.2. What is the errno when a FIL_DETACH is done on an automatic
filter?
Section 4.3. It would make sense to mention that the bypass flag is a
performance optimization. The filter must do its own check of whether to
skip its work, in order to correctly handle any callbacks that are
already in flight when the bypass flag is set.
Section 5.3. How does flow control interact with filters that inject
data (either for input or output)?
I don't understand the need for section 7.1. The mblks that shouldn't
be modified have db_ref == 2 (this is done by using esballoca), hence
there shouldn't be any need to do anything special for the filters. A
filter, just like any STREAMS module or driver, must check db_ref == 1
before doing any in-place modifications to the dblk.
Section 7.2. How does a filter indicate that it doesn't have a data_in
callback? Null pointer?
Section 7.3 seems dangerous. If kssl is implemented using filters and
some application issues some innocuous I_* ioctl to get something, that
would make kssl disappear from the socket? I think instead the ioctl (or
other reason for the fallback) must fail.
Section 8. Do we have interested parties outside of the consolidation
that would like to use socket filters? What would it take to make this
an evolving interface?
Section 8.1: What are the semantics of sof_unregister for a filter that
is in use? Will it fail with EBUSY? Something else?
Section 8.2.1 If there are multiple filters and one returns
SOF_RVAL_DEFER, do the filters above it get an attach callback?
Sections 8.2.3, 8.2.4, etc. say the filter can modify the mblk. In light
of the db_ref considerations it would be better to say that it can
return a different mblk (with in-situ modification of the dblk allowed
if db_ref == 1). Instead of taking an mblk_t **, it seems less error
prone to have an mblk_t pointer as the return value. That makes it less
likely to introduce errors where the mblk is freed or changed, but *mpp
isn't updated. (I've fixed this in IP recently.)
What is the definition of *sizep? Is it what I get from msgdsize()?
msgsize()? (Can there be mblks other than M_DATA?)
Section 8.2.4. Why do we need the complexity of tailmp?
Why do we even need the complexity of a b_next chain? Doesn't TCP just
have b_cont chains?
Section 8.2.5. In light of the above comments I think an mblk_t * return
value type makes more sense, and have the error return value as a
separate out argument. When can the filter free and/or consume an mblk?
For instance, if the filter wants to queue the mblk would it return a
NULL mblk and set RVAL_RETURN? Must the filter free the message when
returning an error?
Section 8.2.6. What is the utility of SOF_RVAL_RETURN for bind or
connect? I can imagine using it for a deferred connect (similar to an
accept filter but in the reverse direction). But to do that the filter
would need a way to continue propagating the connect down to the
protocol, and I don't see such a facility.
Ditto for setsockopt.
Perhaps we should restrict SOF_RVAL_RETURN to the callbacks where we
know we have all the pieces necessary to make use of it?
Sections 8.4.3 and 8.4.4 don't indicate how this works when there are
multiple filters. Earlier in the document you stated how accept filters
work (each filter has to call sof_newconn_ready). Is the same true for
flow control: that every filter that has stopped the flow has to restart
it? It would make sense to document this behavior in the earlier part on
flow control.
Erik
_______________________________________________
networking-discuss mailing list
[email protected]