Proposed semantics for attributes in C++ (and in C?)

Mark Mitchell Sun, 15 Oct 2006 15:13:00 -0700

We have a number of C++ PRs open around problems with code like this:


  struct S {
    void f();
    virtual void g();
  };

  typedef __attribute__((...)) struct S T;

If the attribute makes any substantive change to S (e.g., changes itssize, alignment, etc.) then bad things happen. For example, the memberfunctions of "S" have expectations about the layout of "S" that are notsatisfied if they are called with a "T". Depending on the attribute andcircumstances, we do all manner of bad things, including ICE, generatewrong code, etc.

For a while now, I've been promising to propose semantics for theseconstructs. Here is a sketch of the semantics that I think we shouldhave. (I say a sketch because I have not attempted to write standardese.)

All attributes must be classified as either "semantic" or "non-semantic"attributes. A "semantic" attribute is one which might affectcode-generation in any way. A "non-semantic" attribute cannot affectcode-generation. For example, "used" and "deprecated" are non-semanticattributes; there is no way to observe, by looking at an object file,whether or not a class has been marked with one of these attributes. Incontrast, "packed" is a semantic attribute; the size of a class isdifferent depending on whether or not it is "packed".

Any attribute may be applied at the point of definition of a class.These attributes (whether semantic or non-semantic) apply to the class.For example, if the class is packed, then the member functions expectthe "this" pointer to point to the packed class.

A typedef declaration which adds only non-semantic attributes is alwaysvalid. As with other typedefs, the typedef declaration creates a newname for an existing type. The type referred to by that name is thesame type as the original type. However, the *name* has additionalproperties, implied by the (non-semantic) attributes. For example,using a "deprecated" name for a type results in a deprecation warning.But, a function declared to take a parameter with the non-deprecatedname may be passed a parameter with the "deprecated" name.

A typedef declaration which adds semantic attributes to a class type,other than POD classes with no explicitly declared members other thandata members, to arrays of such classes, to arrays of such arrays, etc.,is invalid. (POD-ness alone is not satisfactory, as PODs may containfunction members, and I think dealing with static data members andtypedef members is not worth the trouble.)

A typedef declaration which adds semantic attributes to a POD class typewith no function members is valid, but creates an entirely new type,different from all other types except others formed by adding the samecombination of semantic attributes to the same original class type. Inthe example above, if the typedef adds a semantic attribute, you may notpass an "S" to a function expecting a "T" or vice versa. Neither mayyou pass an "S*" to a function expecting a "T*", without an explicitreinterpret_cast. The name of "T", for linkage purposes, is "T", andthere is no implicit "T::S" type; instead, however, there is a "T::T"type. (Various consequences follow; for example, typeid(T) gives you atype_info object that indicates that the name of the type is "T".)References to the original type from within the types of the members ofthe class still refer to the original class. For example, in:


  struct S {
    char c;
    S* next;
  };
  typedef __attribute__((packed)) S T;

the data member T::next has type "S*", not "T*".

A typedef declaration which adds semantic attributes to a non-class typeis valid, but again creates an entirely new type. (We might want aspecial exception to the "entirely new type" rule for the "mode"attribute, declaring that "typedef __attribute__((mode(DI))) int LL" isequivalent to "typedef long long LL;" on platforms where "long long" hasDImode.) So,


  typedef S* P;
  typedef __attribute__((...)) P Q;

creates a type "Q" that is incompatible with "S*" if the attribute issemantic. However, the type of "*Q" is still "S". It is invalid to doanything that would require either type_info or a mangled name for "Q",including using it as an argument to typeid, thowing an exception of atype involving "Q", or declaring a template to take a parameter of atype involving "Q". (We could relax some of these restrictions infuture, if we add mangling support for attributes.)


A variable declaration involving attributes, like:

  __attribute__((...)) S v;

is treated as syntactic sugar for:

  typedef __attribute__((...)) S T;
  T v;

where T is some invented type name different from all others in the program.

For example given:

  __attribute__((packed)) S v;

the type of "&v" is "__attribute__((packed)) S *", and cannot be passedto a function expecting an "S*", but can of course be passed to afunction expecting an "__attribute__((packed)) S *", or a typedef forsuch a type.


Thoughts?

--
Mark Mitchell
CodeSourcery
[EMAIL PROTECTED]
(650) 331-3385 x713

Proposed semantics for attributes in C++ (and in C?)

Reply via email to