Re: inheritance, multiple inheritance and the weaklist and instance dictionaries

Rouslan Korneychuk Wed, 09 Feb 2011 20:08:17 -0800

On 02/09/2011 08:40 PM, Carl Banks wrote:

I explained why in my last post; there's a bunch of reasons.
Generally you can't assume someone's going to go through the type
structure to find the object's dict, nor can you expect inherited
methods to always use the derived class's type structure (some methods
might use their own type's tp_dictoffset or tp_weakreflist, which
would be wrong if called from a superclass that changes those
values).

Who do you mean by someone? The code is generated by a program. No humanis required to touch it. If it needs to be updated, the program issimply run again with the updated specification file. Thus I can makethose assumptions because I have total control over the code. The onlything I don't have control over is the Python code that imports theextension, but in Python, the user doesn't get to choose how they accessthe weaklist and instance dictionary.

Even if you are careful to avoid such usage, the Python
interpreter can't be sure.  So it has to check for layout conflicts,
and these checks would become very complex if it allowed dict and
weakreflist to appear in different locations in the layout (it's have
to check a lot more).

What is so complex about this? It already uses "obj_instance +obj_instance->ob_type->tp_weaklistoffset". That's all the checking itneeds. It only becomes a problem when trying to derive from two or moreclasses that already have these defined. In such a case the Pythoninterpreter can't deduce what the values of tp_weaklistoffset andtp_dictoffset in the derived type should be, but it doesn't have tobecause my program tells it what they need to be.

I would say you do.  Python's type system specifies that a derived
type's layout is a superset of its base types' layout.  You seem to
have found a way to derive a type without a common layout, perhaps by
exploiting a bug, and you claim to be able to keep data access
straight.  But Python types are not intended to work that way, and you
are asking for trouble if you try to do it.

I'm not really circumventing this system (except for the varyinglocation of the dictionaries. See the explanation below for that).Python allows variable-sized objects. Tuples and strings are variablesized. This allows them to store the data directly in the object insteadof having a pointer to another location in memory. And the objects Igenerate are basically this:


struct MyObject {
    PyObject_HEAD
    storage_mode mode;
    char[x] opaque_data;
};

I use the real type instead of char[] when possible because it will havethe proper alignment but I still treat it like a private hunk of memorythat only my generate code will touch. What I store in opaque_data is upto me. I can store a copy of the wrapped type, or I can store a pointerto it. "mode" specifies what is in opaque_data. A derived type wouldlook like this:


struct MyDerivedObject {
    PyObject_HEAD
    storage_mode mode;
    char[y] opaque_data;
};

Where y >= x. It's still the same layout. All that's left is some wayfor the original object to know what C++ type is stored in opaque_data.I could have used another variable like 'mode', but since there is aone-to-one correspondence between PyObject->ob_type and the type that isbeing wrapped, I can determine the type from ob_type instead.

There is no bug being exploited. The actual implementation is a littledifferent than this, but the principle is the same. I said before thatthe layout varies, but that's only if you consider the contents ofopaque_data, but that is neither Python's nor the user's concern.

I guess there's also no point in arguing that tp_dictoffset and
tp_weakreflist need to have the same value for base and derived types,
since you're rejecting the premise that layouts need to be
compatible.  Therefore, I'll only point out that the layout checking
code is based on this premise, so that's why you're running afoul of
it.

That's not what the Python documentation says. Underhttp://docs.python.org/c-api/typeobj.html#tp_weaklistoffset it says"This field is inherited by subtypes, but see the rules listed below. Asubtype may override this offset; this means that the subtype uses adifferent weak reference list head than the base type. Since the listhead is always found via tp_weaklistoffset, this should not be aproblem." And underhttp://docs.python.org/c-api/typeobj.html#tp_dictoffset it says "Thisfield is inherited by subtypes, but see the rules listed below. Asubtype may override this offset; this means that the subtype instancesstore the dictionary at a difference offset than the base type. Sincethe dictionary is always found via tp_dictoffset, this should not be aproblem."

You claimed in another post you weren't trying to mimic the C++ type
hierarchy in Python, but this line suggests you are.

When did I make that claim? Perhaps you misunderstood me I said "Ikind-of already did. The issue only comes up when multiply-inheritingfrom types that have a different combination of the weaklist andinstance dictionaries. I don't have to support this particular feature."

I was saying I kind-of already did mimic the C++ hierarchy. And when Isaid "this particular feature", I was talking about the thing Idescribed in the immediately preceding sentence, not the C++ type hierarchy.


--
http://mail.python.org/mailman/listinfo/python-list

Re: inheritance, multiple inheritance and the weaklist and instance dictionaries

Reply via email to