[Python-ideas] General methods

Serhiy Storchaka Fri, 08 May 2020 12:49:26 -0700

Methods in Python are defined as functions in the class namespace. Whencall the method of the object, the function will be called with theobject as the first argument. And furthermore, unbound methods can becalled with passing self as the first argument. For example,str.upper('abc') returns 'ABC'. So class can be considered as anamespace for functions related to objects of the specified type.

For methods defined in Python you can pass arbitrary object as self. Butin methods defined in C it should be an instance of the class in whichthe method was defined. On one hand, it is very convenient -- you cab besure that self is binary compatible with the specified class. On otherhand, it restricts you.

I propose to add the METH_GENERAL flag, which is applicable to methodsas METH_CLASS and METH_STATIC (and is mutually incompatible with them).If it is set, the check for the type of self will be omitted, and youcan pass an arbitrary object as the first argument of the unbound method.


I have several use cases for this.

1. Bytes and bytearray methods.

Bytes and bytearray has a lot of of sequence-like and string-likemethods. They also implement the buffer protocol. There are otherobjects which implement the buffer protocol: memoryview, BytesIO,array.array, mmap.mmap, ctypes arrays, NumPy arrays, but they lack mostof these methods. To use these methods (for example index()) you need tocopy the content to a bytes or bytearray, that invalidates the purposeof the buffer protocol which was designed to avoid copying of binarydata. With general methods we can make bytes.index() be applicable toany object which supports the buffer protocol.

bytes.index() and bytearray.index() will be equivalent, but thedifference between bytes.split() and bytearray.split() will be in theresult type.


2. Set methods.

Some set methods accept arbitrary number of arguments and accept notonly sets, but any iterables. Sou you can get a union of two lists forexample:


>>> set().union([1, 2], [2, 3])
{1, 2, 3}

It does work because a union with an empty set is a no-op. This trickdoes not work with other methods. You have to convert the first iterableto set explicitly.


>>> set([1, 2]).symmetric_difference([2, 3])
{1, 3}

With general methods we can make unbound set methods accepting arbitraryiterables and convert the first one to set implicitly (or avoid creatinga set if it is possible). We could use set.symmetric_difference([1, 2],[2, 3]).

Maybe there are other use cases. But I do not suggest making all methodsof builtin types general: only if there is a more general protocol (asthe buffer protocol or the iterator protocol) and there is a profit ofusing such protocol without creating an object of the corresponding typeexplicitly. For example, I think that str methods should not be general,despite the fact that any object can be converted to string. Implicitconversions to str does not have profit and may hide bugs. Other example-- float.as_integer_ratio() should not accept int, despite the fact thatmost function which accept float implicitly convert int to float. It isa lossy conversion for large integers, and the loss will affect the result.

I used the term "general methods" to name this feature, but if there arebetter proposition I will use it.

_______________________________________________
Python-ideas mailing list -- python-ideas@python.org
To unsubscribe send an email to python-ideas-le...@python.org
https://mail.python.org/mailman3/lists/python-ideas.python.org/
Message archived at 
https://mail.python.org/archives/list/python-ideas@python.org/message/Z4JVVNINPRY4RA7ZFOYOTOVZEVGE7DAZ/
Code of Conduct: http://python.org/psf/codeofconduct/

[Python-ideas] General methods

Reply via email to