Please don't confuse Inada Naoki's benchmark results with the effect PEP
649 would have on a real-world codebase. His artificial benchmark
constructs a thousand empty functions that take three parameters with
randomly-chosen annotations--the results provide some insights, but they
are not directly applicable to reality.
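For readers who haven't seen the benchmark, here is a rough sketch of the kind of synthetic module it generates--a thousand empty, annotated functions. This is illustrative only; the names and structure are my guesses, not Inada Naoki's actual benchmark code.

```python
import random

# Hypothetical pool of annotation types to choose from.
TYPES = ["int", "str", "float", "bytes", "bool"]

def make_module_source(n_funcs=1000):
    """Generate source for n_funcs empty functions, each with three
    randomly-annotated parameters."""
    lines = []
    for i in range(n_funcs):
        a, b, c = (random.choice(TYPES) for _ in range(3))
        lines.append(f"def f{i}(a: {a}, b: {b}, c: {c}) -> None: pass")
    return "\n".join(lines)

source = make_module_source()
```

Compiling such a module under PEP 563 vs. PEP 649 semantics is what produces the code size / memory / import time deltas quoted below--but note that nearly all of this module *is* annotations, which is exactly why the percentages are so extreme.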
PEP 649's effects on code size / memory / import time are contingent on
the number of annotations and the number of objects annotated, not on
the overall code size of the module. Expressing the results as
percentages of the whole module, and suggesting that Python users would
see the same numbers with real-world code, is highly misleading.
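To make that point concrete, here is a small sketch (using current default, eagerly-evaluated annotation semantics--not either PEP's implementation) showing that annotation storage attaches per annotated object; a function with no annotations carries none at all:

```python
# Annotated function: annotation objects are stored on the function.
def greet(name: str, times: int) -> str:
    return name * times

# Unannotated function: no annotation storage whatsoever.
def plain(a, b):
    return a + b

# Under today's default semantics the values are real objects; under
# PEP 563 they would be strings, and under PEP 649 they would be
# computed lazily. In every scheme the cost scales with the number of
# annotated objects, not with the module's total code size.
print(greet.__annotations__)  # {'name': <class 'str'>, 'times': <class 'int'>, 'return': <class 'str'>}
print(plain.__annotations__)  # {}
```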
I too would be interested to know the effects PEP 649 would have on a
real-world codebase currently using PEP 563, but AFAIK nobody has
reported such results.
//arry/
On 4/16/21 11:05 AM, Jukka Lehtosalo wrote:
On Fri, Apr 16, 2021 at 5:28 PM Łukasz Langa <luk...@langa.pl
<mailto:luk...@langa.pl>> wrote:
[snip] I say "compromise" because as Inada Naoki measured, there's
still a non-zero performance cost of PEP 649 versus PEP 563:
- code size: +63%
- memory: +62%
- import time: +60%
Will this hurt some current users of typing? Yes, I can name you
multiple past employers of mine where this will be the case. Is it
worth it for Pydantic? I tend to think that yes, it is, since it
is a significant community, and the operations on type annotations
it performs are in the sensible set for which
`typing.get_type_hints()` was proposed.
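(For context, a minimal illustration of `typing.get_type_hints()`, the documented way to resolve annotations regardless of how they are stored--this sketch runs under PEP 563's string-annotation semantics via the `__future__` import; the function itself is made up for the example:)

```python
from __future__ import annotations  # PEP 563: annotations become strings
import typing

def scale(vector: list[float], factor: float) -> list[float]:
    return [x * factor for x in vector]

# Raw __annotations__ are unevaluated strings under PEP 563...
print(scale.__annotations__["factor"])         # 'float'
# ...but get_type_hints() evaluates them back to real objects.
print(typing.get_type_hints(scale)["factor"])  # <class 'float'>
```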
Just to give some more context: in my experience, both import time and
memory use tend to be real issues in large Python codebases (code size
less so), and I think that the relative efficiency of PEP 563 is an
important feature. If PEP 649 can't be made more efficient, this could
be a major regression for some users. Python server applications need
to run multiple processes because of the GIL, and since code objects
generally aren't shared between processes (GC and reference counting
make it tricky, I understand), code size increases tend to be
amplified on large servers. Even having a lot of RAM doesn't
necessarily help, since a lot of RAM typically implies many CPU cores,
and thus many processes are needed as well.
I can see how both PEP 563 and PEP 649 bring significant benefits, but
typically for different user populations. I wonder if there's a way of
combining the benefits of both approaches. I don't like the idea of
having toggles for different performance tradeoffs indefinitely, but I
can see how this might be a necessary compromise if we don't want to
make things worse for any user groups.
Jukka
_______________________________________________
Python-Dev mailing list -- python-dev@python.org
To unsubscribe send an email to python-dev-le...@python.org
https://mail.python.org/mailman3/lists/python-dev.python.org/
Message archived at
https://mail.python.org/archives/list/python-dev@python.org/message/PBJ6MBQIE3DVQUUAO764PIQ3TWGLBS3X/
Code of Conduct: http://python.org/psf/codeofconduct/