Re: [Python-Dev] PEP 393 review

Stefan Behnel Thu, 25 Aug 2011 11:49:30 -0700

"Martin v. Löwis", 24.08.2011 20:15:

- issues to be considered (unclarities, bugs, limitations, ...)

A problem of the current implementation is the need for callingPyUnicode_(FAST_)READY(), and the fact that it can fail (e.g. due toinsufficient memory). Basically, this means that even something as trivialas trying to get the length of a Unicode string can now result in an error.

I just noticed this when rewriting Cython's helper function that searches aunicode string for a (Py_UCS4) character. Previously, the entire functionwas safe, could never produce an error and therefore always returned aboolean result. In the new world, the caller of this function must checkand propagate errors. This may not be a major issue in most cases, but itcan have a non-trivial impact on user code, depending on how deep in a callchain this happens and on how much control the user has over the call chain(think of a C callback, for example).

Also, even in the case that there is no error, the potential need to buildup the string on request means that the run time and memory requirements ofan algorithm are less predictable now as they depend on the origin of theinput and not just its Python level string content.

I would be happier with an implementation that avoided this by alwaysinstantiating the data buffer right from the start, instead of carryingonly a Py_UNICODE buffer for old-style instances.


Stefan

_______________________________________________
Python-Dev mailing list
[email protected]
http://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: 
http://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com

Re: [Python-Dev] PEP 393 review

Reply via email to