[issue16311] Use _PyUnicodeWriter API in text decoders

STINNER Victor Wed, 24 Oct 2012 11:40:40 -0700

New submission from STINNER Victor:

Attached patch modifies text decoders to use the _PyUnicodeWriter API to 
factorize the code. It removes unicode_widen() and unicode_putchar() functions.


 * Don't overallocate by default  (except for "raw-unicode-escape" codec), 
enable overallocation on the first decode error (as done currently)
 * _PyUnicodeWriter_Prepare() only overallocates 25%, instead of 100%
for unicode_decode_call_errorhandler()
 * Use _PyUnicodeWriter_Prepare() + PyUnicode_WRITE() (two macros)
instead of unicode_putchar() (function)
 * _PyUnicodeWriter structures stores many useful fields, so we don't
have to pass multiple parameters to functions, only the writer

I wrote the patch to factorize the code, but it might be faster.

----------
files: codecs_writer.patch
keywords: patch
messages: 173695
nosy: haypo
priority: normal
severity: normal
status: open
title: Use _PyUnicodeWriter API in text decoders
type: performance
versions: Python 3.4
Added file: http://bugs.python.org/file27697/codecs_writer.patch

_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue16311>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue16311] Use _PyUnicodeWriter API in text decoders

Reply via email to