[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-05 Thread Roundup Robot
Roundup Robot added the comment: New changeset 5b9ffea7e7c3 by Victor Stinner in branch 'default': Issue #25301: Fix compatibility with ISO C90 https://hg.python.org/cpython/rev/5b9ffea7e7c3 -- ___ Python tracker _

[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-05 Thread Roundup Robot
Roundup Robot added the comment: New changeset 3152e4038d97 by Victor Stinner in branch 'default': Issue #25301: The UTF-8 decoder is now up to 15 times as fast for error https://hg.python.org/cpython/rev/3152e4038d97 -- nosy: +python-dev ___ Python t

[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-05 Thread STINNER Victor
STINNER Victor added the comment: I pushed my optimization. I close the issue. -- resolution: -> fixed status: open -> closed ___ Python tracker ___

[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-04 Thread STINNER Victor
STINNER Victor added the comment: Results of the microbenchmark on the UTF-8 decoder. As expected, performances on valid UTF-8 is unchanged, which was an important goal for me. Decoding with error handlers optimized by the patch are *much* faster. backslashreplace is still slow, because I did

[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-04 Thread STINNER Victor
Changes by STINNER Victor : Added file: http://bugs.python.org/file40671/bench.py ___ Python tracker ___ ___ Python-bugs-list mailing list Uns

[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-02 Thread STINNER Victor
STINNER Victor added the comment: Here is a first patch. It is written to keep best performances for valid UTF-8 encoded string, but speedup strings with a few undecodable bytes. -- keywords: +patch Added file: http://bugs.python.org/file40663/utf8_decoder.patch ___

[issue25301] Optimize UTF-8 decoder with error handlers

2015-10-02 Thread STINNER Victor
New submission from STINNER Victor: The issue #24870 optimized the ASCII decoder with error handlers: New changeset 3c430259873e by Victor Stinner in branch 'default': Issue #24870: Optimize the ASCII decoder for error handlers: surrogateescape, https://hg.python.org/cpython/rev/3c430259873e We