Hi all, Thanks everybody for your comments on this topic. Our initial motivation for doing that is to simplify RPython by getting rid of the RPython unicode type. I think that the outcome of these mails is that there is no single obvious answer as to whether the change would benefit or hurt Python applications. The benefits in reduced memory usage might in some applications win or loose to the higher cost of indexing.
I personally guess that most applications would benefit marginally, with a few special cases that loose importantly. This would not be a very nice outcome... The design of the index cache is crucial to minimize that... On the other hand, maybe some applications doing I/O on utf-8 strings might benefit a lot. Once we have a good design, the only thing we can do is try it, and measure the results on a range of applications. A bientôt, Armin. _______________________________________________ pypy-dev mailing list pypy-dev@python.org https://mail.python.org/mailman/listinfo/pypy-dev