Hi Walter,

Walter Alejandro Iglesias wrote on Mon, May 29, 2017 at 06:44:40PM +0200:

> Are those wide char versions of C functions consistent enough to write
> a separate implementation to be loaded when LC_TYPE is set to utf-8?

Sure, you can rewrite the complete shell to use wchar_t * rather
than char *, and if you do that, you can use the new code to handle
ASCII as well, no need to have two copies.  But that would be a
huge effort, even more error-prone than the small, careful adjustments
we are doing now, and would have a number of additional downsides;
among others, losing the ability to handle arbitrary bytes, while
in UTF-8 mode.

For an editor, going wchar_t might be better because having substantial
amounts of UTF-8 in user input is a common case in some files that
people edit.

For a shell, editing strings that contain non-ASCII is not the main
purpose.  Sure, it is nice if the command line is able to handle
strings containing an occasional UTF-8 character.  But the main
purpose of the shell remains to safely input and execute Unix-style
command lines, where non-ASCII characters are a non-essential addition
at best.

Yours,
  Ingo


For more details, see
https://www.openbsd.org/papers/eurobsdcon2016-utf8.pdf

Reply via email to