Paolo,
> > [=e=] to match "e" as well as accented versions like é, è and ê).
> > That is the one feature that you get with glibc, and that you would
> > sacrifice when building --with-included-regex.
>
> I agree. It's up to distros to choose, of course.
If you are on the point of sacrificing a glibc feature in many programs,
then IMO you should first talk with the glibc people to see what alternative
they can offer.
> It's strictly about gawk/grep/gnulib; no need to involve
> glibc from the beginning.
I disagree. It must involve glibc.
It is probably futile to ask Ulrich Drepper to change how [a-z] is interpreted
by default. But what would gnulib need so as to implement our "desired"
behaviour? As far as I understand, you want to keep the interpretation of
[=e=] in the POSIX + glibc way, but change the interpretation of [a-z]?
Then, what do we need from glibc?
- Do we need a RE_RANGES_IGNORE_LOCALES flag, like Arnold proposed?
- Do we need an API that allows us to access the collation elements?
(Or is strcoll and wcscoll sufficient?)
Bruno
--
In memoriam Johanna Kirchner <http://en.wikipedia.org/wiki/Johanna_Kirchner>