On Mon, 21 May 2012, Linda Walsh wrote:
Greg Wooledge wrote:
On Sun, May 20, 2012 at 11:36:35AM -0700, Linda Walsh wrote:
For instance, on HP-UX 10.20, in the en_US.iso88591 locale:
A a ... B b
Meanwhile, on Debian 6.0, in the en_US.iso88591 locale:
a A ... b B
As you can see, the two en_US.iso88591 implementations are not the same.
----
Great!...
So which is correct?
Anyone wanting to reference an upper or lower case range
[a-z] or [A-Z], is gonna hurt from this.
Use the correct references: [:upper:] and [:lower:] or (as I do)
always use LC_ALL=C in your scripts.
My OS uses "en_US.UTF-8".
My OS uses whatever I tell it to (which is C).
You'd think unicode would have something to say about collation
order that wouldn't allow such randomness, but maybe not.
--
Chris F.A. Johnson, <http://cfajohnson.com/>
Author:
Pro Bash Programming: Scripting the GNU/Linux Shell (2009, Apress)
Shell Scripting Recipes: A Problem-Solution Approach (2005, Apress)