I am not at all secure about how the standard GNU utilities will handle
non-ascii characters. For example, 'wc -c', just counts bytes. True,
the man page talks about bytes, not characters, but I am still left
uncomfortable. Then there are the dozens of bash, python, and perl
scripts that I have
n...@dad.org writes:
I am not at all secure about how the standard GNU utilities will handle
non-ascii characters. For example, 'wc -c', just counts bytes. True, the man
page talks about bytes, not characters, but I am still left uncomfortable.
GNU wc(1) has -m for that:
% echo $LANG
Hi,
Norm wrote:
I am not at all secure about how the standard GNU utilities will
handle non-ascii characters. For example, 'wc -c', just counts
bytes.
Christian has pointed out -c has remained bytes, --bytes is a synonym,
because otherwise too many things would break, and that -m has been
If you display that message with show in an UTF-8-capable terminal, what does
it display? (Please forgive the terrible formatting; I'm replying via iPad.)
~Chad
On Thu, Jul 17, 2014 at 12:23 PM, Michael Richardsonm...@sandelman.ca, wrote:
Ken Hornstein k...@pobox.com wrote:
I
In message 201407161843.s6gihmmx081...@shell0.rawbw.com, n...@dad.org writes:
Ken Hornstein k...@pobox.com writes:
I hate to be pedantic (okay, I don't really), but wouldn't the Tablets
of Stone have been written in a Hebrew script,
No. They were not. Before the advent of Mime, that came with
Also, I do not think you could accurately represent the Principia without
some of the mathematical symbols available in Unicode. I mean, it's hard to
do a\xB2 + b\xB2 = c\xB2,
So ... this brings up something where we didn't do a good job.
I sent out my original reply to Norm's email with the
ken wrote:
Also, I do not think you could accurately represent the Principia without
some of the mathematical symbols available in Unicode. I mean, it's hard
to
do a\xB2 + b\xB2 = c\xB2,
So ... this brings up something where we didn't do a good job.
I sent out my original
the copy i received was charset utf-8, encoding base64. no q-p to be
seen.
Yeah, mailman does that, for reasons I do not understand. If you look
at the raw mailbox archives on lists.gnu.org, you can see the message before
it gets processed by mailman:
ftp://lists.gnu.org/nmh-workers/2014-07
Ken Hornstein k...@pobox.com wrote:
I sent out my original reply to Norm's email with the line:
a² + b² = c²
And it rendered in mh-e, under GNU emacs as a? + b? = c?.
Which means that this is how I quoted it.
Not the same problem, but definitely part of the equation...
--
]
Ken Hornstein k...@pobox.com writes:
I'd like to understand how that happened so we can
fix it in the future (I am guessing maybe this happened in your editor).
Norm, which editor are you using to compose/reply to email? Are you
using anything like replyfilter? (You would have had to
7. Stored the result in UTF-8.
I think this the problem. What you sent out was definitely NOT UTF-8, but
ISO-8859-1. Which makes me think your editor, or maybe something before
the text got to your editor, was the problem.
UTF-8 has the virtue that it represents ASCII characters in ASCII. But
Ken Hornstein k...@pobox.com wrote:
a² + b² = c²
And it rendered in mh-e, under GNU emacs as a? + b? = c?. Which means
that this is how I quoted it.
That was a display problem on your end; the message you sent out was in
UTF-8 and the characters were correct. I
Ken Hornstein k...@pobox.com wrote:
I upgraded in theory, I should be running the latest... All I'm
saying is that there are still additional places where the encoding
can get lost/confused; it's a general Unix problem in my opinion...
I understand; I'm just trying to
n...@dad.org wrote:
Ken Hornstein k...@pobox.com writes:
I'd like to understand how that happened so we can
fix it in the future (I am guessing maybe this happened in your editor).
Norm, which editor are you using to compose/reply to email? Are you
using anything like
(Note. This Email did not require any copying or pasting)
Ken Hornstein k...@pobox.com writes:
7. Stored the result in UTF-8.
I think this the problem. What you sent out was definitely NOT UTF-8, but
ISO-8859-1. Which makes me think your editor, or maybe something before
the text got to your
1. When my editor stores characters in UTF-8, they are sometimes not the
characters I see it displaying.
That's hard for me to answer ... I guess it depends on what you see at
your end.
2. My Editor is not storing in UTF-8.
I believe this to be true. Well, you don't say exactly what happens
I take back back what I said, earlier about the default format that my editor
stored nmh bound files in. A few weeks, ago, I did change the default to
UTF-8. But as of, Dec 14 2012, for files whose names are all digits and are in
in my draft folder the default is overridden to ISO-8859-1. I don't
I take back back what I said, earlier about the default format that my editor
stored nmh bound files in. A few weeks, ago, I did change the default to
UTF-8. But as of, Dec 14 2012, for files whose names are all digits and are in
in my draft folder the default is overridden to ISO-8859-1. I don't
ASCII was good enough for my father and his father before him, and his father
before him. The tablets that Moses brought down from Mount Sinai, were in
ASCII, as were Newton's Principia, the Magna Carta, and the United States
Declaration of Independence.
MIME is a Communist, Fascist, Perverse,
n...@dad.org wrote:
ASCII was good enough for my father and his father before him, and his father
before him. The tablets that Moses brought down from Mount Sinai, were in
ASCII, ...
but imagine how much more fun they'd have been, with pictures
and an audio soundtrack. ;-)
On Wed, Jul 16, 2014 at 12:15 PM, n...@dad.org wrote:
ASCII was good enough for my father and his father before him, and his father
before him. The tablets that Moses brought down from Mount Sinai, were in
ASCII, as were Newton's Principia, the Magna Carta, and the United States
Declaration
ASCII was good enough for my father and his father before him, and his father
before him. The tablets that Moses brought down from Mount Sinai, were in
ASCII, as were Newton's Principia, the Magna Carta, and the United States
Declaration of Independence.
I hate to be pedantic (okay, I don't
Hi Earl,
I have the habit of always converting HTML emails I received into
plain text when I reply
Me too! :-)
I did not know Pythogorous wrote in English. If in ASCII, it would
be:
Ignoring the malformed octal... ;-)
$ egrep '^( +[0-9]{3})+$' ~/mail/inbox/11661 |
tr -s ' '
23 matches
Mail list logo