On 5/20/2015 9:57 PM, Eric Muller wrote:
On 5/20/2015 7:11 PM, Doug Ewell wrote:
In any event, URLs that point to images would be an awful basis for
an encoding.
I would make an exception for the URL
http://unicode.org/Public/8.0.0/ucd/StandardizedFlags.html.
Eric.
Currently that gives
...@unicode.org] On Behalf Of Richard
Wordingham
Sent: Wednesday, May 20, 2015 6:08 PM
To: unicode@unicode.org
Subject: Re: Tag characters
On Wed, 20 May 2015 17:15:28 -0700
Asmus Freytag (t) asmus-...@ix.netcom.com wrote:
Have there been any discussions of the flag alphabet? (Signal flags).
It seems
On 6/4/2015 17:03 , Chris wrote:
This whole discussion is about the fact that it would be technically
possible to have private character sets and private agreements that
your OS downloads without the user being aware of it.
The sticky issues are not the questions of how to make available
On 6/4/2015 1:46 AM, William_J_G Overington wrote:
I thought that I would mention it, though I cannot quite at the moment
understand the issue.
I'm long past where I'm sure I understand what the issue is.
:)
A./
On 5/28/2015 2:15 PM, Shawn Steele wrote:
I’m used to them being next to each other. So the entire discussion
seems to be about how to encode a concept vs how to get the shape you
want with existing code points. If you just want the perfect shape,
then maybe an svg is a better choice. If
that unicode has defined.
—
Chris
On Sun, May 31, 2015 at 9:06 PM, Asmus Freytag (t)
asmus-...@ix.netcom.com mailto:asmus-...@ix.netcom.com wrote:
reading this discussion, I agree with your reaductio ad absurdum
of infinitely nested HTML.
But I think you are onto something with your
John,
reading this discussion, I agree with your reaductio ad absurdum of
infinitely nested HTML.
But I think you are onto something with your hypothetical example of the
subset that works in ALL textual situations.
There's clearly a use case for something like it, and I believe many
On 7/3/2015 12:50 PM, Doug Ewell wrote:
Leo Broukhis leob at mailcom dot com wrote:
What I don't like about PRI #399 is its
proposing to use default-
ignorable characters. On a non-vexillology-aware platform, I'd
like
On 7/6/2015 9:42 AM, Steve Swales
wrote:
Or a flag inversion modifier… recently I discovered that the Philippines flag, for example, has a special meaning (we are at war) when inverted. Just a thought.
Rather than modifiers, I think a more natural
On 7/6/2015 8:18 AM, Doug Ewell wrote:
Most platforms display unknown printable characters as white
rectangles with hex digits in them.
In Doug's message, I saw a rectangle with 01F in the upper row, and
3F3 in the lower row.
This is a handy
On 5/23/2015 5:41 AM, baskar raj wrote:
Hi,
Is it possible to get a Unicode for a new symbol, designed for a
commonly used word, For Example lets say and . which can be used in
conjunction with numbers or letters. so is it possible to file
application seeking Unicode
Generally, there
[mailto:unicode-boun...@unicode.org] *On Behalf Of
*Asmus Freytag (t)
*Sent:* Wednesday, May 20, 2015 10:15 PM
*To:* Eric Muller; unicode@unicode.org
*Subject:* Re: Tag characters
On 5/20/2015 9:57 PM, Eric Muller wrote:
On 5/20/2015 7:11 PM, Doug Ewell wrote:
In any event, URLs
On 8/18/2015 6:18 AM,
alexwei...@alexweiner.com wrote:
"PUA"?
Private use area.
A./
PS: "underbar" is Swedish for "wonderful". Go figure.
Original Message
Subject: RE: APL Under-bar Characters
From:
On 8/16/2015 6:57 PM,
alexwei...@alexweiner.com wrote:
Bug APL,
After much discussion with The Unicode
Consortium Mailing List,
Can we use this to give the characters
unique names?
On 8/22/2015 9:35 AM, Julian Bradfield
wrote:
There is no inherent meaning to the
order of codepoints, it's just convenience.
And for that reason, we have property files to
explicitly give the properties rather than asking the user to "glean"
been required
to learn, and that few people actually do learn to any great extent.
Peter
*From:*Unicode [mailto:unicode-boun...@unicode.org] *On Behalf Of
*Asmus Freytag (t)
*Sent:* Monday, August 3, 2015 12:01 PM
*To:* unicode@unicode.org
*Subject:* Re: Emoji characters for food allergens
On 7/30/2015 12:07 PM, Andrew West
wrote:
On 30 July 2015 at 18:07, Marcel Schneider charupd...@orange.fr wrote:
I'll try to respond to all,
Please don't.
Andrew
What Andrew said.
A./
On 7/30/2015 10:07 AM, Marcel Schneider
wrote:
it might not be enough to avoid allergens,
we should pay attention to the presence of palm oil because of
the useless devastation of primates' habitats while enough
fallow land exists in a concerned
I'm sorry to really disagree with this little understandable criticism of laundry symbols. The most encountered of the care tags are self-explaining, as the washing and iron temperature limits or discouraging. The other symbols mainly concern dry cleaning and laundry
When it comes to symbolic and / or pictorial representations, there
are roughly three kinds. I'm going to use "symbol" here for both
symbols and emoji when the latter are used in contexts where there
also are (or could be) symbols - and I don't limit symbol to the
subset
On 8/11/2015 11:51 AM, Marcel Schneider
wrote:
All that, and some more, leads me to the
conclusion that when Windows was built, there was often not
enough time to write up the documentation;
Marcel,
please consider that this musing
On 8/8/2015 6:26 AM, Marcel Schneider
wrote:
a useless worsening of the usability and of the usefulness of a product.
Quote of the day.
A./
On 8/8/2015 1:57 PM, Marcel Schneider
wrote:
Now that I know Andrew is the PM for
MSKLC ¹,
Probably Mr Glass wasn't Mr Kaplan's boss, so he is to
overtake a legacy without having been involved in its
generating. I didn't well notice
On 8/14/2015 11:50 AM, Ken Whistler
wrote:
Garth,
The glyphs for the chess symbols in the 26XX block date from
Unicode 3.0. Most of the symbols redesigned for the Unicode 3.0
charts were done by John M. Fiscella. (See the font
On 8/14/2015 1:26 PM, Garth Wallace
wrote:
Would it be acceptable if I extracted the font from the code chart PDF
Extraction of chart font is never acceptable and violates
the Terms of Use. No exceptions.
and used it as the
On 7/21/2015 2:55 PM, Dreiheller,
Albrecht wrote:
Of course, there are
On 10/22/2015 9:54 AM, Rick McGowan wrote:
Personally, I think you're getting ahead of yourself. First, you
should demonstrate that you have done research and produced results
that at least some people find so useful and important that they are
eager to implement the findings. Then, once you
On 10/16/2015 12:18 AM, Marcel
Schneider wrote:
On Thu, 15 Oct 2015 15:46:46 -0700, Leo
Broukhis wrote:
> Along the same lines, should I be able to change
my last name
> officially to Ƃpyxᴎc? (NB all
When it comes to methods operating on
buffers there's always the tension between viewing the buffer as
text elements vs. as data elements. For some purposes, from error
detection to data cleanup you need to be able to treat the buffer
as data elements. For many
On 9/1/2015 9:37 AM, Doug Ewell wrote:
I have no idea whether my proposal is more or less serious, or more or
less likely to be adopted, than the original.
Well, you didn't consider that each style of
beer may be served in a different style glass. :)
On 9/3/2015 9:15 AM, Shawn Steele
wrote:
If
we have a bunch of ingredients emoji, then do yeast + grain
+ hops emoji combine into beer emoji?
Only if you add heat, time and water
Daniel Bünzli wrote:
Since I implement parts of the Unicode standard I'm interested in
keeping in touch with discussions about the standard and its evolution
from a technical point of view.
I'm however not interested in the encoding point of view and
On 9/6/2015 10:10 PM, Martin J. Dürst
wrote:
Hello
Ken,
You write "The bitcoin sign and baht symbol are two unrelated
symbols that have some visual similarity.", but don't really give
any supporting information for that claim.
On 9/6/2015 11:23 PM, Richard
Wordingham wrote:
On Thu, 03 Sep 2015 09:32:42 -0700
Rick McGowan wrote:
A proposed update to the LDML specification (UTS #35) will be
available for review as of Monday, September 7 at 06:00 GMT. The
Mark,
it is implied the String Range formulation is a compact form.
Can you prove that it doesn't create any set of strings that can't
be specified in other ways (other than full enumeration of the
strings?).
What about set operations on sets with
On 9/3/2015 2:09 AM, Marcel Schneider
wrote:
Asmus already told us that there'll be no
soy beans emoji, by lack of iconicity. However, could there be a
generic BEANS emoji
A coffee bean has a very recognizable shape,
and I personally would
On 8/25/2015 12:07 PM, Richard Wordingham wrote:
On Tue, 25 Aug 2015 09:54:29 +0100 (BST)
William_J_G Overington wjgo_10...@btinternet.com wrote:
Richard Wordingham wrote:
On Mon, 24 Aug 2015 11:00:32 +0100 (BST)
William_J_G Overington wjgo_10...@btinternet.com wrote:
Looking at the
On 9/10/2015 11:04 AM, Peter Constable
wrote:
I was having an offline discussion with
someone regarding certain topics that may show up on this list
on occasion, and the question came up of what evidence we
On 9/15/2015 6:45 PM, Daniel Bünzli
wrote:
Hello,
Is there any guidance on how to combine the information given by grapheme clusters and the east asian width property to do fixed-width layouts in terminal emulators ?
For example if we have:
U+AC01 ( 각 )
On 9/29/2015 8:40 PM, Sean Leonard
wrote:
I like the definition of "character" in ASCII:
3.3 Character. A member of a set of elements used for the
organization, control, or representation of data.
This, by the way, is the
On 10/3/2015 8:15 AM, Sean Leonard
wrote:
Thanks.
Well, "DIS 10646" is the Draft International Standard,
particularly Draft 1, from ~1990 or ~1991. (Sometimes it might
have been called 10646.1.) Therefore it would likely only be in
On 10/4/2015 6:02 AM, Richard
Wordingham wrote:
In the absence of a specific tailoring, is the combination of a lone
surrogate and a combining mark a user-perceived character? Does a lone
surrogate constitute a user-perceived character?
In an editing
On 10/4/2015 12:38 PM, Richard
Wordingham wrote:
On Sun, 4 Oct 2015 10:50:43 -0700
Markus Scherer wrote:
I would not spend any time specifying intricate rules for unpaired
surrogates in 16-bit strings, or out-of range values
On 10/4/2015 5:30 AM, Sean Leonard
wrote:
On
10/3/2015 12:28 PM, Asmus Freytag (t) wrote:
On 10/3/2015 8:15 AM, Sean Leonard wrote:
Thanks.
Well, "DIS 10646" is the Draft Internationa
On 10/4/2015 2:35 PM, Richard
Wordingham wrote:
However my opinion is that ᒏ�ᒺ (using U+FFFD substitution) gives 2
> grapheme clusters, I would prefer a solution that gives 3 grapheme
> clusters, as if the lone surrogate was a line-break control, so that
On 10/4/2015 4:14 PM, Richard
Wordingham wrote:
respect to what to erase or undo.
For sequences that belong to a given language, you can pick the
behavior that makes most sense in them, but for lone surrogates, by
definition you are dealing
On 10/6/2015 7:31 AM, Julian Bradfield
wrote:
All browsers I use display spaces in input boxes, and put blobs for
hidden fields. Do you have evidence for broken input fields?
Network keys. That interface seems to consistently give people a
choice
On 10/6/2015 5:24 AM, Sean Leonard
wrote:
And,
why did Unicode deem it necessary to replicate the C1 block at
0x80-0x9F, when all of the control characters (codes) were equally
reachable via ESC 4/0 - 5/15? I understand why it is desirable to
align
On 10/5/2015 7:51 AM, Philippe Verdy wrote:
Not silently ! Even if this removal is required to go on editing, this
must be notified to the user as it may occur in unedited parts of the
file (and it may be the sign that the document is not fully plain
text, so the user should not save the
On 12/9/2015 9:52 AM, Gerrit Ansmann
wrote:
After
the German spelling reform in 1996, "ß" then became a letter of
its own, and words containing the letter "ß" are no longer
equivalent to words containing an "ss" combination instead of
On 12/9/2015 3:49 PM, Hans Meiser
wrote:
Yes, they do it wrong because (1) they don't know better and (2) they let their software convert lower case text into upper case (a feature nearly every typographic software provides).
Yet, if we let the majority of
On 12/9/2015 1:11 PM, Michael Everson
wrote:
On 9 Dec 2015, at 20:57, Gerrit Ansmann wrote:
Proposal: Shouldn't the glyph be amended to match the natural language?
Nothing of this is really
Bing is pathetic. It treats the letter as if it didn't exist
Google maps it to the lowercase, neither allows you to find sites
that use just that character.
A./
On 12/9/2015 11:57 PM, "Jörg Knappen"
wrote:
Since the captial sharp s is easily available to the
public, I see it popping up everywhere in
German publications, mostly in an all caps environment. I
have a small
On 1/4/2016 10:41 AM, Michael Everson
wrote:
Certainly it does look more like a very common variant of “tau”
than “pi”
Variant of uppercase tau?
A./
On 1/4/2016 12:15 AM, "Jörg Knappen"
wrote:
Here is a report of a rather strange beast occurring in
historical math printing (work of C. F. Gauß) in thw 19th
century:
my suggestion of pi.
Raymond
From: Asmus Freytag (t)
Se
On 1/1/2016 6:00 AM, Andre Schappo
wrote:
Julian,
We have very different POVs on this topic. You raise a number of issues which would take me many many thousands of words to properly discuss. I will attempt a summary discussion of some of the issues.
① IT i18n
question is a variant of the greek letter tau (capital or
lowercase).
The identification to τ is from Asmus Freytag, not me.
Mine is a concurring opinion based on ME's suggestion, but corroborated,
in my view, by the systematic notational conventions and not merely
informed by visual sim
character sets is beyond a standard
curriculum, except perhaps History of Computing or Digital Archaeology :)
Teach them right the first time. They’ll never use a code page.
+1
A./
-Shawn
*From:*Unicode [mailto:unicode-boun...@unicode.org] *On Behalf Of
*Asmus Freytag (t)
*Sent:* January 6
ing on the bunny slope at the UTF-8 ski resort. Slap on your snowboard and practice -- get out there onto the 2-, 3- and 4-byte slopes with the experts!
--Ken
On 1/6/2016 4:09 AM, Andre Schappo wrote:
On 4 Jan 2016, at 16:59, Asmus Freytag (t) wrote:
ASCII sho
On 1/4/2016 7:49 AM, Michael Everson
wrote:
Excellent!
Looks like a candidate character for encoding. I’m sure I have some examples of good font designs for the old character in one of my books.
Admitting that a Greek letter inherently makes more
On 1/4/2016 12:06 AM, "Jörg Knappen"
wrote:
Err... in what respect would this symbol be different
from a CAPITAL GREEK LETTER GAMMA?
--Jörg Knappen
A sans-serif L would not have
On 1/4/2016 6:44 AM, Elizabeth J. Pyatt
wrote:
Like some others on the list, I believe Unicode should be mentioned at different points in a programming curriculum, particularly at the time when ASCII would be taught.
ASCII shouldn't be taught, perhaps?
On 12/28/2015 1:26 PM, Jonathan Coxhead
wrote:
On 2015-12-25 5:43am, Costello, Roger L. wrote:
Hi Folks,
Here is the upside down capital L, pointing to the left:
⅂ - TURNED SANS-SERIF CAPITAL L (U+2142)
On 12/29/2015 4:01 AM, Michael Everson wrote:
On 28 Dec 2015, at 23:48, Asmus Freytag (t) <asmus-...@ix.netcom.com> wrote:
Rather than engage in reflexive ad-hoc unification like this, it would be
useful to find out why U+2142 was disunified from TOP RIGHT CORNER and any
other symbols
On 11/27/2015 11:55 AM, Plug Gulp
wrote:
Hi,
The Unicode standard 8.0 states in chapter 23, section titled "Cursive
Connection and Ligatures"(printed page #814, PDF page #850) that:
"The zero width joiner and non-joiner characters are designed for use
in plain
On 11/27/2015 5:42 PM, Martin J. Dürst
wrote:
On
2015/11/28 04:55, Plug Gulp wrote:
The Unicode standard 8.0 states in chapter
23, section titled "Cursive
Connection and Ligatures"(printed page #814, PDF page #850)
On 11/26/2015 12:10 AM, "Jörg Knappen"
wrote:
I wonder how this concept relates to mathematical
notation, especially the root sign.
It doesn't, for several reasons.
One, because mathematical
On 11/26/2015 2:41 AM, Philippe Verdy
wrote:
However the proposal for these prepended concatenation marks
does not give any hint about how to compute the extent of the
following clusters above/over/below/around which they will apply
(do they
On 11/26/2015 3:08 AM, Philippe Verdy
wrote:
The related definition for extended grapheme
clusters says:
(
CRLF
| Prepend* (
RI-sequence | Hangul-Syllable | !Control )
On 11/26/2015 4:29 AM, Philippe Verdy wrote:
2015-11-26 12:38 GMT+01:00 Asmus Freytag (t) <asmus-...@ix.netcom.com
<mailto:asmus-...@ix.netcom.com>>:
On 11/26/2015 3:08 AM, Philippe Verdy wrote:
The related definition for extended grapheme clusters says:
( CRLF
On 6/6/2016 9:09 AM, Markus Scherer
wrote:
Interesting discussion!
ICU does not support "is" nor "in"
prefixes. I wasn't even aware that UAX #44 loose matching
prescribes "is". ICU just implements what
Just a note: for any living(!) language, it is important that
Unicode not mix scripts, but instead disunify characters based
on script. The reason for that is that in important
implementations, such as URLs (domain names) there are restrictions
that prevent
On 6/11/2016 8:25 PM, Andrew Cunningham wrote:
«If you add a [compatibility] feature to match behavior
> [found] somewhere else [not in the Unicode standard],
> it rarely pays to make that perform "better", because
> it just means it's now different and no longer matches
> [the behavior to which
On 6/10/2016 5:34 PM, Andrew Cunningham
wrote:
There is the logic of how kikakui numbers are encoded
in Unicode and there is the internal logic of the numeral system
itself. They are not necessarily the same.
This statement should be framed!
A./
On 6/23/2016 3:01 PM, Garth Wallace
wrote:
But precedent is for separate WITH LEFT HALF BLACK
and WITH RIGHT HALF BLACK geometric shapes.
Correct - best to not mix mirroring defaults
Also, I'm not sure if
On 2/8/2016 5:47 PM, Michael Everson
wrote:
It’s what I was taught as the scientific romanization for Russian and Slavic in general.
Michael Everson * http://www.evertype.com/
Source?
A./
On 2/8/2016 6:39 PM, Charlie Ruland
wrote:
Am 09.02.2016 schrieb Asmus Freytag (t):
On 2/8/2016 5:47 PM, Michael
Everson wrote:
It’s what I was taught as the scientific romanization for Russian
On 2/9/2016 1:36 PM, Ken Whistler
wrote:
On 2/9/2016 1:23 PM, David Faulks wrote:
Perhaps Unicode could create a ‘default
position’ property for combining characters, and encourage
OpenType and other font engines to adopt
On 2/9/2016 3:01 PM, Ken Whistler wrote:
Just adapt IndicPositionalCategory.txt for Unibook, and you've got
what you need.
I see. Not quite as simple; Unibook needs overrides that are
specifically able to correct bad fonts, not just "dumb" ones. We may
want to honor some part of the
On 2/11/2016 6:05 AM, QSJN 4 UKR wrote:
I can show an example of use both, prime (as soft sign) and apostroph
(hemisoft) in Cyrilic-based phonetic transcription (Orthoepic
Dictionary of Ukrainian, http://padaread.com/?book=84816=6
http://padaread.com/?book=84816=7)
On 2/7/2016 3:09 PM, Frédéric Grosshans
wrote:
In a
few words (more details below), I think this character is actually
used beyond optics should be encoded as an emoji with properties
(and aspect) similar to U+1F441 EYE, with a name like EYE SIDE
On 2/28/2016 9:04 PM, Karl Williamson
wrote:
http://abc27.com/2016/02/27/girl-12-charged-for-threatening-emojis/
"The mother says the girl shouldn’t have been charged."
In civilized countries 12-year-olds would be considered too young to
On 2/28/2016 11:14 PM, Tex Texin wrote:
However, how any of this belongs on the Unicode list is beyond me.
Surely we do not need to comment on every use of emoji that occurs in
the media.
But there you are mistaken, my dear sir!
We are constantly told that the discussions on this list
On 2/29/2016 1:55 PM, Philippe Verdy wrote:
. Well emojis were initially designed to track amotions and form a
sort of new language,
E-moji means "picture-character" in Japanese, has nothing to do (at
first) with emotions.
A./
On 2/14/2016 3:36 PM, David Faulks
wrote:
Hello,
This subject has been discussed before, but I am somehwat uncertain about something:
If the copyleft (reversed ©) symbol was proposed for encoding, with examples (from PDF files) showing it being used in a similar
On 2/14/2016 5:18 PM, Michael Everson
wrote:
On 15 Feb 2016, at 00:53, Asmus Freytag (t) <asmus-...@ix.netcom.com> wrote:
On 2/14/2016 3:36 PM, David Faulks wrote:
Hello,
This subject has been discussed before, bu
On 2/14/2016 7:42 PM, António
Martins-Tuválkin wrote:
On
2016.02.15 00:53, Asmus Freytag (t) wrote:
The key issue is whether this usage is
"established".
You can always make the case that what
On 2/15/2016 9:32 AM, Doug Ewell wrote:
Asmus Freytag wrote:
with the non-standard symbols like the copyleft, there's the desire to
not encode stuff based on "passing activism".
David Faulks wrote:
The samples I
in the discussions here.
A./
Thank You!
On Mon, Feb 15, 2016 at 2:29 PM, Asmus
Freytag (t) <asmus-...@ix.netcom.com>
wrote:
On 2/15/2016 9:32 AM, Doug
Mats,
it is really useful to document (let me amend that to "immensely
useful" to document) the required combinations (perhaps in a
Unicode Technical Note on African Writing Systems). It may even be
useful to ask for these sequences to become "named
On 2/20/2016 9:56 AM, Eli Zaretskii
wrote:
From: Philippe Verdy
Date: Sat, 20 Feb 2016 18:27:41 +0100
Cc: unicode Unicode Discussion
Unless we have case folding tailored by language, you cannot do that based on
On 2/21/2016 8:22 AM, Eli Zaretskii
wrote:
From: "Asmus Freytag (t)" <asmus-...@ix.netcom.com>
Date: Sat, 20 Feb 2016 14:10:04 -0800
What about language-independent character-folding: where in the
Unicode data
usage remains for emotions ; starting in the 1970's these were
ASCII art symbols such as the famous :-)
2016-02-29 23:24 GMT+01:00 Asmus Freytag (t) <asmus-...@ix.netcom.com>:
On 2/29/2016 1:55 PM, Philippe Verdy wrote:
. Well emojis were initially designed to track
On 3/9/2016 7:08 PM, Oren Watson wrote:
I was
surprised to find out that there are gaps in the
Mathematical alphanumeric symbols block (U+1d400 to
u+1d7ff). The gaps are associated with the inclusion of
On 3/10/2016 2:14 PM, Doug Ewell wrote:
Ken Whistler wrote:
NamesList.txt should *not* be data mined.
And yet it was the only Unicode data file utilized by MSKLC.
There are many possible reasons for this approach, which we will
probably
On 3/18/2016 11:11 AM, Garth Wallace
wrote:
The enclosure could also be something else than a circle (or arcs of
> circle): it could be a rectangle, hintable with joiners (like with circles)
> to create an enclosing square, or a rounded rectangle (hintable
On 3/12/2016 10:55 PM, Janusz S. "Bień"
wrote:
In fact, the possibility of reuse in this context probably among the
> unstated rationales for making the information and syntax available in
> the first place.
I understand there is no intention
On 3/14/2016 11:33 AM, Janusz S. Bien wrote:
Quote/Cytat - Doug Ewell (pon, 14 mar 2016, 19:22:14):
Ken Whistler wrote:
The trick is this: the status of annotational data in NamesList.txt
is different than that of normative data like the code points, names,
formal name
901 - 1000 of 1250 matches
Mail list logo