I know that there are some combining characters, and a lot of base
characters. But, is there any way to use a base character as a
combining character? Please help me!
- Michael Norton (a.k.a. Flarn)
E-mail address: [EMAIL PROTECTED]
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
On
Behalf Of Flarn
I know that there are some combining characters, and a lot of base
characters. But, is there any way to use a base character as a
combining character? Please help me!
Not in plain text. Can you explain precisely what you
Dear Flarn,
I'm not sure exactly what you're trying to do: your question is a bit sketchy.
Most Unicode characters are base characters, in that they do not combine with
the character preceding them in the character stream based on their character
properties. Some characters are combining
From: Jony Rosenne [EMAIL PROTECTED]
One of the problems in this context is the phrase original meaning. What
we have is a juxtaposition of two words, which is indicated by writing the
letters of one with the vowels of the other. In many cases this does not
cause much of a problem, because the
From: Addison Phillips [wM] [EMAIL PROTECTED]
For example, Dutch sometimes treats the sequence ij as a single letter
(it turns out that there are characters for the letter 'ij' in Unicode
too, but they are for compatibility with an ancient non-Unicode character
set). Software must be modified
From: Peter Kirk [EMAIL PROTECTED]
I don't want to go along with Philippe entirely on this, but surely he
must be right on this last point. Formally, Unicode is effectively the
agent of just one national body in this decision-making process.
To be honest, Peter, I never said that Unicode was a
At 04:23 PM 11/26/2004, Peter Kirk wrote:
As I understand it (and I asked for confirmation of this but have not
received it), according to the current version of UAX #14 there is no
break opportunity between SPACE and NBSP, because rule LB11b precedes rule
LB12, although there is a note Many
At 11:13 AM 11/26/2004, Philippe Verdy wrote:
Note however that the ZWJ prohibits breaking, despite in French there's a
possible hyphenation at the first occurence, where it is also a syllable
break, but not for the second occurence that occurs in the middle of the
second syllable.
None of the
Philippe Verdy scripsit:
For this reason, Dutch will need a distinct ij
letter, coded as a single character, and with its own capitalization rules
(the uppercase or titlecase form of ij will be the single letter IJ,
not two letters and not Ij; also there exists cases where diacritics can
I'm not the one that proposed encoding a AE ligature with A+ZWJ+E. I just
spoke about cases like true typographical ligatures like ffi. I do know
that AE or ae in French is better encoded with their distinct unique code,
even if French consider this letter as two letters (which may justify the
From: John Cowan [EMAIL PROTECTED]
the need to encode Dutch
ij as a single character, which is neither necessary nor practical.
(U+0132 and U+0133 are encoded for compatibility only.) In cases where
ij is a digraph in Dutch text, i+ZWNJ+j will be effective.
I suppose you wanted to speak about the
Title: RE: My Querry
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED]] On Behalf Of Mark E. Shoulson
Sent: Tuesday, November 23, 2004 8:43 PM
Why is it that even simple questions asked about
straightforward aspects of Unicode somehow mutate into
There is nothing straightforward
Can you please give me a list of all the ligatures available? Thanks!
- Michael Norton (a.k.a. Flarn)
E-mail address: [EMAIL PROTECTED]
Can you please give me a list of all the ligatures available? Thanks!
- Michael Norton (a.k.a. Flarn)
E-mail address: [EMAIL PROTECTED]
Jony Rosenne wrote:
-Original Message-
From: Doug Ewell [mailto:[EMAIL PROTECTED]
Sent: Friday, November 26, 2004 11:28 PM
To: Unicode Mailing List
Cc: Jony Rosenne; Peter Kirk
Subject: Re: No Invisible Character - NBSP at the start of a word
Jony Rosenne rosennej at qsm dot co dot il
Encoding it twice.
For example the circumflex accent is U+005E, the combining circumflex accent is
U+0302.
If you want to use a char as combining char while it yet exists in unicode only
as a base char you might want to try to propose the
combining char as a new char to be added to unicode.
Jony Rosenne wrote:
Jony, what do you think plain text is? Why should the
arrangement of text on a page as a
marginal note be considered any differently from text
anywhere else *in its encoding*? Are
you suggesting that Unicode is only relevant to ... what?
totally unformatted text in a
text
Mark E. Shoulson wrote:
Well, that's the difference under discussion. The plain text would
seem to be either the qere or the ketiv (but not the combined blended
form), since each of those is somewhat sensible.
Is there some place in the standard where it says text must be sensible?
JH
--
Tiro
.
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] Behalf Of Flarn
Sent: 20041127 15:46
To: [EMAIL PROTECTED]
Subject: Ligatures
Can you please give me a list of all the ligatures available? Thanks!
- Michael Norton (a.k.a. Flarn)
E-mail address: [EMAIL
Hopefully not adding 37 pages...
Michael Norton (a.k.a. Flarn) flarn2003 at megapipe dot net wrote:
Can you please give me a list of all the ligatures available? Thanks!
If by available you mean separately encoded in precomposed form, you
could start by checking the online, definitive Unicode
At 01:26 PM 11/27/2004, Philippe Verdy wrote:
But it's true that the United States have delegated several times their
official international representation to the Unicode Concertium, acting on
behalf of the US government for some decisions or some limited domains
(this is valid because Unicode
At 07:44 PM 11/27/2004, Doug Ewell wrote:
The problem, as Addison pointed out, is that if you use these forms in
text, most searching and sorting operations will fail to recognize them.
That's not the only problem. In some languages other ligatures, such as
fj might be as commonly needed as fi -
At 04:58 PM 11/27/2004, John Hudson wrote:
Mark E. Shoulson wrote:
Well, that's the difference under discussion. The plain text would
seem to be either the qere or the ketiv (but not the combined blended
form), since each of those is somewhat sensible.
Is there some place in the standard where
23 matches
Mail list logo