2011/6/28 Asmus Freytag <asm...@ix.netcom.com>:
> On 6/28/2011 1:51 AM, Michael Everson wrote:
>>
>> On 28 Jun 2011, at 09:28, Jean-François Colson wrote:
>>
>>> In Times New Roman, which is the default font for MS Word (probably the
>>> best known word processor), the letters “a” and “ɑ” are indistinguishable in
>>> italics.
>>
>> That is a fault of the font.
>
> No, the font does what it's supposed to, which is to give the correct
> rendering of the letter 'a' for use in ordinary text. The problem is in
> Unicode's unification of the generic letter a with the IPA letter 'a', which
> has a restricted glyph range, and, as we now find out, must be treated
> differently when styles are applied.
>
> Encoding a new character is not the answer. However, encoding a standardized
> variation sequence would be the proper answer. Insisting that people have
> control over the font with which text is viewed is to a degree illusory. Not
> recognizing that fact is a weakness in Unicode's 1980's based design in this
> instance.
>
> A standardized variation sequence makes the IPA nature of the IPA 'a' more
> portable, while at the same time cleanly allowing text processing software
> to treat it like the ordinary 'a', when needed, by simply ignoring the
> variation selector.

I also fully agree. This is the solution that will be the easiest to
integrate with maximum compatibility, allowing a smooth transition
immediately. Then fonts will be updated to make use of the variation
sequence, but existing applications and encoded texts won't be broken,
whatever the fonts they were built for.

Anyway, the IPA organization should really support this encoding as
its new preferred way to represent the IPA 'a' n a way that will
explictly make the visual distinction with the IPA 'ɑ' which does not
need to be modified.

One of the main reason for adding the variation sequence is that
current font technologies (initially designed to preserve the
distinction of scripts, which does not apply here as it is still
Latin) can only be tweaked based on language (and IPA notations apply
to any language), and there's no way to specify an orthographic
distinction in OpenType for example (even if BCP 47 allows such
distinctions: OpenType does not follow the newer BCP 47
specifications).

Anyway, for applications that support BCP 47 (including text renderers
that could make use of it, independantly of the font technologies
used), it should be possible for the rendering engine to convert BCP47
language tags in order to selectively convert the untagged Latin 'a'
using the variation sequence, in order to properly select the correct
glyph in OpenType fonts.

The other solution would be to standardize in OpenType a "feature" for
phonetic notation, that the renderer could also support to include the
correct GSUB element(s) needed for proper rendering of texts tagged
with BCP47 identifiers (or similar mechanisms offerered in reich text
formats, including HTML).

But there will still remain applications where complete BCP47 tags
cannot be selectively embedded in texts, notably all the many uses of
plain-text elements such as simple database fields or "flat" CSV
files. For these, it should not shoke to include a variation sequence
in such plain-texts where the distinction is needed and there's no
simple way to isolate IPA notations embedded in other plain-text.

 (Embedding Unicode-encoded language tags using the special language
tag characters is now deprecated, let's not revive those, and anyway
it would really be overkill for just tagging a single character, when
a single standard variation selector can do the trick).

Variation sequences are now heavily used and supported for sinograms.
There's no reason not support them for Latin, where they will still
have many other uses outside IPA itself (for 'a' and 'g'), e.g. in
scientific/mathematical formulas (even if there are now separate
characters for maths needing specific glyph/stylistic designs).

Thanks.
-- Philippe.


Reply via email to