On 2025-12-25 23:32, [email protected] via Unicode wrote:
Dnia 26 grudnia 2025 01:44 Jim DeLaHunt via Unicode
<[email protected]> napisał(a):
On 2025-12-25 09:40, [email protected]
<mailto:[email protected]> via Unicode wrote:
In L2/98-354 (https://www.unicode.org/L2/L1998/98354.pdf
<https://www.unicode.org/L2/L1998/98354.pdf>), the following
characters were proposed for DEC Technical Character Set
compatibility:
* E0AE Right ceiling corner DEC Tech 03/05
* E0AF Right floor corner DEC Tech 03/06
However, in L2/00-159
(https://www.unicode.org/L2/L2000/00159-ucsterminal.txt
<https://www.unicode.org/L2/L2000/00159-ucsterminal.txt>) which
was incorporated into Unicode 3.2, those characters were withdrawn:
E0AE Right ceiling corner U+2309 (1)
E0AF Right floor corner U+230B (1)
(1) These characters were in Unicode all along, but the shape shown in the
Unicode book was different from the shape on the terminal. However,
this is not sufficient reason to have two versions of the same symbol.
However, in DEC Technical Character Set (vt100.net)
<https://urldefense.com/v3/__https://vt100.net/charsets/technical.html__;!!BDUfV1Et5lrpZQ!Sz6zze1z-M_LTSPW0rIIzWUQNelFKnVEyxjuZ9heR5jLaL5gUpYG6I0qDcmkLRMrcElIb4gqD0QQLkFGildA6qyV$>
the
usage of those characters is explained as being parts of the
summation symbol, and the left side of those characters connect
to U+2500.
As far as I know, the floor and ceiling symbols usually have
similar height to brackets (with the horizontal stroke being on
bottom or top of the bracket height), and are used in pairs of
left and right glyphs, which is completely different from the DEC
Tech 03/05 and DEC Tech 03/06 characters. Is there any
typographical precedent of floor and ceiling symbols being used
with a centered horizontal stroke or with a horizontal connection
to other characters?
The idea of "typographical precedent of floor and ceiling symbols
being used with a centered horizontal stroke" seems to me a
non-sequitur from the content at DEC Technical Character Set
(vt100.net)
<https://urldefense.com/v3/__https://vt100.net/charsets/technical.html__;!!BDUfV1Et5lrpZQ!Sz6zze1z-M_LTSPW0rIIzWUQNelFKnVEyxjuZ9heR5jLaL5gUpYG6I0qDcmkLRMrcElIb4gqD0QQLkFGildA6qyV$>.
The latter alludes to a 2-d layout system by which parts of a
sigma sign is assembled from eight graphical components to make a
2x3 or 3x5 composite graphic. The former talks about formatting of
linear plain text with general-purpose fonts.
I can see the benefit of providing a mapping between the code
units in the DEC TCS and Unicode. I don't think that implies that
ordinary plain text and text formatting mechanisms should be able
to perform the 2-d layout described by DEC TCS. If I were
implementing DEC TCS in a modern OS and app, I would either use
graphics mechanism to draw the sigma, or an app-specific font
where the glyphs have the size and relationships which the app
requires.
Does this answer your question? Or, reject the premise helpfully?
Best regards,
—Jim DeLaHunt
The proposal L2/00-159 was intended to introduce support of the DEC
Technical Character Set and some other legacy sets. Since U+23B7
RADICAL SYMBOL BOTTOM (only attested in DEC Technical) was accepted in
Unicode 3.2, it then implies that Unicode itself is intended to
support the DEC Technical Character Set.
I think you have a meaning of "support" which is perhaps more broad than
my understanding. I am not familiar with the details of the decisions
around L2/00-159. What I take from this email thread is that the Unicode
Standard provides a bidirectional mapping between glyph codes in the DEC
TCS and Unicode scalar values. That does not imply that plain Unicode
text and general-purpose fonts and ordinary text layout engines can
reproduce the behaviour of DEC TCS systems.
In that proposal, it is implied that 03/05 and 03/06 are mapped
to U+2309 and U+230B (⌉⌋). If, as you claim, 'The idea of
"typographical precedent of floor and ceiling symbols being used with
a centered horizontal stroke" seems to me a non-sequitur', this seems
to imply that the mapping is inappropriate, which is what I'm
concerned with.
You keep making implications. How about sticking to the plain meaning of
words? To me, your reference to "typographical precedent" does not
follow from your earlier reference to the list of DEC TCS glyph codes.
In VT330/VT340 Programmer Reference Manual
(http://bitsavers.org/pdf/dec/terminal/vt340/EK-VT3XX-TP-001_VT330_VT340_Text_Programming_Mar87.pdf,
page 27; page 41 of 348 in pdf), the list of individual characters is
shown, implying that like many legacy computing platforms, DEC
Technical Character Set uses a grid of independent character cells,
where each character tile may be set to a separate character.…
Note that a "grid of character cells" is a two-dimensional layout of
glyphs. That is different from the Unicode character-glyph model and
plain text. If a system wants to reproduce VT330/VT340 behaviour, it
needs to layer a higher-level protocol on top of Unicode plain text. So,
don't expect Unicode plain text and character-glyph models to reproduce
a VT330. And, the higher-level protocol can specify use of a font which
has the glyph shapes and alignments that fit the VT330 behaviour.
There is no mention of any 'graphics mechanism to draw the sigma', so
that is off topic to use.
Using 'an app-specific font where the glyphs have the size and
relationships which the app requires' is supposed to work in theory,
but there is the problem of deciding what mapping to use.
That is the task of the higher-level protocol. I think you should not
expect the Unicode plain text model to determine that.
Best regards,
—Jim DeLaHunt
Dnia 26 grudnia 2025 02:11 asmusf--- via Unicode
<[email protected]> napisał(a):
For regular mathematics we have encoded some glyph pieces that can
be used to construct large sizes of sigma, brackets or fences.
They don't rely on a specific layout mechanism, in fact they may
not even be serialized in the text stream.
To me, it would make sense to unify these with their terminal
equivalents which would then require an emulator to serialize them
properly and to use a font where the glyph pieces line up.
The bracket portions in DEC Technical are already unified with their
STIX equivalents. The top left and bottom left portions of summation
symbol in DEC Technical are already unified with the summation top and
bottom halves. What I'm asking about is the top right and bottom right
corners, which the L2/00-159 proposal did not identify as portions of
summation symbol at all, and likely misidentified the character.
There could be an issue with that approach if the way the symbol
is broken into pieces is different enough that a unification can't
work on a logical level, even with a dedicated font.
In this case, the top right and bottom right corners of summation
symbol being different enough from ceiling and floor symbols.
If there are pieces that don't map well to existing ones it might
be sensible to encode whichever ones can't be mapped.
Any documents requesting that need fully worked out examples.
A./
--
. --Jim DeLaHunt,[email protected] http://blog.jdlh.com/ (http://jdlh.com/)
multilingual websites consultant, Vancouver, Canada