Re: [OT] What is DEL for?

2001-02-28 Thread Gaute B Strokkenes

On Thu, 22 Feb 2001, [EMAIL PROTECTED] wrote:

 Otto Stolz [mailto:[EMAIL PROTECTED]] wrote:
 
 Dear Unicoders,
 
 again, I have inadvertently sent a contribution to a member rather
 than to the whole list, because the Unicode list sets the Reply-to
 header in an utmost inconvenient and unexpected manner.
 
 Here is a copy for the list. I hope I will not mistype the address.
 I really wish that I simply could use the reply-to-sender function
 of my MUA to answer to the Unicode list.

...

 Or maybe you need a mail client that allows you to apply a special
 rule to messages that come from this list such that any reply you
 send to a list message defaults to the address in the To: line
 rather than that in the From: or Reply To: line of the original
 message.

I agree with you about the specific case of the Reply-To: header, but
I think that it might be a good idea to change the list setup in other
ways.  For instance, the current setup seems to remove all
In-Reply-To: and References: headers.  This is a problem since it
breaks the ability of my email program (Gnus) to do threading, for no
particularly good reason.  In fact, I believe that all headers except
for a those in a particular list are removed.

Another annoyance is that no special headers are used to indicate that
the message is in fact from this mailing list, so that you have to use
the Sender: header etc. to do mail splitting, which is also annoying.

-- 
Gaute Strokkeneshttp://www.srcf.ucam.org/~gs234/
I'm using my X-RAY VISION to obtain a rare glimpse of the
 INNER WORKINGS of this POTATO!!



Re: [OT] What is DEL for?

2001-02-22 Thread Otto Stolz

Dear Unicoders,

again, I have inadvertently sent a contribution to a member rather than
to the whole list, because the Unicode list sets the Reply-to header in
an utmost inconvenient and unexpected manner.

Here is a copy for the list. I hope I will not mistype the address.
I really wish that I simply could use the reply-to-sender function
of my MUA to answer to the Unicode list.

Best wishes,
   Otto Stolz


--- Forwarded mail from [EMAIL PROTECTED] ("Otto Stolz")

Date: Wed, 21 Feb 2001 19:21:25 +

In a message dated 2001-02-21 07:03:46 Pacific Standard Time,
[EMAIL PROTECTED] writes:
 What is the function of ASCII control code 0x7F (DEL) in text interchange?

Am 2001-02-21 um 16:24 h hat [EMAIL PROTECTED] geschrieben:
 In MS-DOS, the TYPE command displays the glyph associated with
 U+2302 HOUSE (a.k.a. INVERTED HOME PLATE).

But this is not because it's ASCII DEL, but rather because it's
HOUSE (IBM character designator SM79), in CP 437 and CP 850.

I have donated my IBM books on character encoding to somebody who will
use them better than I could. From what I remember, the DOS codes came
in two flavours: the "Display code" which placed graphic characters in
255 of the 256 codepoints available, and the "Data code" which contained
the ASCII control characters, in their proper places (mostly). So,
CP 437 Display has the HOUSE at 0x7F, whilst CP 437 Data has the DEL-
control at 0x7F. If you send the bytes to the display (physical, or
emulated), Data code is assumed, so the HOUSE will be displayed.

I reckon, U+2032 HOUSE was included in Unicode, for this very reason.

Cf. ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MISC/IBMGRAPH.TXT.

(The tables in ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/PC/
only account for the Data variant of the various DOS code pages.)

Best wishes,
  Otto Stolz

PS: Above, I said "mostly", because the meanings of three control
codes have been permuted relative to ASCII, in the DOS Data codes.

---End of forwarded mail from [EMAIL PROTECTED] ("Otto Stolz")



RE: [OT] What is DEL for?

2001-02-22 Thread Ayers, Mike


 From: Marco Cimarosti [mailto:[EMAIL PROTECTED]]

 This also casts some light on the fact that some fonts
 (notably JIS fonts)
 have a big black box glyphs at position 0x7F: it is probably
 for overwriting
 a character already printed on paper, so that it cannot be
 read anymore.

Probably not.  A big black box (big hollow boxes are also used for
this) is a polite way to represent a character which has no glyph.

 But a few people also raised some "contemporary" issues,
 which is what I was
 trying to obtain with my question:

 Doug Ewell wrote:
  You have probably already tested this, but in Windows [...]
  it simply displays as a box [...]
  Actually, I wasn't expecting this.  I tested a file
  containing the text
  "ABC*DEF" (where * = DEL) and expected that at least one
  process would
  display "ABDEF" where the DEL character had deleted the C.

I would not expect any system to display "ABDEF".  DEL is a bit of a
"one way" character - it triggers a deletion when sent as (interactive)
input, but should not cause the same effect when stored as data.  Note that
this is an "active" character which exists in Unicode for compatibility
issues (in this case, there's quite a lot to compat with), and does not fit
neatly into the character encoding model.  Specifically, DEL is meant to be
interpreted by a filter between sender and receiver, as are all the code
points 0x00-0x1F.  Many of these no longer require intermediary
interpretation (example: carriage return (0x0D) originally triggered
physical action on the receiving printer, but now is a logical marker).

 John Cowan wrote:
  In general it has none.  Some systems interpret it (erroneously) as
  either "cancel previous character" (usually BS) or
 "interrupt process"
  (usually ^C).

 Which systems interpret 0x7F as "interrupt process"? I know
 that this would
 be 0x03 in DOS (^C), and 0x03, 0x04 or 0x1A in Unix (^C, ^D, and ^Z,
 respectively), but I know nothing about other systems, e.g. Macintosh.

HPUX by default interprets DEL as "interrupt process", when received
from stdin.

 Valery E Ushakov (Uwe) wrote:
  On DEC (and, I belive other) terminals the -- "Rubout" key (PC
  keyboards has "BackSpace" key in this position) generates DEL.  So
  emacs, The One True Editor :-), uses ^H key (i.e.
 backspace) for help
  - which causes a lot of confusion for new users who have PC
 keyboards
  that generate backspace (^H) for -- key.

 This leads me to some more questions:

 1) What happens if emacs loads Doug Ewell's text file (I.e. a
 text file
 containing "ABCdelDEF") and then saves it? Would the file's
 content be
 changed to "ABDEF"?

No.  That DEL would be loaded as data, not sent to the input
handler, which is what interprets DEL as "delete character".

 2) Could emacs be invoked with a text file as the keyboard
 input? I mean a
 thing like:
   emacs  mycmd.txt

Probably, but it wouldn't be done like that, I think.  You could
certainly write some elisp to do this.

 3) If 2 above is possible, what would happen if mycmd.txt contains
 "ABCdelDEF^X^Wmytxt.txtnewline"? Would mytxt.txt
 contain "ABDEF"?
 (Note: I understand that ^X+^W+filename is the command to
 save a new file in
 emacs; sorry if this is wrong)

In principle, this would work.


HTH,

/|/|ike



RE: [OT] What is DEL for?

2001-02-22 Thread Marco Cimarosti

Mike Ayers wrote:
  This also casts some light on the fact that some fonts
  (notably JIS fonts)
  have a big black box glyphs at position 0x7F: [...]
 
   Probably not.  A big black box (big hollow boxes are 
 also used for
 this) is a polite way to represent a character which has no glyph.

I don't think this is the case -- at least not for MS Mincho, the only
Japanese font that I have at hand -- because the glyph for all other missing
characters is a small circle; only DEL is a black square.

_ Marco



RE: [OT] What is DEL for?

2001-02-22 Thread Jungshik Shin

On Thu, 22 Feb 2001, Marco Cimarosti wrote:

 Frank da Cruz wrote:

  DEL does indeed have a use in plain text files that are encoded with
  Shift-In / Shift-Out to switch between left and right halves of (say)
  ISO 8859-1 without having to actually put 8-bit characters in the
  file.

 This sounds quite double-byte Greek to me but, if my understanding is
 correct, it could be an interesting precedent.

 This is what I think I understood:

 - Same 8-bit character sets (e.g. Latin-1) may be encoded in 7-bit bytes.

 - The same values 0x20..0x7F are used both to represent characters
 0x20..0x7F themselves (the "left half", I guess) and characters 0xA0..0xFF
 (the "right half", I guess).

 - The Shift-In and Shift-Out control characters (0x0F and 0x0E) are inserted
 in the text to signal whether or not, from that point onwards, 0x80 has to
 be added to each byte's value.

 - In this scheme, DEL (0x7F) is used to represent both character 0x7F itself
 and 0xFF (= 0x7F + 0x80 = LATIN SMALL LETTER Y WITH DIAERESIS).

Not at the same time. 02/00 and 07/15 are used to represent U00A0 and
U00FF ONLY when the right half of ISO 8859-1 (there is the name for this,
but I forgot) is designated as G1(or G2/G3) (in advance) and (later)
invoked onto GL with SO (or LS2/LS3).  The meaning of 07/15 and 02/00
are preserved when ISO 646 is invoked on GL, I guess.


 If you ACK my understanding, the question is: how do these 0x7F bytes
 (representing 0xFF characters) interact with terminals/host communication?

I think a decent terminal (emulator) with the proper understanding of
ISO-2022 would not get confused :-)


  Ditto for "higher" levels of ISO-2022 character-set invocation (LS3, etc).

 Could I find ISO-2022 on-line (or an unofficial explanation of it)?

The answer was given yesterday :-). Pls, go to http://www.ecma.ch and
look for ECMA-35 (which is ISO-2022). Unlike ISO, ECMA has made their
standard documents available on-line for quite a while.

Jungshik Shin




RE: [OT] What is DEL for?

2001-02-22 Thread Pierpaolo BERNARDI


On Thu, 22 Feb 2001, Marco Cimarosti wrote:

 Could I find ISO-2022 on-line (or an unofficial explanation of it)?

Yes. ISO-2022 = ECMA-35

search in www.ecma.ch for ecma-35.pdf

BTW, here what it says about delete:


6.2.1 Character DELETE

Name: DELETE Acronym: DEL Coded representation: 07/15 DEL was originally
used to erase or obliterate an erroneous or unwanted character in punched
tape. DEL may be used for media-fill or time-fill. DEL characters may be
inserted into, or removed from, a CC-data-element without affecting its
information content, but such action may affect the information layout
and/or the control of equipment.


On request, ECMA also sends printed copies of their standards, and also a
cdrom containing all of their standards. All this for free.

Ciao
 Pierpaolo




Re: [OT] What is DEL for?

2001-02-21 Thread Valeriy E. Ushakov

On Wed, Feb 21, 2001 at 06:29:29 -0800, Marco Cimarosti wrote:

 What is the function of ASCII control code 0x7F (DEL) in text
 interchange?
 
 Particularly, what effect or interpretation might it have in
 communication protocols, terminal protocols and, especially, inside
 text files?
 
 My interest is about the function of this character in
 *contemporary* platforms and software, although I wouldn't dislike
 historical information, as far as it is clearly flagged as such.

AFAIK, the history is that on punched media (cards, paper tape) DEL
was used to delete a character as it was represented as holes in all
positions.

For paper tape the following demonstrates it nicely:

$ echo -ne '\177' | /usr/games/ppt
___
| .ooo|
___


On DEC (and, I belive other) terminals the -- "Rubout" key (PC
keyboards has "BackSpace" key in this position) generates DEL.  So
emacs, The One True Editor :-), uses ^H key (i.e. backspace) for help
- which causes a lot of confusion for new users who have PC keyboards
that generate backspace (^H) for -- key.

SY, Uwe
-- 
[EMAIL PROTECTED] |   Zu Grunde kommen
http://www.ptc.spbu.ru/~uwe/|   Ist zu Grunde gehen



Re: [OT] What is DEL for?

2001-02-21 Thread DougEwell2

In a message dated 2001-02-21 07:03:46 Pacific Standard Time, 
[EMAIL PROTECTED] writes:

 What is the function of ASCII control code 0x7F (DEL) in text interchange?

  Particularly, what effect or interpretation might it have in communication
  protocols, terminal protocols and, especially, inside text files?

You have probably already tested this, but in Windows text editors (even 
Word) it simply displays as a box.  In MS-DOS, the TYPE command displays the 
glyph associated with U+2302 HOUSE (a.k.a. INVERTED HOME PLATE).  I don't 
know if this would be different using the ANSI X3.64 emulator (ANSI.SYS).

Actually, I wasn't expecting this.  I tested a file containing the text 
"ABC*DEF" (where * = DEL) and expected that at least one process would 
display "ABDEF" where the DEL character had deleted the C.  I guess this 
means that on Microsoft operating systems, DEL is treated just like any other 
character (although a glyph may not be available).

-Doug Ewell
 Fullerton, California



Re: [OT] What is DEL for?

2001-02-21 Thread Frank da Cruz

  Which systems interpret 0x7F as "interrupt process"? I know that this would
  be 0x03 in DOS (^C), and 0x03, 0x04 or 0x1A in Unix (^C, ^D, and ^Z,
  respectively), but I know nothing about other systems, e.g. Macintosh.
 
 Very long ago, in the Seventh Edition of Unix, the default interrupt
 character was DEL, because it was (as Uwe writes) available on old
 terminals without the need to hold down any shift key.
 But even then most people changed it to ^C in actual practice.
 
Yes, DEL has many, many uses in the terminal-to-host direction, as do most
other control characters.  I probably use DEL about 1000 times a day.
We all should be familiar with the BS/DEL confusion, but those who aren't
might want to take a look at:

  http://www.columbia.edu/kermit/backspace.html

  1) What happens if emacs loads Doug Ewell's text file (I.e. a text file
  containing "ABCdelDEF") and then saves it? Would the file's content be
  changed to "ABDEF"?
 
 No.  As part of a text file, DEL has no known significance on any
 system.
 
You can never know what all its uses are.  If anybody hopes to be able to
recycle or abolish it, that would be a bad idea.  ASCII (ISO 646 IRV) must
remain stable and inviolable for all time.

DEL does indeed have a use in plain text files that are encoded with
Shift-In / Shift-Out to switch between left and right halves of (say)
ISO 8859-1 without having to actually put 8-bit characters in the file.
Ditto for "higher" levels of ISO-2022 character-set invocation (LS3, etc).

DEL is used by some (non-ANSI-X3.64) terminals for specific purposes,
and others for padding.

- Frank




Re: [OT] What is DEL for?

2001-02-21 Thread John Cowan

Marco Cimarosti wrote:


 Which systems interpret 0x7F as "interrupt process"? I know that this would
 be 0x03 in DOS (^C), and 0x03, 0x04 or 0x1A in Unix (^C, ^D, and ^Z,
 respectively), but I know nothing about other systems, e.g. Macintosh.

Very long ago, in the Seventh Edition of Unix, the default interrupt
character was DEL, because it was (as Uwe writes) available on old
terminals without the need to hold down any shift key.
But even then most people changed it to ^C in actual practice.

 1) What happens if emacs loads Doug Ewell's text file (I.e. a text file
 containing "ABCdelDEF") and then saves it? Would the file's content be
 changed to "ABDEF"?

No.  As part of a text file, DEL has no known significance on any
system.

-- 
There is / one art || John Cowan [EMAIL PROTECTED]
no more / no less  || http://www.reutershealth.com
to do / all things || http://www.ccil.org/~cowan
with art- / lessness   \\ -- Piet Hein




Re: [OT] What is DEL for?

2001-02-21 Thread Valeriy E. Ushakov

On Wed, Feb 21, 2001 at 09:42:53 -0800, Marco Cimarosti wrote:

 1) What happens if emacs loads Doug Ewell's text file (I.e. a text file
 containing "ABCdelDEF") and then saves it? Would the file's content be
 changed to "ABDEF"?

No.  I don't think any program interprets file contents in this way.


 2) Could emacs be invoked with a text file as the keyboard input?

No.  It needs a reall tty.


SY, Uwe
-- 
[EMAIL PROTECTED] |   Zu Grunde kommen
http://www.ptc.spbu.ru/~uwe/|   Ist zu Grunde gehen



Re: [OT] What is DEL for?

2001-02-21 Thread John Cowan

Frank da Cruz wrote:


 DEL does indeed have a use in plain text files that are encoded with
 Shift-In / Shift-Out to switch between left and right halves of (say)
 ISO 8859-1 without having to actually put 8-bit characters in the file.
 Ditto for "higher" levels of ISO-2022 character-set invocation (LS3, etc).

I did not think you could put a 96-element character set such as
8859-1-high-half (ESC 02/13 04/01) into G0, but I see by checking
ISO 2022 (ECMA-35) that you can, overriding the usual meanings of
SP and DEL.

-- 
There is / one art || John Cowan [EMAIL PROTECTED]
no more / no less  || http://www.reutershealth.com
to do / all things || http://www.ccil.org/~cowan
with art- / lessness   \\ -- Piet Hein




Re: [OT] What is DEL for?

2001-02-21 Thread John Cowan

Marco Cimarosti wrote:

 What is the function of ASCII control code 0x7F (DEL) in text interchange?
 
 Particularly, what effect or interpretation might it have in communication
 protocols, terminal protocols and, especially, inside text files?

In general it has none.  Some systems interpret it (erroneously) as
either "cancel previous character" (usually BS) or "interrupt process"
(usually ^C).

 
 My interest is about the function of this character in *contemporary*
 platforms and software, although I wouldn't dislike historical information,
 as far as it is clearly flagged as such.

Historically it was used to "rub out" existing characters on punched
paper tape.  Because it had all bits set (from a 7-bit viewpoint), it
could always be punched over any other character without the need of
changing holes to non-holes.  Therefore, punched paper tape systems
ignored it.


-- 
There is / one art || John Cowan [EMAIL PROTECTED]
no more / no less  || http://www.reutershealth.com
to do / all things || http://www.ccil.org/~cowan
with art- / lessness   \\ -- Piet Hein