[Rswg] Re: [Ext] Last call comments on draft-rswg-rfc7997bis

Pete Resnick Sun, 02 Nov 2025 05:03:41 -0800

On 31 Oct 2025, at 7:57, Martin J. Dürst wrote:

On 2025-10-29 09:33, Paul Hoffman wrote:
On Oct 28, 2025, at 01:35, Martin J. Dürst <[email protected]> wrote:
Content, major: Section 3: "There are many Unicode characters that obviously cannot be displayed (such as control characters), and many whose ability to be displayed is debatable.": It's unclear what "many whose ability to be displayed is debatable." means. I'd guess it refers to scripts and characters standardized recently, for which font support is still thin. If that's what is meant, please say so; if something else is meant, please make clear what that is.
There is a wide variety of things that can be debatable. Are combining characters like U+0315 (COMBINING COMMA ABOVE RIGHT) displayable? What about non-spacing marks like U+0650 (ARABIC KASRA)? I am sure people would take each side of the debate ("I can see the symbol printed in the Unicode Standard" vs. "I can't see that code point on my laptop even though it has quite a complete font set" and so on).
On any decent browser, these should display without problems. When it comes to editors, shells, and the like, the field is much wider, so there are no absolute guarantees. But these are in Unicode since Unicode 1.0 or so, so I would expect these to show.
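
A minimal sketch, assuming Python's standard unicodedata module, of how the properties of the characters mentioned above could be inspected. The helper name describe and the choice of U+0007 as a sample control character are illustrative assumptions, not anything taken from the draft. It only reports Unicode properties; whether a given font or terminal actually renders a character is a separate question that this does not answer.

```python
import unicodedata

def describe(ch: str) -> dict:
    """Return basic Unicode properties for a single character (hypothetical helper)."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        # Control characters have no name in unicodedata, so supply a default.
        "name": unicodedata.name(ch, "<unnamed>"),
        # General category: "Mn" = non-spacing mark, "Cc" = control character.
        "category": unicodedata.category(ch),
        # Canonical combining class; non-zero for combining marks.
        "combining_class": unicodedata.combining(ch),
    }

# U+0315 and U+0650 are the examples from the thread; U+0007 is an assumed sample control.
for ch in ("\u0315", "\u0650", "\u0007"):
    print(describe(ch))
```

Both U+0315 and U+0650 report general category Mn, so they are only meaningful when attached to a base character, which is part of why "displayable" is hard to pin down.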

I will leave it to you and Paul to replace "debatable" with something clearer.

Content, major (same paragraph): "If an RFC includes such characters in normative or descriptive text, the RFC needs to also clearly describe the character.": There may be cases, in particular for the correct display of examples including bidirectional text in plain text, where we want to use bidi control characters but we do not want to "describe" them (because they are not needed in HTML or PostScript).
But I'm not talking about RTL characters such as Hebrew and Arabic. I'm talking about BIDI control characters, which are invisible (except that they may affect how the graphic characters close to them are ordered). If we need to insert such characters, we shouldn't necessarily talk about these characters, but about how we expect them to reorder the rest of the text (so that readers can check whether they see the text in the order the author expected them to see it).

Chair hat off, a text suggestion: "If an RFC includes such characters in normative or descriptive text, the RFC needs to also clearly describe the character or, as in the case of some control characters, describe the effect of the character."
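
A minimal sketch, assuming Python's standard unicodedata module, of how invisible bidi control characters in a piece of text could be located so that their effect can be described. The function name find_bidi_controls and the sample string are hypothetical; the set of code points is the usual list of Unicode bidi formatting characters (U+061C, U+200E, U+200F, U+202A to U+202E, U+2066 to U+2069).

```python
import unicodedata

# Unicode bidirectional formatting characters (marks, embeddings, overrides, isolates).
BIDI_CONTROLS = (
    {0x061C, 0x200E, 0x200F}
    | set(range(0x202A, 0x202F))   # LRE, RLE, PDF, LRO, RLO
    | set(range(0x2066, 0x206A))   # LRI, RLI, FSI, PDI
)

def find_bidi_controls(text: str) -> list[tuple[int, str, str]]:
    """Return (index, code point, name) for each bidi control in text (hypothetical helper)."""
    return [
        (i, f"U+{ord(ch):04X}", unicodedata.name(ch))
        for i, ch in enumerate(text)
        if ord(ch) in BIDI_CONTROLS
    ]

# Assumed sample: a Hebrew word wrapped in an RLE ... PDF pair inside Latin text.
sample = "abc \u202b\u05e9\u05dc\u05d5\u05dd\u202c def"
for index, code_point, name in find_bidi_controls(sample):
    print(index, code_point, name)
```

Listing the controls this way says which characters are present, not how a renderer will reorder the surrounding text; describing that effect is still up to the author.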

Editorial, medium: Please remove "Authors of RFCs whose names include non-ASCII characters will likely have preferences for how their names are displayed based on their lived experiences." People, including authors, just have names.
I fully disagree that authors don't have preferences.
I'm not saying at all that authors don't have preferences. Of course they have. But that applies as well to authors that have names that can be written in all ASCII. Some want their middle name included, others not. Some want to be William, others Bill, even though their birth certificates probably both said William, and so on. What I'm trying to say is that mentioning preferences here makes people with non-ASCII/non-Latin names special when they aren't (at least not in this respect).


Ah, good point. Noted.

In fact, at various times in the past, you have had different preferences about the spelling of your surname in IETF documents. :-)
Not exactly. I have had the same preferences, but in old times, technology was limited, so I had to use some fallback.
In particular, some authors with Han / Kanji names have asked that their names be spelled with Latin characters, others have asked for their names to only be spelled with Han / Kanji, and yet others want both (often with the Latin of their family name in all caps). These are preferences that I think should be acknowledged and honored when sensible, even if it bugs some other people.
In general, I agree. Only using Latin should of course be possible. Only using Han/Kanji (or any other non-Latin script) I think is a big disservice to the reader, and I'm glad that our current document, as far as I understand it, disallows this. As for putting the family name in all caps, I think that's a style issue that should be left to the RPC.

So you're only looking for a change to the first two sentences to say that all authors, even those who might write their names with non-ASCII characters in other circumstances, can choose to give their names in only ASCII characters in an RFC if that is their preference, and if they choose to use non-ASCII characters, they need to provide an ASCII interpretation of their name.

Content, major: "Company names and geographic names generally do not need ASCII interpretations, but they can be included at the discretion of the author and the RPC.": This would mean that I could give my affiliation as 青山学院大学 and my address as 相模原、日本 or so, but it surely can't be what we want.
If that's what the author of an RFC and their stream manager wants, then it is indeed what we want. The RPC can disagree, but that disagreement is on a case-by-case basis, not colored by this document.
Sorry, but first, I don't understand why we are making a difference between names (where Latin equivalents are required) and company names,..., where Latin equivalents are voluntary. Second, I think it would be a big disservice to the readers if affiliations and locations were unreadable for most of them. As an example, the current policy would allow using just 华为 or 東芝, without making clear that the author is affiliated with Huawei or Toshiba.


I'll note that as an open issue.

Content, major: RFCs currently use last (family) name plus initial(s) in many places, and we should change this (as a matter of policy if necessary). The reason is that there are many people where the family name isn't very informative. This is very frequent for Koreans, Chinese, and Danish. It can also happen in other cultures.
I fully agree, but that's a topic for the Style Guide, not this document. If you start a thread about this on rfc-interest@, I would certainly participate.
I'm not at all convinced that the RFC will be ready to change this, as it goes back to the start of the series. If the RFC doesn't think this needs changing, the only way to change it is to make it an issue of policy, which means that this WG is responsible. And the quickest way to do that is to put it into the current document, which already contains policy about names.


And I'll note this as an open issue as well.

pr
--
Pete Resnick https://www.episteme.net/
All connections to the world are tenuous at best

--
rswg mailing list -- [email protected]
To unsubscribe send an email to [email protected]
