Re: [Standards] UPDATED: XEP-0301 (In-Band Real Time Text) -- candidate for LAST CALL

Gunnar Hellström Sun, 22 Jul 2012 15:01:25 -0700

On 2012-07-22 03:19, Mark Rejhon wrote:

Hello XSF members (Peter and Kevin, et cetra)
I incorporated Gunnar's edit today. Do you prefer I submit an 0.5immediately before you review? The only changes I did were editorial-- word edits, phrase edits and sentence edits/moves, in approximately15 locations, no protocol changes over 0.4.

Mark, thanks for taking the trouble to transpose my edit proposals to0.4 and creating and submitting 0.5.Even if there are a few remaing items I am not in total agreement withyou about, I think these could either be accepted as you have proposedthem or be accepted as editorial changes during a last call.

See some comments on these below. ( stille referring to 0.3 sectionnumbering)

11. Section 4.1, Example 1, Line 9 , make the text part "my Ju", so that it is obvious that it is not about word by word
    transmission.
    12. Section 4.1, Example 1, Line 15 ,  make the text part "liet"
    only,   so that it is obvious that it is not about word by word
    transmission.

    13. Section 4.2.2 event='new' third line.  change "display, and
    then process" to "reception, and then process text and"    .
    Because we must not assume that all applications display the text. "
11/12. Edit Deferred -- It is merely an introductory example. Also, ifpeople chunk text instead of preserving key press intervals, thenwhole-word burst transmission is greatly preferred over broken-wordburst transmission.

But why do you want to confuse the reader with giving the impressionthat transmission is word-wise, when it is time-sampled in reality. Isuggest to accept my edit proposal in order to not cause wrongimpression what it is all about.

13. Edit Deferred -- It's a great suggestion in theory, but in allpracticality, the change is too confusing. Most implementers ofblind-assistive software will figure out "display" means "reception"or "present to the user" .... The word "display" is pretty standardin many XEP's, from a search I did. Even it's an XML element in some,i.e. XEP-0202 (A Final standard) uses a <display/> element.

I was thinking of non-displaying software as gateways,multi-party-bridges, applications etc. They never "initialize a newreal-time message for display". But I can accept your proposal. Thefinal intention is in most cases to display.

    14. Section 4.2.2 event='cancel'.  How does this behave through
    multi-user chat and multiple login situations? Is the
    event='cancel' sent through to all? I see a risk that one user
    sending event='cancel' would turn off rtt for all recipients. If
    this is true, I see three solutions:
    a) Delete event='cancel'. b) Add a sentence saying "event='cancel'
    SHALL not be used in a MUC or multi-login session.  c) Add a
    sentence saying "event='cancel' SHOULD be ignored in MUC and
    multi-login sessions.
    I have a slight preference for solution a), to delete cancel from
    the specification.

    If it is deleted, also the sections in 6.2.1 and 6.2.2 dealing
    with "cancel" shall be deleted.
14. Explanation -- Cancel is critical to the needs of severalimplementers. See Activating/Deactivating text sectionhttp://xmpp.org/extensions/xep-0301.html#activating_and_deactivating_realtime_text ....Also, cancel is 100% appropriate for multi-login session. Iclarified the MUC chapter to say that cancel should not deactivateoutgoing transmission since it would allow one participant to suppressreal-time text between other willing participants. (senders may stilluse it, in order to discard their unfinished real-time message whenlogging off, etc)However, an edit was made to the Activation/Deactivation section toclarify cancel behaviour during Multi-User Chat.
[2 sentences modified]


I need to see this before judging.

18. Consider deleting the "Forward Delete" d action element. It cannotbe used with the default value for p because that would point outsidethe real-time message. Therefore, a p must always be calculated andincluded. Then it is equal in complexity to use it as Backspace.Having both just seem to add complexity to implementations. ( It wouldhave been different and of value if it worked from a current cursorposition.) But if you have good reasons, e.g. easily matching someediting operation result, you can keep it.
18. Edit deferred -- Explanation given in long email.

Forward delete just introduces complexity. Since you do not have theconcept of "current position" in the specification, a forward delete anda backspace of anything else than the last character are equally long incoding.But, if you want to have these two codings of the same operation, I canaccept it.

    19. Section 4.5.2, third bullet point. I would like to see the
    words "Unicode Code Points" replace "Unicode Character Counting".
    Code points is the safe base that we count.
    20. Section 4.5.4.1 At the end, insert paragraph: "Characters
    consisting of multiple Unicode code points SHOULD be sent together
    in the same <t/> element. Values of /*p*/ and /*n*/ SHOULD NOT
    result in pointing within such combinations of code points."    (
    this is to avoid the situations described with the long note to
    section 4.5.4.2. The actions to avoid it should be more on the
    sender side as I propose here.
19. Edit deferred -- Explanation given in previous email. It helpsreader associate WHICH definition of "character" we are using. Eventhe RFC's say that the word has multiple interpretations, so it'sappropriate here in the title. The title is like a glossary entry, andthe contents explain we're using code points as the method of countingcharacters.

I still regard this dangerous and confusing. We are counting Unicodecode points, and that needs to be clear in all explanations.

20. Edit deferred -- I didn't like adding the paragraph either, butfollowing your suggestion will complicate implementations. If I doyour suggestion, it will no longer be easy to do "Monitoring MessageChanges Instead Of Key Presses"http://xmpp.org/extensions/xep-0301.html#sending_realtime_text becauseI would no longer be able to treat the real-time message as easily asif it was essentially "an array of code points". You are a strongadvocate of this method too, and I'm sure you agree with me you don'twant to complicate section 6.4.1

I think that typing of characters resulting in a multiple of code pointswill result in these code points being submitted to display at the sametime, and therefore easily can be put into the same <t/> element. Thisis valid for example for the combining diacritical marks 300 -36F, thatnormally are displayed together with their base character.

http://unicode.org/charts/PDF/U0300.pdf
Usually nothing is displayed on the sending side until both have been typed.

Putting both in the same t-element simplifies for both the transmitterand the receiver. The receiver does not need to handle an outstandingcombinable diacritical mark waiting for its base character.There would also be no risk that text in edits combine in an erroneousway with already existing code points, before next message arrivescontaining the correct second half of the character.

So, keeping combined characters together is a good goal andsimplification and should be adviced with a "SHOULD".

23. Section 4.5.4.2 The Note is correct, but very long. I would like tosee it shortened but have not wording proposal at the moment. It aims atavoiding situations that I suggest prevent by my proposal 20 on thesender side.


23. See my explanation in 20.  Suggestions of shortenings are welcome.

Original

Note thatElement <t/> -- Insert Text<http://xmpp.org/extensions/xep-0301.html#element_t_insert_text>isallowed to contain any subset sequence of Unicode characters from thereal-time message. This may result in certain situations where the texttransmitted in <t/> elements is allowed to be temporarily anincorrectly-formed Unicode string. (i.e. non spacing characters,orphaned diacritic, orphaned control character includingdirection-change character for bidirectional Unicode, incompletelyformed glyphs, etc.) but becomes correct when inserted into the middleof the recipient's real-time message, and passes recipientvalidation/normalization with no character modifications. Note that acompliant XML processor does not modify or fix Unicode errors caused bytaking only a subset of characters from correctly-formed Unicode text.One alternative way for implementers to visualize this, is to visualizethe Unicode text as an array of individual code points, and treatthepandnvalues accordingly.


Proposal

Note thatElement <t/> -- Insert Text<http://xmpp.org/extensions/xep-0301.html#element_t_insert_text>isallowed to contain any subset sequence of Unicode characters from thereal-time message. This may result in certain situations where the texttransmitted in <t/> elements is allowed to be temporarily anincorrectly-formed Unicode string. (i.e. non spacing characters,orphaned diacritic, orphaned control character includingdirection-change character for bidirectional Unicode, incompletelyformed glyphs, etc.) but becomes correct when inserted into the middleof the recipient's real-time message. The thepandnvalues should not beallowed to point inside characters consisting of multiple code points,and the code points of such combined characters should be sent in thesame <t/>element.

25/26/27. No longer applicable -- I already rewrote the paragraph insection 5 for 0.4

Yes, good to distinguish between service discovery, and activating support.
There is something missing in a sentence in version 0.4, chapter 5.

In order for an application to determine whether an entity supports thisprotocol, where possible it SHOULD use the dynamic, presence-basedprofile of service discovery defined in .



What was your intention after "in"?

In version 0.4, section 6.2 looks complex and need furtherrestructuring now before I can judge the final result of the protocol.

    30. Section 6.5.3.1. Please divide in paragraphs for easier reading.
    32. Section 6.5.3.2. Please divide in paragraphs for easier reading.
30. This is editorial related, so can wait for LAST CALL if there aremultiple agreements.

Yes, please do

32. This is editorial related, so can wait for LAST CALL if there aremultiple agreements.

Yes, please do

33. Edit deferred -- Although an implementation detail and there aregood reasons to almost always save it, but not 100% always.

Well, quite important to not lose messages. mention that save is thenormal, but that there may be application situations where discard is valid.



Thanks,

Gunnar

Re: [Standards] UPDATED: XEP-0301 (In-Band Real Time Text) -- candidate for LAST CALL

Reply via email to