Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-30 Thread Simon Josefsson
Pete Resnick presn...@qualcomm.com writes:

 On 5/29/11 1:29 PM, Simon Josefsson wrote:
 John C Klensinjohn-i...@jck.com  writes:


 --On Sunday, May 29, 2011 08:58 +0200 Simon Josefsson
 si...@josefsson.org  wrote:

  
 in a Unicode 6.0 environment, evaluate U+19DA as PVALID and
 therefore not raise that error, then it is not compliant
 with RFC 5892, irrelevant of the Updates status of the
 present document.
  
 I don't see how.

 My code uses the tables from RFC 5892 which were generated in
 an Unicode 5.2 environment.

 Then you are, in my terminology, implementing RFC 5892 in a Unicode
 5.2 environment. Your implementation is carrying the 5.2
 environment with it.

The Unicode library used during run-time, for RFC 5891, is version 6.0
though.

 But I now think I see the source of the misunderstanding:

 You could reasonably say that your implementation is conformant
 but current only to Unicode 5.2.   If you are willing to say
 that, I guess you don't need to change anything.
  
 I claim my implementation is compliant to all requirements in RFC 5890,
 RFC 5891, RFC 5892 and RFC 5893.

 There's the problem. You can't claim that your implementation is
 compliant with the above RFCs without also mentioning the version of
 Unicode you are using, precisely because the RFCs are now Unicode
 version independent. Your implementation that evaluates U+19DA as
 PVALID is complaint with the RFCs *as applied to Unicode version
 5.2*. Your implementation that evaluates U+19DA as PVALID is *not*
 complaint with the RFCs *as applied to Unicode version 6.0*.

The correct claim would then be that I use Unicode 5.2 (for tables) and
Unicode 6.0 (for run-time).

I believe this is typical of how IDNA2008 will be deployed: the IDNA2008
implementation uses pre-computed tables for one Unicode version fixed at
compile-time, and the Unicode library on the system may be more rapidly
changing and could support a later version of Unicode.

Can you point to some (normative) requirement in IDNA2008 that forbids
this?

/Simon
___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-29 Thread Simon Josefsson
Pete Resnick presn...@qualcomm.com writes:

 On 5/26/11 6:19 AM, Simon Josefsson wrote:
 Dear IESG,
 Is the intention that this document will update RFC 5892 or not?
 The document does not contain a Updates: header but the draft name
 suggests otherwise to me, hence my question.


 The IESG has not yet discussed this topic, and I will be bringing it
 to their attention. We may decide to include Updates: RFC 5892; we
 may not.

Thanks for clarifying.

 If the document does not update RFC 5892 (or some other document), I
 support publishing this document because it will not affect my
 implementation of RFC 5892.

 If the document updates RFC 5892, in order to remain compliant with the
 RFCs I would have to modify my implementation and make a backwards
 incompatible change.  Today U+19DA converts to xn--pkf.  With this
 document, I would have to raise an error for that input instead.

 My understanding is that the above statements are, if not false, at
 least incomplete. It is not true, at least with regard to RFC 5892,
 that today U+19DA converts to xn-pkf because RFC 5892 doesn't do any
 converting.

Sure.  RFC 5892 is part of the IDNA2008 set and primarily RFC 5891
describe the conversion.  RFC 5891 uses the properties defined in RFC
5892.

 It *is* true that the code point U+19DA is PROTOCOL VALID using the
 algorithm in RFC 5892 as applied to Unicode 5.2, and is DISALLOWED
 using the algorithm in RFC 5892 as applied to Unicode 6.0.

Right.

 If what you mean above is that your implementation intends to raise an
 error for DISALLOWED code points and would,

That is required by RFC 5891 section 4.2.2:

4.2.2.  Rejection of Characters That Are Not Permitted

   The candidate Unicode string MUST NOT contain characters that appear
   in the DISALLOWED and UNASSIGNED lists specified in the Tables
   document [RFC5892].

and section 5.4:

   Putative U-labels with any of the
   following characteristics MUST be rejected prior to DNS lookup:

   o  Labels containing prohibited code points, i.e., those that are
  assigned to the DISALLOWED category of the Tables document
  [RFC5892].

 in a Unicode 6.0 environment, evaluate U+19DA as PVALID and
 therefore not raise that error, then it is not compliant with RFC
 5892, irrelevant of the Updates status of the present document.

I don't see how.

My code uses the tables from RFC 5892 which were generated in an Unicode
5.2 environment.  My IDNA2008 code may eventually run in an Unicode 6.0
environment, or any other future version of Unicode.  I can't control
the Unicode version used, and from what I understand this is one of the
features of IDNA2008.  Implementations need not lock down the Unicode
version to a single Unicode version, as they had to do for IDNA2003.

If this model is not permitted, I believe there are bigger problems.

To avoid doubt, and to back up your assertment that my implementation is
non-compliant, please point to the MUST or SHOULD in RFC 5892 that
forbis this, to me, logical implementation approach.

/Simon
___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-29 Thread John C Klensin


--On Sunday, May 29, 2011 08:58 +0200 Simon Josefsson
si...@josefsson.org wrote:

 in a Unicode 6.0 environment, evaluate U+19DA as PVALID and
 therefore not raise that error, then it is not compliant
 with RFC 5892, irrelevant of the Updates status of the
 present document.
 
 I don't see how.
 
 My code uses the tables from RFC 5892 which were generated in
 an Unicode 5.2 environment.  My IDNA2008 code may eventually
 run in an Unicode 6.0 environment, or any other future version
 of Unicode.  I can't control the Unicode version used, and
 from what I understand this is one of the features of
 IDNA2008.  Implementations need not lock down the Unicode
 version to a single Unicode version, as they had to do for
 IDNA2003.

It seems to me that this is exactly where we are having a
misunderstanding.   In terms of determining conformance, those
tables are not normative, so it is not possible to say I
implemented the tables in RFC 5892 and therefore I conform to
the standard.  The closest you can get would be to say I
implemented the rules and tested against the tables when those
rules were applied to Unicode 5.2 and therefore have great
confidence in my implementaton, but conformance statements stop
with implemented the rules correctly.  

For practical reasons, we expect to see production
implementations using tables or other abstractions of the rules
that are somewhat pre-compiled, not applying the rule set each
time.   One consequence of this is that a given table-based
implementation is inevitably dependent on versions of Unicode
even if the Standard (and its conformance requirements) is not.
That would be true even if the type of change (correction) that
occurred with version 6.0 of Unicode had not occurred. It would
still be necessary to construct version-dependent tables to deal
with newly-assigned code points.

From the perspective of those who argued that the document
titled ...5852bis.. should not be produced and published
because it is unnecessary, the point is that we would not have
generated the document at all had the only changes been the
addition of new PROTOCOL-VALID and DISALLOWED code points by
virtue of new code points being added to Unicode.  But, in
practical terms, that is a much greater change to an
implementation than anything related to these few characters
with changed properties.

And, again, this situation would be true of virtually any
specification that depends on Unicode, regardless of whether the
definition is in terms of  rules/properties or tables. There
would be an exception if the specification depended on code
point assignments alone and was okay with treating unassigned
code points as if they had been assigned if they turned up in
the data stream (IDNA2003 attempted to lay the foundation for
the latter but failed because all of the properties that an
unassigned code point will have when it is assigned cannot be
known).  For anything else, working properly with a given
version of Unicode requires updating of code point tables,
normalization tables, and assorted property tables.   As Mark
points out, defining things in terms of the tables, with the
rules providing only guidance, has some important advantages in
this regard.  However, it guarantees the need to talk about
conformance to a Unicode version, not just Unicode.

 If this model is not permitted, I believe there are bigger
 problems.
 
 To avoid doubt, and to back up your assertment that my
 implementation is non-compliant, please point to the MUST or
 SHOULD in RFC 5892 that forbis this, to me, logical
 implementation approach.

The key is the text in Section 4 that says:

The table in Appendix B shows, for illustrative
purposes, the consequences of the categories and
classification rules, and the resulting property values.

The list of code points that can be found in Appendix B
is non-normative.  Sections 2 and 3 are normative.

It seems to me that is very clear about the relationship between
the rules and the tables.   That relationship is reiterated in
Section 7.1.1 of RFC 5892.

You could reasonably say that your implementation is conformant
but current only to Unicode 5.2.   If you are willing to say
that, I guess you don't need to change anything.   While we
recognize that you have no control over the Unicode version in
use, good sense suggests that systems will update versions of
Unicode (including all of the associated tables and support
routines as applicable) and versions of your library together,
While that should be clear from the context of the discussions
in RFC 5891 and 5892, RFC 5894 is quite explicit about it in the
second bullet of Section 7.1.2:

 o The Unicode tables (i.e., tables of code points,
character classes, and properties) and IDNA tables
(i.e., tables of contextual rules such as those
that appear in the Tables document), must be
consistent on the systems performing or validating
labels to be 

Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-29 Thread Simon Josefsson
John C Klensin john-i...@jck.com writes:

 --On Sunday, May 29, 2011 08:58 +0200 Simon Josefsson
 si...@josefsson.org wrote:

 in a Unicode 6.0 environment, evaluate U+19DA as PVALID and
 therefore not raise that error, then it is not compliant
 with RFC 5892, irrelevant of the Updates status of the
 present document.
 
 I don't see how.
 
 My code uses the tables from RFC 5892 which were generated in
 an Unicode 5.2 environment.  My IDNA2008 code may eventually
 run in an Unicode 6.0 environment, or any other future version
 of Unicode.  I can't control the Unicode version used, and
 from what I understand this is one of the features of
 IDNA2008.  Implementations need not lock down the Unicode
 version to a single Unicode version, as they had to do for
 IDNA2003.

 It seems to me that this is exactly where we are having a
 misunderstanding.   In terms of determining conformance, those
 tables are not normative, so it is not possible to say I
 implemented the tables in RFC 5892 and therefore I conform to
 the standard.  The closest you can get would be to say I
 implemented the rules and tested against the tables when those
 rules were applied to Unicode 5.2 and therefore have great
 confidence in my implementaton, but conformance statements stop
 with implemented the rules correctly.  

 For practical reasons, we expect to see production
 implementations using tables or other abstractions of the rules
 that are somewhat pre-compiled, not applying the rule set each
 time.   One consequence of this is that a given table-based
 implementation is inevitably dependent on versions of Unicode
 even if the Standard (and its conformance requirements) is not.

Right, and that describes my implementation.  There is no difference in
behaviour of an implementation that uses the informative tables in RFC
5892 directly or one that pre-computes the table at compile time using
Unicode 5.2.  The data and output are the same in both cases.  So I
don't follow where you think the misunderstanding is?  I agree with what
you say here.

 If this model is not permitted, I believe there are bigger
 problems.
 
 To avoid doubt, and to back up your assertment that my
 implementation is non-compliant, please point to the MUST or
 SHOULD in RFC 5892 that forbis this, to me, logical
 implementation approach.

 The key is the text in Section 4 that says:

   The table in Appendix B shows, for illustrative
   purposes, the consequences of the categories and
   classification rules, and the resulting property values.
   
   The list of code points that can be found in Appendix B
   is non-normative.  Sections 2 and 3 are normative.

 It seems to me that is very clear about the relationship between
 the rules and the tables.   That relationship is reiterated in
 Section 7.1.1 of RFC 5892.

s/5892/5894/

Sure.  But that does not prove (or disprove) Pete's claim that my
implementation is non-compliant.

 You could reasonably say that your implementation is conformant
 but current only to Unicode 5.2.   If you are willing to say
 that, I guess you don't need to change anything.

I claim my implementation is compliant to all requirements in RFC 5890,
RFC 5891, RFC 5892 and RFC 5893.

 While we recognize that you have no control over the Unicode version
 in use, good sense suggests that systems will update versions of
 Unicode (including all of the associated tables and support routines
 as applicable) and versions of your library together,

That is unrealistic.  Traditional operating systems are already so
complex that upgrading them to one Unicode versions across all software
pieces (Java, Perl, SQL databases, web browsers, word processors, etc)
is economically infeasible.

Modern operating system rely so much on network services that it is not
even useful to decouple the local system from external systems.
Essentially the system is identical to the Internet.  A flag day to
upgrade to the latest Unicode version across the Internet is, despite
how infinitely pleasant that would be, impossible.

If it was possible to upgrade software components to the latest Unicode
version in a controlled way, the IDNA2003 model would have worked fine.

Fortunately, I believe IDNA2008 does not require tight Unicode version
synchronization.  In fact, I believe one of the features with IDNA2008
is exactly that it doesn't require all Unicode versions to be in sync in
all parts of the Internet.

 While that should be clear from the context of the discussions in RFC
 5891 and 5892, RFC 5894 is quite explicit about it in the second
 bullet of Section 7.1.2:

  o The Unicode tables (i.e., tables of code points,
   character classes, and properties) and IDNA tables
   (i.e., tables of contextual rules such as those
   that appear in the Tables document), must be
   consistent on the systems performing or validating
   labels to be registered.  Note that this does not
   require that tables reflect the latest version of

Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-29 Thread Pete Resnick

On 5/29/11 1:29 PM, Simon Josefsson wrote:

John C Klensinjohn-i...@jck.com  writes:

   

--On Sunday, May 29, 2011 08:58 +0200 Simon Josefsson
si...@josefsson.org  wrote:

 

in a Unicode 6.0 environment, evaluate U+19DA as PVALID and
therefore not raise that error, then it is not compliant
with RFC 5892, irrelevant of the Updates status of the
present document.
 

I don't see how.

My code uses the tables from RFC 5892 which were generated in
an Unicode 5.2 environment.


Then you are, in my terminology, implementing RFC 5892 in a Unicode 5.2 
environment. Your implementation is carrying the 5.2 environment with 
it. But I now think I see the source of the misunderstanding:



You could reasonably say that your implementation is conformant
but current only to Unicode 5.2.   If you are willing to say
that, I guess you don't need to change anything.
 

I claim my implementation is compliant to all requirements in RFC 5890,
RFC 5891, RFC 5892 and RFC 5893.
   


There's the problem. You can't claim that your implementation is 
compliant with the above RFCs without also mentioning the version of 
Unicode you are using, precisely because the RFCs are now Unicode 
version independent. Your implementation that evaluates U+19DA as PVALID 
is complaint with the RFCs *as applied to Unicode version 5.2*. Your 
implementation that evaluates U+19DA as PVALID is *not* complaint with 
the RFCs *as applied to Unicode version 6.0*.


pr

--
Pete Resnickhttp://www.qualcomm.com/~presnick/
Qualcomm Incorporated - Direct phone: (858)651-4478, Fax: (858)651-1102

___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-28 Thread Pete Resnick

On 5/26/11 6:19 AM, Simon Josefsson wrote:

Dear IESG,
Is the intention that this document will update RFC 5892 or not?
The document does not contain a Updates: header but the draft name
suggests otherwise to me, hence my question.
   


The IESG has not yet discussed this topic, and I will be bringing it to 
their attention. We may decide to include Updates: RFC 5892; we may not.



If the document does not update RFC 5892 (or some other document), I
support publishing this document because it will not affect my
implementation of RFC 5892.

If the document updates RFC 5892, in order to remain compliant with the
RFCs I would have to modify my implementation and make a backwards
incompatible change.  Today U+19DA converts to xn--pkf.  With this
document, I would have to raise an error for that input instead.


My understanding is that the above statements are, if not false, at 
least incomplete. It is not true, at least with regard to RFC 5892, that 
today U+19DA converts to xn-pkf because RFC 5892 doesn't do any 
converting. It *is* true that the code point U+19DA is PROTOCOL VALID 
using the algorithm in RFC 5892 as applied to Unicode 5.2, and is 
DISALLOWED using the algorithm in RFC 5892 as applied to Unicode 6.0. If 
what you mean above is that your implementation intends to raise an 
error for DISALLOWED code points and would, in a Unicode 6.0 
environment, evaluate U+19DA as PVALID and therefore not raise that 
error, then it is not compliant with RFC 5892, irrelevant of the 
Updates status of the present document.


pr

--
Pete Resnickhttp://www.qualcomm.com/~presnick/
Qualcomm Incorporated - Direct phone: (858)651-4478, Fax: (858)651-1102

___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-26 Thread Donald Eastlake
For all those people just dying to know about this character (U+19DA),
the latest Unicode code chart listing it is here
http://www.unicode.org/charts/PDF/U1980.pdf
and the name of the character is NEW TAI LUE THAM DIGIT ONE.

Thanks,
Donald
=
 Donald E. Eastlake 3rd   +1-508-333-2270 (cell)
 155 Beaver Street
 Milford, MA 01757 USA
 d3e...@gmail.com

On Thu, May 26, 2011 at 7:19 AM, Simon Josefsson si...@josefsson.org wrote:
 The IESG iesg-secret...@ietf.org writes:

 The IESG has received a request from the Applications Area Working Group
 WG (appsawg) to consider the following document:
 - 'The Unicode code points and IDNA - Unicode 6.0'
   draft-faltstrom-5892bis-04.txt as a Proposed Standard

 Dear IESG,
 Is the intention that this document will update RFC 5892 or not?
 The document does not contain a Updates: header but the draft name
 suggests otherwise to me, hence my question.

 If the document does not update RFC 5892 (or some other document), I
 support publishing this document because it will not affect my
 implementation of RFC 5892.

 If the document updates RFC 5892, in order to remain compliant with the
 RFCs I would have to modify my implementation and make a backwards
 incompatible change.  Today U+19DA converts to xn--pkf.  With this
 document, I would have to raise an error for that input instead.  I
 believe a case-by-case evaluation for each modified code-point is a good
 way to determine whether or not to add an exception in the IDNA tables.
 I haven't seen any discussion why U+19DA is so harmful that it has to be
 disallowed.  On the contrary, everyone appears to agree that the code
 point is not widely used and the implications of continue permitting it
 are minor.  Thus I would support publication of the document after
 adding U+19DA to table BackwardCompatible (G) as PVALID.

 I do realize that I may be in the rough part of the consensus here,
 which happens, but I want to provide my feedback for the record and
 allow the decision process to proceed.  At least I will be able to shift
 blame to someone else if/when my users gets confused by this. :-)

 /Simon
 ___
 Ietf mailing list
 Ietf@ietf.org
 https://www.ietf.org/mailman/listinfo/ietf

___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-26 Thread RJ Atkinson
Earlier, John Klensin wrote:
 I favor the publication of this document
 with a minimum of further fuss.

+1

Character set issues are inherently both very complex
and also difficult to reach consensus about.

More discussion is not at all likely to reach any
different conclusion.  It is important to document
the conclusion in this case, and to document it on the
standards track.

Yours,

Ran


___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-26 Thread Paul Hoffman
On May 26, 2011, at 4:19 AM, Simon Josefsson wrote:

 Dear IESG,
 Is the intention that this document will update RFC 5892 or not?
 The document does not contain a Updates: header but the draft name
 suggests otherwise to me, hence my question.

As document co-editor, let me say definitively: this document does *not* update 
RFC 5892. The filename is an artifact of the early consideration and, as you 
know, disappears once the document is published as an RFC. Note the information 
at https://datatracker.ietf.org/doc/draft-faltstrom-5892bis/: this is what 
the IESG is going on. There is no mention of updates anywhere there.

 If the document does not update RFC 5892 (or some other document), I
 support publishing this document because it will not affect my
 implementation of RFC 5892.

It is always nice to hear support from actual implementers. :-)

--Paul Hoffman

___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-26 Thread Simon Josefsson
Paul Hoffman paul.hoff...@vpnc.org writes:

 On May 26, 2011, at 4:19 AM, Simon Josefsson wrote:

 Dear IESG,
 Is the intention that this document will update RFC 5892 or not?
 The document does not contain a Updates: header but the draft name
 suggests otherwise to me, hence my question.

 As document co-editor, let me say definitively: this document does
 *not* update RFC 5892. The filename is an artifact of the early
 consideration and, as you know, disappears once the document is
 published as an RFC. Note the information at
 https://datatracker.ietf.org/doc/draft-faltstrom-5892bis/: this is
 what the IESG is going on. There is no mention of updates anywhere
 there.

Thanks for sharing your view.  Earlier discussion suggested that the
decision of document status would be punted to the IESG, but it seems
they will only make an aggregate decision about the entire document.

/Simon

 If the document does not update RFC 5892 (or some other document), I
 support publishing this document because it will not affect my
 implementation of RFC 5892.

 It is always nice to hear support from actual implementers. :-)

 --Paul Hoffman
___
Ietf mailing list
Ietf@ietf.org
https://www.ietf.org/mailman/listinfo/ietf


Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-25 Thread John C Klensin
Just for the record,..

I favor the publication of this document with a minimum of
further fuss.

I read and studied this document before the first I-D was
published, participated in discussion of it on the IDNABIS WG
list and again on the APPSAWG list.  In each case, essentially
the same arguments were repeated and the same conclusion
reached.  While the consensus is definitely rough, we are
putting a tremendous amount of of review cycles and energy into
a document whose essence is we have agreed to do nothing.  We
rarely publish documentation of a decision to do nothing in the
RFC Series.  Instead, we simply do nothing and move on.

   john


_Background and Reprise (in case needed or wanted)_

The argument for doing something --that this document makes and
describes the wrong decision-- is rooted in a set of decisions
the IDNABIS WG made (also with rough, rather than perfect,
consensus) a rather long time ago.  One of the key differences
between IDNA2003 and IDNA2008 was the decision to try to be
independent of Unicode versions rather than tied to a specific
one.  That decision, in turn, required that the IDNA properties
of characters -- whether they are Protocol-VALID in IDNs,
DISALLOWED, or valid only in particular contexts -- became a
matter of rules that, in turn, utilize the values of selected
Unicode properties.  If Unicode never changed the properties for
a particular character once it was allocated (and avoided
introducing new characters that required contextual evaluation),
the rule-set would never need to be changed and we would never
have to debate, and write, documents keyed to those property
changes: new versions of Unicode would add new characters but
not change the IDNA properties of previously-assigned ones.

The world is not that perfect: Unicode does change properties of
already-assigned characters.  It is rare (how rare depends a bit
on perspective), but it does happen.  The reason is typically to
correct errors made in earlier property assignments, but other
reasons are possible (or one could have a
philosophically-interesting, but otherwise useless, debate about
how far the concept of error can extend).  IDNA2008 recognized
that such changes might occur and might be disruptive, so
provided for a special exception rule that could be used to
list characters that needed special treatment because of changes
in Unicode properties.  

The WG discussed the question of how to fill the table
associated with that rule in, a discussion that included the
realization that having the  having the table become large would
simply create a new, and hard-to-explain, Unicode-version
dependence.  One option was to populate it automatically: if
Unicode made changes, the table would be filled in to preserve
the behavior at the time the character was first allocated or
that appeared in Unicode 5.2, whichever came later, whatever
that was.  Another was to turn the decision-making over to the
Unicode consortium, which might have resulted in either that
same rule or a rule that treated changes from DISALLOWED to
PVALID differently from changes from PVALID to DISALLOWED.  The
WG rejected both of those ideas as well as the idea that some
changes were inherently more attractive than others.

Instead, the WG concluded that Unicode changes of character
properties that affected IDNA needed to be evaluated in the IETF
on a case-by-case basis as to whether they were important enough
to justify an addition to that exceptions table or whether the
balance fell on the side of keeping the table small (and IDNA
reflecting as short and obvious an exception list as possible).  

In this particular instance, rough consensus has been reached in
multiple groups that it is better to keep the rules (including
that exception table stable and compact than to change the rules
in the interest of preserving the character classification
associated with Unicode 5.2.  Had the characters involved been
less obscure, or more obviously used in domain names for other
than demonstrations and equivalent purposes, perhaps the
decision would have been different (or, if they were less
obscure, perhaps Unicode either would not have made the mistake
in the first place or would have decided to not correct it).

But wanting to consider the tradeoffs and balance between those
alternatives, probably with a slight bias toward not making
changes in the rules,a is precisely why the WG decided that
Unicode property changes needed to be considered by the IETF and
on a case-by-case basis.   

There is no perfect answer to the tradeoff involved unless
Unicode never makes an error and concludes that a correction is
needed.  Because no perfect answer exists, it is possible to
reopen the issue, bring up variations on the same old arguments,
and debate them endlessly.  In that case, we've done that at
least twice, and, depending on how one counts, probably several
more times.  We've decided.  Let's get the document published
rather than seeing how many words and 

Re: Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-25 Thread Mark Davis ☕
I also favor publication of the document with a minimum of further fuss.

[For a somewhat different reason. I do believe that
draft-faltstrom-5892bis-04.txt
makes an incorrect choice, breaking compatibility when that isn't at all
necessary. However, I don't think that any further discussion will change
the decision-maker's minds. Since further discussion appears to just be a
waste of everyone's time,  the document might as well just get issued.]

Mark

*— Il meglio è l’inimico del bene —*


On Wed, May 25, 2011 at 10:32, John C Klensin john-i...@jck.com wrote:

 Just for the record,..

 I favor the publication of this document with a minimum of
 further fuss.

 I read and studied this document before the first I-D was
 published, participated in discussion of it on the IDNABIS WG
 list and again on the APPSAWG list.  In each case, essentially
 the same arguments were repeated and the same conclusion
 reached.  While the consensus is definitely rough, we are
 putting a tremendous amount of of review cycles and energy into
 a document whose essence is we have agreed to do nothing.  We
 rarely publish documentation of a decision to do nothing in the
 RFC Series.  Instead, we simply do nothing and move on.

   john


 _Background and Reprise (in case needed or wanted)_

 The argument for doing something --that this document makes and
 describes the wrong decision-- is rooted in a set of decisions
 the IDNABIS WG made (also with rough, rather than perfect,
 consensus) a rather long time ago.  One of the key differences
 between IDNA2003 and IDNA2008 was the decision to try to be
 independent of Unicode versions rather than tied to a specific
 one.  That decision, in turn, required that the IDNA properties
 of characters -- whether they are Protocol-VALID in IDNs,
 DISALLOWED, or valid only in particular contexts -- became a
 matter of rules that, in turn, utilize the values of selected
 Unicode properties.  If Unicode never changed the properties for
 a particular character once it was allocated (and avoided
 introducing new characters that required contextual evaluation),
 the rule-set would never need to be changed and we would never
 have to debate, and write, documents keyed to those property
 changes: new versions of Unicode would add new characters but
 not change the IDNA properties of previously-assigned ones.

 The world is not that perfect: Unicode does change properties of
 already-assigned characters.  It is rare (how rare depends a bit
 on perspective), but it does happen.  The reason is typically to
 correct errors made in earlier property assignments, but other
 reasons are possible (or one could have a
 philosophically-interesting, but otherwise useless, debate about
 how far the concept of error can extend).  IDNA2008 recognized
 that such changes might occur and might be disruptive, so
 provided for a special exception rule that could be used to
 list characters that needed special treatment because of changes
 in Unicode properties.

 The WG discussed the question of how to fill the table
 associated with that rule in, a discussion that included the
 realization that having the  having the table become large would
 simply create a new, and hard-to-explain, Unicode-version
 dependence.  One option was to populate it automatically: if
 Unicode made changes, the table would be filled in to preserve
 the behavior at the time the character was first allocated or
 that appeared in Unicode 5.2, whichever came later, whatever
 that was.  Another was to turn the decision-making over to the
 Unicode consortium, which might have resulted in either that
 same rule or a rule that treated changes from DISALLOWED to
 PVALID differently from changes from PVALID to DISALLOWED.  The
 WG rejected both of those ideas as well as the idea that some
 changes were inherently more attractive than others.

 Instead, the WG concluded that Unicode changes of character
 properties that affected IDNA needed to be evaluated in the IETF
 on a case-by-case basis as to whether they were important enough
 to justify an addition to that exceptions table or whether the
 balance fell on the side of keeping the table small (and IDNA
 reflecting as short and obvious an exception list as possible).

 In this particular instance, rough consensus has been reached in
 multiple groups that it is better to keep the rules (including
 that exception table stable and compact than to change the rules
 in the interest of preserving the character classification
 associated with Unicode 5.2.  Had the characters involved been
 less obscure, or more obviously used in domain names for other
 than demonstrations and equivalent purposes, perhaps the
 decision would have been different (or, if they were less
 obscure, perhaps Unicode either would not have made the mistake
 in the first place or would have decided to not correct it).

 But wanting to consider the tradeoffs and balance between those
 alternatives, probably with a slight bias toward 

Last Call: draft-faltstrom-5892bis-04.txt (The Unicode code points and IDNA - Unicode 6.0) to Proposed Standard

2011-05-23 Thread The IESG

The IESG has received a request from the Applications Area Working Group
WG (appsawg) to consider the following document:
- 'The Unicode code points and IDNA - Unicode 6.0'
  draft-faltstrom-5892bis-04.txt as a Proposed Standard

The IESG plans to make a decision in the next few weeks, and solicits
final comments on this action. Please send substantive comments to the
i...@ietf.org mailing lists by 2011-06-06. Exceptionally, comments may be
sent to i...@ietf.org instead. In either case, please retain the
beginning of the Subject line to allow automated sorting.

Abstract


This document specifies IETF consensus for IDNA derived character
properties related to the three code points, existing in Unicode 5.2,
that changed property values when version 6.0 was released.  The
consensus is that no update is needed to RFC 5892 based on the
changes made in Unicode 6.0.



The file can be obtained via
http://datatracker.ietf.org/doc/draft-faltstrom-5892bis/

IESG discussion can be tracked via
http://datatracker.ietf.org/doc/draft-faltstrom-5892bis/


No IPR declarations have been submitted directly on this I-D.


___
IETF-Announce mailing list
IETF-Announce@ietf.org
https://www.ietf.org/mailman/listinfo/ietf-announce