----- Original Message -----
From: "Dave Thaler" <dtha...@microsoft.com>
To: "Brian E Carpenter" <brian.e.carpen...@gmail.com>
Cc: <ipv6@ietf.org>; "Bob Hinden" <bob.hin...@gmail.com>
Sent: Saturday, August 11, 2012 6:14 PM
> > -----Original Message-----
> > From: ipv6-boun...@ietf.org [mailto:ipv6-boun...@ietf.org] On Behalf
Of
> > Brian E Carpenter
> > Sent: Saturday, August 11, 2012 3:40 AM
> > To: Dave Thaler
> > Cc: Bob Hinden; ipv6@ietf.org
> > Dave,
> >
> > On 11/08/2012 03:59, Dave Thaler wrote:
> > > Brian Carpenter writes:
> > >> On 09/08/2012 22:31, Stuart Cheshire wrote:
> > >>> At the meeting in Vancouver, Dave Thaler made a point that I
found
> > >>> convincing:
> > >>>
> > >>> Where is the character set for IPv6 zone IDs specified?
> > >> RFC 4007 doesn't do so, but can be read to imply ASCII.
> > >
> > > How?  RFC 4007 says:
> > >> An implementation MAY support other kinds of non-null strings as
> > >> <zone_id>.  However, the strings must not conflict with the
delimiter
> > >> character.  The precise format and semantics of additional
strings is
> > >> implementation dependent.
> >
> > Yes, it says that, but the context to me implies ASCII. We could
argue about
> > that for a long time, so let's not bother...
> >
> > > So it's completely implementation dependent, the only restriction
> > > being that % and null are disallowed.
> > >
> > >> draft-ietf-6man-uri-zoneid-02 is explicit that it refers to the
URI
> > >> character set, which is ASCII:
> > >>
> > >>    A <zone_id> SHOULD contain only ASCII characters classified
> > >>    in RFC 3986 as "unreserved".
> >
> > The draft isn't clear enough (yet) but my idea was that this was
part of the
> > update to RFC 4007.
>
> During the Vancouver meeting, the chairs called the question of
> whether to Update 4007 or not.   The clear sense of the room was not
> to do so.

Unfortunately, the minutes seem to have no recollection of this
happening,
although as you later say, it is what happens on the list that takes
precedence.
Had I been in the room, I would not have concurred.

Tom Petch

Minutes extract

   Bob Hinden presented the draft about Representing IPv6 Zone
          Identifiers in Uniform Resource Identifiers,
          (draft-ietf-6man-uri-zoneid-02.txt)

======================================================================

          Erik Kline wanted to know if the authors have considered '@'
as a
          separator. He also wanted to know how to ensure that the zone
id was a
          valid production. Bob mentioned that it cannot be ensured.
Dave Thaler
          mentioned that in windows the zone id was always an integer
but it
          could be different for other OSs.

          Stuart Cheshire wanted this to be mentioned in the document as
a
          guideline to OS implementers. He wanted to do what was best
for the
          end users rather than what was best for the programmers. He
believed
          that unless the zone id was greater than 250 there were no
issues. Bob
          Hinden agreed with Stuart and thought that browsers could
easily
          handle the special case.

          Dave Thaler thought that the whole justification about cut and
paste
          was dubious, and he wanted the justification to be rewritten
          concerning other issues with using %. He talked about the
validation
          of the stuff inside [] performed by RFC4007 validators. He
wanted to
          make sure that he could reuse the validation code instead of
writing
          two sets of code that could introduce bugs. Juergen
Schoenwaelder did
          not see the value with using a new separator either.

          Stig Venaas wanted to mention that the % could become
ambiguous if %
          escapable characters are already present in the interface
names. Dave
          mentioned that if the text has any foreign characters cut and
paste
          would not work anyway. Mark Andrews agreed with Stig
concerning this
          being confusing to humans but not for computers.

          Stuart and Kerry thanked Bob for taking on this tough job.

          Bob believed that the authors had enough information to do
another
          revision with the comments received during the meeting.
==================================================


>               Assuming that consensus continues to hold on the list, I
believe
> this means the sentence you quote needs to be removed, and we cannot
> place any restrictions on zone_id that aren't in 4007.
>
> -Dave
>
> > >> But it allows percent encoding in a URI, which is necessary
because
> > >> of the SHOULD:
> > >>
> > >>    ZoneID = 1*( unreserved / pct-encoded )
> > >
> > > ZoneID needs to allow (including via percent-encoding) the same
> > > characters as are allowed in <zone_id> in RFC 4007.  For example
the ']'
> > > character would be legal in RFC 4007 but would have to be percent
> > > encoded in a URI.
> >
> > Yes
> >
> > >
> > >>> If we accept
> > >>> that future interface names might include non-roman characters,
then
> > >>> we have to assume that to allow safe unambiguous use in URIs,
> > >>> interface names have to undergo escaping.
> > >> If we want to internationalise the ZoneID, that would be a whole
> > >> other discussion.
> > >
> > > It's already allowed by RFC 4007 as far as I know.
> >
> > Well, again, it's a matter of interpretation; the question is simply
not
> > addressed, which is a defect in the document IMHO.
> >
> > >
> > > Stuart's email is an accurate summary of my position.
> >
> > Yes, but that doesn't help with the %251 problem, which is where we
got
> > stuck some months ago and came to the initial decision to add a new
> > delimiter. If people don't want to solve that problem, i.e. accept
that %251 in
> > a URI is %1 in ping, and that %251 in ping is %25251 in a URI, then
we're done.
> >
> > I'm here as a document editor, looking for guidance.
> >
> >      Brian
> >
> >      Brian
> >
> > >
> > > -Dave
> > >
> > >>> And if the interface name itself is going to be escaped using
URI "%xx"
> > >>> notation, then why not escape the '%' the same way?
> > >> My impression is that this WG has already objected to that, which
is
> > >> why we ended up with the current proposal. I leave the next step
to the
> > WG Chair.
> > >>
> > >>    Brian
> > >>
> > >>> This argues in support of what Microsoft already did: Encode '%'
as
> > "%25".
> > >>>
> > >>> It's not my favourite outcome, but based on Dave Thaler's
comment,
> > >>> it's the one that gets my vote.
> > >>>
> > >>> In the spirit of "be liberal with what you accept" the doc
should
> > >>> also advocate that URI parsers are forgiving about accepting
bare
> > >>> '%' signs
> > >>> -- i.e. a '%' not followed by two valid hex characters is left
> > >>> untouched. This lets a human user copy-and-paste "fe80::a%en1"
from
> > >>> a "ping" command and have it work, though the strictly correct
form
> > >>> (which URI generators should output) remains "fe80::a%25en1".
> > >>>
> > >>> Stuart Cheshire


--------------------------------------------------------------------
IETF IPv6 working group mailing list
ipv6@ietf.org
Administrative Requests: https://www.ietf.org/mailman/listinfo/ipv6
--------------------------------------------------------------------

Reply via email to