Re: [Gen-art] Gen-ART Telechat review of draft-spinosa-urn-lex-13

Alissa Cooper Wed, 19 Feb 2020 20:55:36 -0800

Paul, thanks for your review. I have cc’ed you on my DISCUSS ballot.

Best,
Alissa



> On Feb 18, 2020, at 1:29 PM, Paul Kyzivat <pkyzi...@alum.mit.edu> wrote:
> 
> I am the assigned Gen-ART reviewer for this draft. The General Area Review 
> Team (Gen-ART) reviews all IETF documents being processed by the IESG for the 
> IETF Chair. Please wait for direction from your document shepherd or AD 
> before posting a new version of the draft. For more information, please see 
> the FAQ at <http://wiki.tools.ietf.org/area/gen/trac/wiki/GenArtfaq>.
> 
> Document: draft-spinosa-urn-lex-13
> Reviewer: Paul Kyzivat
> Review Date: 2020-02-18
> IETF LC End Date: 2017-09-14
> IESG Telechat date: 2020-02-18
> 
> Summary:
> 
> This draft is on the right track but has open issues, described in the review.
> 
> General:
> 
> I did a Last Call review on version -11 three years ago. Given the amount of 
> time that has passed, and the extent of changes, I'm surprised that there 
> wasn't another Last Call prior to scheduling the Telechat!
> 
> I had difficulty classifying the severity of the issues. Individually most of 
> them may be minor, but in aggregate they imply that IMO the document still 
> requires a lot of work.
> 
> Issues:
> 
> Major: 1
> Minor: 15
> Nits:  2
> 
> 1) MAJOR: Structure & ordering of the document
> 
> In spite of my earlier experience with this document I found it very 
> difficult to read in a single front to back pass. In retrospect, I think it 
> would have made much more sense if I had read Section 7 first.
> 
> A major problem was that I got the impression it was a goal that an arbitrary 
> legal document reader should be able to construct a valid LEX URN to that 
> document from information readily available to him, typically in the body of 
> the document itself. Yet I found many places where terms must be chosen in 
> ways that are not deterministic.
> 
> In retrospect, especially after reading Section 7, I tentatively reached the 
> conclusion that my first impression was wrong. Instead, it appears that an 
> expert (probably within the Jurisdiction Registrar) must choose the LEX URN 
> representation(s) for the legal document. This can then be incorporated into 
> the document and/or some metadata about the document that is potentially 
> available to those who later have need to reference the document.
> 
> I'm not sure I'm right about this. How this is intended to work should be 
> spelled out early in the document. I suggest moving Section 7 early in the 
> document, and elaborating it to cover this.
> 
> Section 3 (Registration Template) is very hard to follow when reading the 
> document front to back. Understanding it requires information that hasn't yet 
> been provided. In particular, section 3.1 gives an overview of the syntax of 
> 'jurisdiction', but not for 'local-name'. It then goes on to present examples 
> of full lex URNs.
> 
> I suggest that it would be helpful to move sections 2 and 3 to a position 
> later in the document - somewhere after section 5. Near or in Section 11 
> (IANA considerations) would be a good place.
> 
> 2) MINOR(?): Encoding national characters and diacritic signs
> 
> Section 4.4 has guidelines for encoding national characters and diacritic 
> signs for compatibility with DNS. I'm not qualified to evaluate whether these 
> guidelines are in agreement with best practice for DNS. If it hasn't already, 
> this document should be reviewed and approved by relevant experts.
> 
> 3) MINOR: Section 5 (Specific Syntax and Features of the LEX Identifier):
> 
> It isn't clear to me what specific components of the syntax are being 
> addressed in this section. I *think* these apply to 'jurisdiction-unit'. Can 
> you please make this clear?
> 
> I'm concerned that most of the specifications here are only RECOMMENDED. Who 
> makes the decision whether to follow these rules, and how can it be assured 
> that they are followed consistently? It is a problem if a consistent approach 
> to these isn't followed. For instance, when a reader of a document is trying 
> to create a LEX URN manually from a reference in the text, how will he know 
> whether to follow the recommendation or not?
> 
> (However, if the expectation is that an expert constructs the URN then this 
> is less of a concern.)
> 
> 4) MINOR: Annexes and Sub-Annexes:
> 
> When I read Section 6.4 the first time I was confused about the significance 
> of multiple annexes in a name - whether this refers to peer annexes of the 
> main document, or to a sub-annex of an annex. I later learned (in section 7) 
> that it means sub-annexes. This is an example of why the doc needs 
> reordering. Or else spell this out in this section.
> 
> 5) MINOR: "original" as document version
> 
> Both sections 6.6 and C1.2 discuss using "original" as the document version 
> to identify the original version of the document. But this is allowed to be 
> written in different languages. However it doesn't specify how the language 
> for this is chosen. How can someone constructing the URN from a textual 
> reference decide which spelling to use? More specification is needed.
> 
> 6) MINOR: Manifestation:
> 
> The following in section 6.7:
> 
>   Note that the value "all" can be expressed by language-dependent
>   equivalents. To indicate possible features or peculiarities, each
>   main element of the manifestation MAY be followed by further
>   specifications, for example as regards <format> the version, for
>   <editor> the archive name and the electronic publisher, etc.
> 
> doesn't appear to be normative regarding what the "specification" means for 
> any particular element. Rather it merely gives "examples". There needs to be 
> normative text defining the semantics. (Section 7 clarifies things to an 
> extent, but also isn't normative.)
> 
> Also, as with "original" above, the language to be used for "all" isn't 
> specified.
> 
> Also, by using semicolon to separate the format from its specification you 
> have excluded using semicolon to denote mime type parameters, as is 
> conventionally done. (E.g., in Content-Type header fields.) Some mime types 
> have mandatory parameters, so this will exclude using those types at all. 
> Some discussion of mime type parameters is needed.
> 
> The following example:
> 
>   - XML format (version 2.2 DTD NIR) of the text of the act and PDF
>     format (version 1.7) of the "Figura 1" (figure 1) contained in the
>     body, edited by the Italian Senate:
>     "urn:lex:it:stato:legge:2000-04-03;56$text-xml;dtd-nir-
>     2.2:senato.it:testo"
> 
> introduces further confusion, by mapping "XML format (version 2.2 DTD NIR)" 
> to "text-xml;dtd-nir-2.2". I see nothing that specifies how to perform this 
> mapping. Nor how to find out what this means. It appears to be very ad hoc. 
> Also, the XML Format is normally encoded within the XML itself, so why should 
> it be encoded in the LEX URN?
> 
> Then I had trouble deciphering the following example:
> 
>   the Spanish URN of the html format of the whole Judgement of the
>   European Court of Justice n. 33/08 of 11/06/2009, in Spanish version,
>   published in the Jurifast data base in anonymized form:
>   "urn:lex:eu:tribunal.justicia:sentencia:2009-06-11;33-
>   08@original:es$   the Spanish URN of the html format of the whole Judgement 
> of the
>   European Court of Justice n. 33/08 of 11/06/2009, in Spanish version,
>   published in the Jurifast data base in anonymized form:
>   "urn:lex:eu:tribunal.justicia:sentencia:2009-06-11;33-
>   08@original:es$text-html:juradmin.eu;jurifast:todo:anonimo"
> 
> As best I understand, "todo" is the 'component' and "anonimo" is the 
> 'feature. I discovered the Spanish "todo" means "everything" in English. So 
> apparently this means the anonymized form of everything. But how would 
> someone know to use "todo" or "everything"? Is the language for this word 
> determined by language tag in "original:es"? And even if the language is well 
> defined, how is the term to use ("everything" rather than "all", etc.) 
> determined? Again this seems underspecified, and may lead to interoperability 
> problems.
> 
> (Is this covered by section 7.2 that puts this responsibility on the 
> Jurisdictional Registry? If so, it would be good to have an exhaustive list 
> of things that the Jurisdictional Registry must define.)
> 
> 7) MINOR: Partition Identifiers:
> 
> It appears that section 6.8 is attempting to invent a one-off mechanism for 
> partition identifiers that is entirely redundant to the r- and q- components 
> of a URI as specified in RFC8141. ISTM it would be better to embrace those 
> mechanisms.
> 
> 8) MINOR: Resolution Examples:
> 
> While the high level intent of section 8.1 is reasonably clear, the details 
> are not. Some concrete examples showing relevant DNS records would be 
> extremely helpful.
> 
> 9) MINOR: Catalogs for Resolution:
> 
> While a catalog db for lex URN resolution (section 8.2) could offer some 
> advantages, it will be only as good as the content that populates it. For 
> such a catalog to be well populated it will probably need to be fed the 
> results of applying a good resolver algorithm, such as described in section 
> 8.3. If you have that, then the db really only serves as a optimizer.
> 
> I think this section needs better explanation.
> 
> 10) MINOR: Normalizing the Partition ID:
> 
> Section 8.3 says to separate the Partition ID and then use it determine what 
> to return. Not mentioned is that the partition ID itself may be inaccurate, 
> so normalization of it is also likely to be required. This may even involve a 
> trial and error mechanism to try different transformations of the partition 
> ID until the one that works is found. Again it would be helpful for the 
> document to mention this.
> 
> 11) MINOR: Returning multiple manifestations:
> 
> The following in Section 8.3:
> 
>   Lacking more specific indications, the resolver SHOULD select the
>   best (most recent) version of the requested source of law, and
>   provide all the manifestations with their related items.
>   A more specific indication in the uniform name to be resolved will,
>   of course, result in a more selective retrieval, based on any
>   suggested expression and/or manifestations components (e.g. date,
>   language, format, etc.).
> 
> implies that multiple items are to be returned. The form to return these 
> isn't specified. Are these to be returned as a single document containing all 
> this information? (May or may not be possible.) Or as multiple documents? 
> (Represented how?) For instance, if I'm looking for text/plain and there are 
> multiple annexes, will they come back as a multipart/mixed, or a zip of the 
> parts, or simply a concatenation of all the pieces as a single text/plain? 
> How will it know what I'm prepared to receive? (For some retrieval protocols 
> the types I can receive are specified in the request.)
> 
> 12) MINOR: Jurisdiction-code registration
> 
> Section 11.2 doesn't address what is to happen when a domain name that has 
> been used as a jurisdiction code is assigned to a different entity. (And can 
> this also happen for ccTLDs?) The old owner of the domain name may then get a 
> different domain name. And both the old and new entities may want to assign 
> LEX URNs. This situation needs to be addressed.
> 
> And regarding the IANA Jurisdiction-code Registry, it is common practice when 
> defining a new IANA registry to supply a template for what must be supplied. 
> Perhaps the first bullet list in section 11.2 is intended to be that, but it 
> is pretty vague, especially regarding the registrant. It should be more 
> explicit about what information is to be provided and the format for how it 
> is to be provided in a new registration and how it is to be represented in 
> the IANA registry.
> 
> 13) MINOR: jurisdiction-code syntax:
> 
> I suspect the following syntax in Attachment A:
> 
>   jurisdiction-code = 2*alf-dot
> 
> is not exactly what you want, though I'm not certain exactly what you do 
> want. I suspect you are trying to forbid a code that is a single character. 
> If so, then the following might be better:
> 
>   jurisdiction-code = alfanum alf-dot
> 
> 14) MINOR: ABNF ambiguities (Attachment A):
> 
> First, the rule 'specification' is used in a variety of contexts. But the 
> semantics differ with the context. I strongly recommend making them separate 
> rule names that correspond to the semantics they denote. And then refer to 
> the corresponding names in the text.
> 
> Also, the following is ambiguous:
> 
>   ;-------------------------------------------------------------------
>   ; Structure of the <authority> element
>   ;-------------------------------------------------------------------
>   authority = issuer *("+" issuer)
>   issuer = (institution *(";" body-function)) / office
>   institution = alf-dot
>   body-function = alf-dot
>   office = alf-dot
> 
> Something matchng 'alf-dot' could be either an 'institution' or a 'office'. 
> This can present problems when using tools to generate a parser from the ABNF.
> 
> 15) MINOR: Purpose of Attachment D (Http LEX Identifier):
> 
> There are no references to Attachment D within the body of the document, and 
> it seems entirely unnecessary to the understanding of the document. And it 
> doesn't appear to be normative.
> 
> Consideration should be given to moving this to another document describing a 
> system for using and resolving LEX URNs.
> 
> If it is to be retained in this document, then there should be a reference to 
> it from within the body of the document, explaining its reason for being 
> included.
> 
> 16) NIT: Use of "However":
> 
> Section 3.2 says:
> 
>   The "lex" NID syntax conforms to [RFC8141]. However, a series of
>   characters are reserved to identify elements or sub-elements, or for
>   future extensions of the identifier.
> 
> The use of "However" here is confusing. It seems to suggest that this is an 
> exception to conformance with RFC8141. If that is not what is intended, then 
> this should be reworded. For instance:
> 
>   The "lex" NID syntax conforms to [RFC8141]. A series of
>   characters are reserved to identify elements or sub-elements, or for
>   future extensions of the identifier. Hence only a subset of the
>   syntax allowed by RFC8141 is available for definition of local-names.
> 
> 17) NIT: Use of "Recently"
> 
> There are a couple of uses of "recently" that won't (haven't) age(d) well:
> 
> In the following text from section 2:
> 
>         The use of r- and q- components, recently introduced by
>         [RFC8141], with LEX URNs is not defined in this document.
> 
> RFC8141 is now almost three years old. I don't think you can still call this 
> recent.
> 
> There is a similar issue in the following from Section D1:
> 
>   Http URIs have been recently promoted as stable and location-
>   independent identifiers [RFC3986].
> 
> This same wording was in version -11 from three years ago, so it isn't very 
> recent. And if the "recent" reference is really in RFC3986 then it is much 
> older.
> 
> In both cases I think you should find more suitable wording that recognizes 
> that these are by now well established.
> 
> (THE END)
> 
> _______________________________________________
> Gen-art mailing list
> Gen-art@ietf.org
> https://www.ietf.org/mailman/listinfo/gen-art

_______________________________________________
Gen-art mailing list
Gen-art@ietf.org
https://www.ietf.org/mailman/listinfo/gen-art

Re: [Gen-art] Gen-ART Telechat review of draft-spinosa-urn-lex-13

Reply via email to