[Gen-art] Gen-art last call review of draft-ietf-scim-core-schema-17

Elwyn Davies Fri, 17 Apr 2015 14:28:01 -0700

I am the assigned Gen-ART reviewer for this draft. For background on
Gen-ART, please see the FAQ at


<http://wiki.tools.ietf.org/area/gen/trac/wiki/GenArtfaq>.

Please resolve these comments along with any other Last Call comments
you may receive.

Document: draft-ietf-scim-core-schema-17.txt
Reviewer: Elwyn Davies
Review Date: 2015/04/09
IETF LC End Date: 2015/04/20
IESG Telechat date: (if known) -

Summary: Not ready. The 'major' issue identified is really politicalrather than strictly technical although the proposed syntax does limitthe applicability (or at least the easy applicability) of the scheme.Making the schemas more aware of practice outside the basic Englishspeaking world should be an aim of IETF work, IMO. The minor issues aremostly only just more than editorial nits - and there are quite a fewof these also.


Major issues:
===========

s4.1.1, "name" attribute: The definition of this attribute isculturally insensitive. Thecollection of name sub-attribute terms are North American/UK/Aussie/NZEnglish -speaking biased. The authors might wish to considerhttp://www.w3.org/International/questions/qa-personal-names. To alesser extent this also applies to the definition of the addressesattribute in s4.1.2. The issue of the representation of postaladdresses incorporated in I-Ds and RFCs in the xml2rfc schema has beendebated at length on the rfc-interest mailing list. The new (v3)vocabulary replaces the specific sub-attributes with an ordered list of"postalLine" elements (seehttps://tools.ietf.org/html/draft-hoffman-xml2rfc-16#section-2.39).Further, the use of country codes in RFCs has been dropped some timeago. It might be better to represent the address in a less specific wayand leave display up to user interfaces that can consider the relevantlocale. My suggestion, FWIW, would be to have a country, possibly acode field plus an ordered array of postalLines that can contain any ofthe additional components and cater for any locale specific format.


======================================================

Minor issues:
===========

Reference to SCIM Protocol document: At a bare minimum a normativereference to the SCIM protocol document (currentlydraft-ietf-scim-api-16) is needed in s1.2 where the protocol is referredto in the first two definitions. In my opinion, this document would beimproved by the addition of a brief overview of the operation of theSCIM protocol and the implications for the design of the schema. Forexample, s2 talks about 'replacement of a resource': Knowing in advancethat one of the operations anticipated in the protocol is replacementmakes this clearer.

s1.1, Use of OPTIONAL and REQUIRED: These terms are overloaded in thisdocument. The majority of uses are not specifying features of theprotocol as per RFC 2119 but indicating the necessity or otherwise ofthe presence of particular attributes in resource types. AFAICS theonly RFC 2119 usages are one place in s2.2.7 for OPTIONAL and twoadjacent places in s10.3.1 for REQUIRED . To avoid the overloading itwould be easy to omit OPTIONAL and REQUIRED from the RFC 2119 list, usethe alternative RFC 2119 terminology (MAY in s2.2.7 and MUST in s10.3.1)and provide a separate note on the usage of OPTIONAL and REQUIRED in s1.1.

s2.1, Syntax of attribute names: I am confused by the constraintssuggested here.(1) "Attribute names SHOULD be camel-cased": AFAICS this has noimpact on the specification or protocol. My guess is that thespecification has adopted the convention normally used in JavaScript.This is merely a representation of the convention used in SCIM schemasand RFC 2119 language is inappropriate. I suggest replacing this with"This document uses the camel-casing convention for attribute names(e.g., "camelCase").(2) "nameChar = "-" / "_" / DIGIT / ALPHA": Given the closeassociation with JavaScript, it seems inappropriate to allow hyphen (-)as a character in attribute names as this is illegal in JavaScript.

(3) The definition should say whether attribute names are case sensitive.

(4) Even though there is ABNF, it would be useful to note explicitlythat names are limited to a subset of ASCII rather than the much widerJSON string or JavaScript variable character sets.

s2.2.7, $ref: In s2.2.7, $ref is defined as a sub-attribute name butdoes not match the attribute name syntax discussed in the previouscomment for s2.1. Does the attribute name syntax apply to sub-attributes? Or are they just JSON member names?

s2.3, next to last para: To ensure that the service provider knows whatit ought to do to canonicalize a given value, the schema specificationneeds to specify what canonicalization means for each type ofattribute. Having read further on, I see that this is done in mostcases for relevant attributes defined in this draft. A note that thisshould be done generally when defining new schemas is needed here. Thisis particularly important for strings that might haveinternationalization issues (c.f., the discussion of string comparisonin filtering in section 5 of draft-ietf-scim-api-16.)


s7, canonicalValues:  The wording here

         When
         applicable service providers MUST specify the canonical types
         specified in the core schema specification; e.g., "work",
         "home".

seems to imply that the possible canonicalValues mentioned in thedefinitions of User, Group etc. schemas earlier in the draft areactually normative minimum requirements that could, at least in somecases, be extended. The wording used in the earlier sections is ratherless definitive and appears to indicate that the suggested values areexamples that a service provider might possible want to replace if theyconsidered alternative values better suited to their application, e.g.

   userType
      Used to identify the organization to user relationship. Typical
      values used might be "Contractor", "Employee", "Intern", "Temp",
      "External", and "Unknown" but any value may be used.

and

   phoneNumbers
      Phone numbers for the user.   ...  The "display" sub-attribute
      MAY be used to return the canonicalized representation of the
      phone number value.  The sub-attribute "type" often has typical
      values of "work", "home", "mobile", "fax", "pager", and "other",
      and MAY allow more types to be defined by the SCIM clients.

The wording used in the earlier sections seems to need 'tightening up'to make it clear what minimum set of canonicalValues is required forconformance, if indeed that is what is wanted.

s7, caseExact: I think you may need to clarify what case insensitivitymeans for languages other than unaccented English. It may be sufficientto provide a note and a pointer to the discussion of filtering andnormalization in the protocol draft.

s10.3: The registration procedure seems overly complex. If, as stated,an RFC is required in all cases, then the standard (RFC 7035) IETFReview registration policy would seem to fill the bill and there is noneed for a designated expert. Alternatively, Specification Required(with a designated expert as is standard for this case) could be used ifother types of specification could be countenanced. I suspect therequirement for a standards track RFC as a way of modifying an existingvalue is going to come back to bite us if the original specification wasnot standards track. I am not sure this attempt to provide a higherhurdle for modifications is the best way to go about this - In general,IETF Review would, I think, give enough pushback against inappropriateupdates without requiring standards track in all cases. Overall, Irecommend that the authors consult your AD and IANA to determine howbest to structure the registration procedure.


===========================================================

Nits/editorial comments:
=====================
Global: s/e.g. /e.g., /

The term 'endpoint': The term '(network) endpoint' has a particulartechnical meaning in W3C/HTTP jargon although it the usage in (e.g.)http://www.w3.org/TR/wsdl.html seems rather self-referential. It wouldbe useful to provide a definition. Perhaps something like:(Network) endpoint: Also known as a 'port' (seehttp://www.w3.org/TR/wsdl.html). A port has a 'port type' thatidentifies a set of operations invoked by HTTP methods. Each port isidentified by a URI typically constructed from the base URI identifyingthe server implementing the operation and a relative URI bound to theport type. The methods are associated with abstract data types, such asthe schema specified in this document. HTTP messages carry datastructured according to the abstract data types.

Canonicalized URLs: Presumably URLs should be canonicalized in linewith Section 6 of RFC 3986. An appropriate global place to say thiswould be s2.3 I believe. However RFC 6986 offers a 'ladder' ofcanonicalizations and it would be desirable to say what rung on thisladder should be used. Presumably either 6.2.3 or 6.2.4.

s1, para 1, last sentence: The phrase 'redundantly integrated' is notfelicitous. Suggestion:


OLD:
   Similarly, cloud services
   providers seeking to inter-operate with multiple application
   marketplaces or cloud identity providers must be redundantly
   integrated.
NEW:
   Similarly, cloud services
   providers seeking to inter-operate with multiple application
   marketplaces or cloud identity providers would require pairwise
   integration.
END

s1, para 2: Worth adding a reference to [PortableContacts] since youhave it already and its not a 'well-known' item.I fear that LDAP is not a well-known abbreviation within the meaning ofthe act, and needs expanding.

Maybe add a ref to RFC 6350 for vCards.

s2, para 1, 1st sentence: s/contents of which/allowable contents of which/
s2, para 1, 4th sentence: s/alidation/Validation/

s2, para 1, last sentence: s/the attributed defined schema/itscharacteristics as defined in the relevant schema/


s2, para 2: s/extend schema/ extend a schema/ [or "extend schemas"]

s2.1, para 1: s/For each attribute, SCIM schema/For each attribute, aSCIM schema/

s2.2: The list of characteristics and their default values is notassociated with the data type of the attribute but is another set ofattributes of each attribute defined. This would be clearer if the listof defaults and examples was separated out into a new section (probablyafter s2.2). It would be helpful to point out explicitly that thesedefaults apply to all the attributes defined in the draft - I found thetacit assumption of default characteristics in later definitions ofattributes had me asking myself whether certain characteristics ought tohave been defined whereas they were actually covered by the defaults.


s2.2, 1st bullet: For consistency, s/required/REQUIRED/

s2.2, bullet 5:
OLD:
o  have no canonical values (e.g. type is "home" or "work"),
NEW:

o have no canonical values (for example, the "type" sub-attribute inSection 2.3),

END

s2.2.6, Base 64 URL encoding: Presumably the trailing paddingcharacters can be omitted here - this should be mentioned whether or notthey are needed.

s2.2.8: Presumably, in line with s2.3 and the JSON specification, theorder of component attributes is not significant. If this is so, itshould be mentioned here: Perhaps add:

    The order of the component attributes is not significant. Servers and
    clients MUST NOT require or expect attributes to be in
    any specific order when an object is either generated or analyzed.

s2.3, 1st para:  I found this difficult to parse.  Suggest:
OLD:
   Multi-valued attributes contain a list of value or may contain sub-
   attributes and MAY also be considered complex attributes.  The order
   of values returned by the server SHOULD NOT be guaranteed.  The sub-
   attributes below are considered normative and when specified SHOULD
   be used as defined.
NEW:

Multi-valued attributes contain a list of elements, using the JSONarray format

    defined in Section 5 of [RFC7159].  Elements can be either
    o   primitive values, or

o objects with a set of sub-attributes and values, using the JSONobject formatdefined in Section 4 of [RFC7159], in which case they MAY alsobe consideredto be complex attributes. As with complex attributes, theorder of sub-attributesis not significant. The pre-defined sub-attributes listed inthis section can beused with multi-valued attribute objects but thesesub-attributes should only be used

       with the meanings as defined here.

s2.3: Question: Can sub-attributes have sub-sub-attributes? I don'tthink I see any examples and maybe the definition in s1.2 effectivelyexcludes them. Might be worth being explicit.

s2.3, "primary" sub-attribute: Should this be specified as assumed to be"false" if not present in a relevant object? I don't think this iscovered by the defaults anywhere.

s2.3, $ref: I guess this ought always to be canonicalized - this can benoted in the following paragraph where canonicalization is discussed.This would be a good place to specify a reference for URLcanonicalization as mentioned above.

s2.3, last para: Suggest being a little more explicit about the scope ofthis paragraph. I suggest:

OLD:
   Service providers MAY return the same value more than once with
   different types (e.g. the same e-mail address may used for work and
   home), but SHOULD NOT return the same (type, value) combination more
   than once per Attribute, as this complicates processing by the
   Consumer.
NEW:

Service providers MAY return element objects with the same "value"sub-attributemore than once with a different "type" sub-attribute (e.g., thesame e-mail addressmay used for work and home), but SHOULD NOT return the same (type,value)combination more than once per Attribute, as this complicatesprocessing by the

   consumer.
END

Note "Consumer" replaced by "consumer" - there is no definition of aspecific meaning for this term.

s3, Resource Type: s/("meta.resourceType")/("meta.resourceType", seeSection 3.1)/

s3, Schemas Attribute: I think s/the namespace of SCIM schema thatdefines/the namespaces of the SCIM schemas that define/; s/Allrepresentations of SCIM schema MUST include a non-zero value array/Allrepresentations of SCIM schemas MUST include a non-empty array/

s3, name used in example: I don't know if the RFC Editor has a policyon suitable fictitious names equivalent to example.com for domains.Apparently Jane Roe and Mary Major have been used in US legal practiceas female alternatives to the ubiquitous Mr John Doe. Probably good tocheck with the RFC Editor.

s3.1, id, externalId, meta.version, meta.resourceType: I suspect theseought to be caseExact?

s3.1, externalId: The concepts of "provisioning domain" and a "client'stenant" need to be defined. The externalId attribute is not explicitlydefined as REQUIRED or OPTIONAL.

s3.1.1, meta.resource: I got the impression from s3 thatmeta.resourceType was REQUIRED rather than being optional as noted inthe first para of s3.1.1.

s3.1.1, meta.location: Should the value of this sub-attribute be thesame as Content-Location rather than Location? Is it intended that therequest should be redirected (or that the resource was newly created?If not it seems Content-Location would be more appropriate. A normativereference to the relevant HTTP RFC (probably RFC 7231) ought to be included.

s3.1.1, meta.version: Would one expect a weak or strong ETag? Anormative reference to the relevant HTTP RFC (probably RFC 7232) oughtto be included.


s3.2, last sentence: s/Section 6and/Section 6 and/ (missing space).

s3.3, 1st para:    s/used in LDAP/are used in LDAP/;

s/Each "schemas" value indicates additiveschema/Each value in the "schemas" attribute indicates an additive schema/;s/See Figure 5 for an example JSONrepresentation/See Figure 5 for an example of the JSON representation/


s3.3, para 2: s/"schemas" URI value/URI value in the "schemas" attribute/

s4.1.1, userName: Having said that each User MUST have include anon-empty userName value, why is this attribute RECOMMENDED rather thanREQUIRED? I guess it ought to be caseExact also.


s4.1.1, profileUrl: Needs a canonicalization mechanism specified.

s4.1.1, preferredLanguage: There is potentially more than one preferredlanguage (as per Accept-Languages) so this presumably this ought to be aMulti-valued attribute. The Accept-Language header syntax also has anoptional, per language, weight to assist with selection. Should this becatered for here as well? This would presumably mean that it shouldhave sub-attributes (e.g.) using "value" for the name and "weight" orsome such. Also s/localized User interface/localized user interface/

s4.1.1, password: I *hope* there is a discussion of the securityimplications of this field later. A pointer to this discussion would behighly desirable.

s4.1.2, photos: A reference to the canonicalization mechanism is needed(see previous comment).

s4.1.2, entitlements, roles: There doesn't seem to be any good reasonfor capitalizing 'NO' here: s/NO/no/, 2 places.

s4.2, para 2: s/by the service provider are considered/by the serviceprovider, and are considered/

s4.3, employeeNumber: Maybe this might be better called an"employeeIdentifier" since it can be alphanumeric. Is there any reasonwhy this can't just be any old string?

s5, patch: A pointer to the SCIM protocol draft PATCH operation would behelpful.

s5, bulk: A pointer to the SCIM protocol draft Bulk operations sectionwould be helpful. I note that the capitalized form is not used in theprotocol draft: suggest s/BULK/Bulk/ (total of 2 places)

s5, filter: A pointer to some appropriate part of the SCIM protocoldraft (maybe s3.4.2.2) would be helpful.

s6, endpoint: (1)The endpoint is defined to be a relative URI. It istherefore inappropriate that the example here is "/Users". I guess itought to be "Users". There are a number of example of relative URIsstarting with / in the examples in Section 8 that also ought to becorrected.

s6, endpoint: (2) Please bear with me, this is a bit long winded... Iinitially thought that the 'endpoint' mechanism was a possiblecontravention of BCP 190/RFC 7320: Quoting s2.3 of RFC 7320:

    Scheme definitions define the presence, format, and semantics of a
    path component in URIs; all other specifications MUST NOT constrain,
    or define the structure or the semantics for any path component.

    ....

    For example, an application ought not specify a fixed URI path
    "/myapp", since this usurps the host's control of that space.

    Specifying a fixed path relative to another (e.g., {whatever}/myapp)
    is also bad practice (even if "whatever" is discovered as suggested
    in Section 3); while doing so might prevent collisions, it does not
    avoid the potential for operational difficulties (for example, an
    implementation that prefers to use query processing instead, because
    of implementation constraints).

In Section 6, the definition of the endpoint attribute specifies thateach schema has to declare a relative URI or path component that givesaccess to schema instances. My initial thinking was that the endpointvalue was standardized for Users and Groups in the draft. Myinterpretation of s2.3 of RFC 7320 was that this technique is deprecatedas bad practice. After sleeping on it, I think I understand that theendpoint value is *not* standardized and potentially each serviceprovider can use a different endpoint name if they really have to(although I guess in this case it would be good to go with thedefaults.) So I am happy that this isn't flagrantly contravening BCP190, although I am not sure about the query processing bit at the end ofthe quoted section. Conclusion: I think it would be useful to add anote to the definition of endpoint to indicate that it is at the choiceof the service delivering the resources and is not a fixed value, maybesaying that this is intended to avoid infringing BCP 190.


s7, mutability:
OLD:
mutability  A single keyword indicating what types of
                    modifications an attribute MAY accept as follows:
This 'MAY' is not about the 'protocol'.. Suggest:
NEW:
mutability  A single keyword indicating the circumstances under
                    which the value of the attribute can be (re)defined:
END

s9: s/personally identifiable information/personally identifyinginformation/g


s9, 1st bullet: s/mulitple/multiple/

s9: para 1: It would be sensible to also forbid the carrying ofpasswords in requests that are not encrypted.

s9: It would be worth emphasizing that privacy issues should beconsidered whenever resource extensions are defined.

s10.1: This is a request for a new entry in the 'URN Sub-namespace forRegistered Protocol Parameter Identifiers' ...

OLD:
   IANA has created a registry for new IETF URN sub-namespaces,
   "urn:ietf:params:scim:", per [RFC3553].  The registration request is
   as follows:

   Per [RFC3553], IANA has registered a new URN sub-namespace,
   "urn:ietf:params:scim".
NEW:

IANA is requested to add an entry to the 'IETF URN Sub-namespace forRegistered Protocol Parameter Identifiers'registry and create a sub-namespace for the Registered ParameterIdentifier as per [RFC3553]:

   "urn:ietf:params:scim:".
   The registration request is as follows:
END

s10.2: This section is lacking a specification of exactly what isrecorded in the new SCIM registry - the template tells how to apply andconsiderations to be used in granting the request. See Section 8.4 ofRFC 7035, for example, to see what is needed here.



s11.1: Needs a reference to the SCIM protocol document.

s11.2, [Olson-TZ] is incomplete - I suspect it needs a reference to theIANA TZ database http://www.iana.org/time-zones


_______________________________________________
Gen-art mailing list
Gen-art@ietf.org
https://www.ietf.org/mailman/listinfo/gen-art

[Gen-art] Gen-art last call review of draft-ietf-scim-core-schema-17

Reply via email to