Clemens Vasters created AVRO-3975:
-------------------------------------

             Summary: "displayname" schema attribute
                 Key: AVRO-3975
                 URL: https://issues.apache.org/jira/browse/AVRO-3975
             Project: Apache Avro
          Issue Type: New Feature
          Components: spec
    Affects Versions: 1.11.3
            Reporter: Clemens Vasters


https://avro.apache.org/docs/1.11.1/specification/#names

The "name" attribute's permitted set of characters is, rightfully, constrained 
such that it will cleanly map to most programming languages. 

In data systems, especially in end-user-facing platforms, users ask for 
structural elements of their data to be named in their own language and that 
requires characters outside of the permissible set. Those user-given names 
choices should be preservable in the schema even if the technical "name" is 
produced from the user-defined name such that it conforms to the restrictions. 

I therefore propose a formal "displayname" attribute to be defined for all 
schema constructs that have names, which accepts an arbitrary UTF-8 string. The 
constraint on that attribute is that the displayname MUST follow the same 
relative uniqueness constraints as "name". 

The presence of the "displayname" would have no wire-impact.

I understand that this can be introduced as custom metadata extension, but I 
believe this is a common issue. We are more aware of internationalization 
issues now than when such naming constraints were conceived.    



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to