I can't get away with it … I wrote the old text, and I wrote the code. In defence (UK English) of the text: yes, "normalized to lower case" is ambiguous (should 'FI' go to 'fi' or to 'fı'?), but the W3C rules are that the docs are written in US English, so I think that Jeremy-the-spec-writer finds fault with Jeremy-the-developer in this case.
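For anyone who wants to see the 'fi' versus 'fı' difference for themselves, here is a minimal sketch. The class and method names (LanguageTagDemo, lowerIn, normalize) are made up for illustration and are not the Jena code; the only real API used is String.toLowerCase(Locale) with the standard Locale constants.

```java
import java.util.Locale;

// Hypothetical demo class, not part of Jena.
class LanguageTagDemo {

    // Lowercase a tag using the case rules of an explicitly given locale.
    static String lowerIn(String tag, Locale locale) {
        return tag.toLowerCase(locale);
    }

    // Locale-independent normalization of a language tag, along the lines
    // of the fix proposed in JENA-407 (the Java constant is Locale.ENGLISH).
    static String normalize(String tag) {
        return tag.toLowerCase(Locale.ENGLISH);
    }

    public static void main(String[] args) {
        Locale turkish = Locale.forLanguageTag("tr-TR");
        // Under Turkish rules 'I' lowercases to dotless i (U+0131).
        System.out.println(lowerIn("FI", turkish).equals("f\u0131"));
        // With a fixed English locale the result is plain ASCII "fi".
        System.out.println(normalize("FI").equals("fi"));
    }
}
```

The no-argument tag.toLowerCase() behaves like lowerIn(tag, Locale.getDefault()), which is why the result depends on the machine. Locale.ROOT would be another reasonable choice for the fixed locale.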
Jeremy-the-older-and-wiser-one

On Feb 28, 2013, at 10:01 AM, Andy Seaborne <[email protected]> wrote:

> On 28/02/13 17:39, Jeremy Carroll wrote:
>> My bad.
>>
>> Mistakes from over a decade ago continue to haunt me! Success ?
>>
>> Jeremy
>
> On the contrary! The code exactly follows RDF concepts which says:
>
> "optionally a language tag as defined by [RFC-3066], normalized to lowercase."
>
> so the normalization is separate from being from RFC-3066.
>
> I have raised a bug report on the new "RDF 1.1 Concepts and Abstract Syntax"
> :-)
>
> Andy
>
>>
>>
>> On Feb 28, 2013, at 6:51 AM, Andy Seaborne (JIRA) <[email protected]> wrote:
>>
>>>
>>> [
>>> https://issues.apache.org/jira/browse/JENA-407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
>>> ]
>>>
>>> Andy Seaborne updated JENA-407:
>>> -------------------------------
>>>
>>> Component/s: (was: ARQ)
>>> IRI
>>>
>>>> toLowerCase without Locale.English causing trouble in some language
>>>> regions (Turkey especially)
>>>> -----------------------------------------------------------------------------------------------
>>>>
>>>> Key: JENA-407
>>>> URL: https://issues.apache.org/jira/browse/JENA-407
>>>> Project: Apache Jena
>>>> Issue Type: Bug
>>>> Components: IRI
>>>> Affects Versions: Jena 2.10.0
>>>> Reporter: Simon Helsen
>>>> Priority: Minor
>>>>
>>>> The instance I am referring to concretely is the language tag constructor:
>>>> LanguageTag.
>>>> It makes the following call on line 41: String lc = tag.toLowerCase();
>>>> This should be corrected to String lc = tag.toLowerCase(Locale.English);
>>>> The problem is that otherwise, it use the machine default language to
>>>> produce the lower cases which in some Locales (Turkey being one of them)
>>>> incorrectly lowercases letters like 'I'. Because the tag is a 'technical'
>>>> term (not an actual piece of language) it should lowercase in English
>>>> The effect of this particular instance is that we see
>>>> System.err.println("Internal Error in static initializer of
>>>> IanaLnaguageTag.")
>>>> appear in std.err and it has raised concerns with our customers.
>>>> In general, any occurrence of toLowerCase should be adjusted if it
>>>> lowercases a technical term.
>>>
>>> --
>>> This message is automatically generated by JIRA.
>>> If you think it was sent incorrectly, please contact your JIRA
>>> administrators
>>> For more information on JIRA, see: http://www.atlassian.com/software/jira
>>
>
