I can't get away with it … I wrote the old text, and I wrote the code.

In defence (UK English) of the text, yes, "normalized to lower case" is 
ambiguous: should 'FI' go to 'fi' or to 'fı', but the W3C rules are that the 
docs are written in US English, so I think that Jeremy-the-spec-writer finds 
fault with Jeremy-the-developer in this case.

Jeremy-the-older-and-wiser-one





On Feb 28, 2013, at 10:01 AM, Andy Seaborne <[email protected]> wrote:

> On 28/02/13 17:39, Jeremy Carroll wrote:
>> My bad.
>> 
>> Mistakes from over a decade ago continue to haunt me! Success ?
>> 
>> Jeremy
> 
> On the contrary!  The code exactly follows RDF concepts which says:
> 
> "optionally a language tag as defined by [RFC-3066], normalized to lowercase."
> 
> so the normalization is separate from being from RFC-3066.
> 
> I have raised a bug report on the new "RDF 1.1 Concepts and Abstract Syntax" 
> :-)
> 
>       Andy
> 
>> 
>> 
>> On Feb 28, 2013, at 6:51 AM, Andy Seaborne (JIRA) <[email protected]> wrote:
>> 
>>> 
>>>     [ 
>>> https://issues.apache.org/jira/browse/JENA-407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
>>>  ]
>>> 
>>> Andy Seaborne updated JENA-407:
>>> -------------------------------
>>> 
>>>    Component/s:     (was: ARQ)
>>>                 IRI
>>> 
>>>> toLowerCase without Locale.English causing trouble in some language 
>>>> regions (Turkey especially)
>>>> -----------------------------------------------------------------------------------------------
>>>> 
>>>>                Key: JENA-407
>>>>                URL: https://issues.apache.org/jira/browse/JENA-407
>>>>            Project: Apache Jena
>>>>         Issue Type: Bug
>>>>         Components: IRI
>>>>   Affects Versions: Jena 2.10.0
>>>>           Reporter: Simon Helsen
>>>>           Priority: Minor
>>>> 
>>>> The instance I am referring to concretely is the language tag constructor: 
>>>> LanguageTag.
>>>> It makes the following call on line 41:  String lc = tag.toLowerCase(); 
>>>> This should be corrected to  String lc = tag.toLowerCase(Locale.English);
>>>> The problem is that otherwise, it use the machine default language to 
>>>> produce the lower cases which in some Locales (Turkey being one of them) 
>>>> incorrectly lowercases letters like 'I'. Because the tag is a 'technical' 
>>>> term (not an actual piece of language) it should lowercase in English
>>>> The effect of this particular instance is that we see
>>>> System.err.println("Internal Error in static initializer of 
>>>> IanaLnaguageTag.")
>>>> appear in std.err and it has raised concerns with our customers.
>>>> In general, any occurrence of toLowerCase should be adjusted if it 
>>>> lowercases a technical term.
>>> 
>>> --
>>> This message is automatically generated by JIRA.
>>> If you think it was sent incorrectly, please contact your JIRA 
>>> administrators
>>> For more information on JIRA, see: http://www.atlassian.com/software/jira
>> 
> 

Reply via email to