[ 
https://issues.apache.org/jira/browse/LUCENE-2073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779004#action_12779004
 ] 

Robert Muir commented on LUCENE-2073:
-------------------------------------

Mark, but here is the crux of the issue:

While i think casing property is normative and should not change, new 
characters can be introduced with new casing properties.
This of course should not affect Simple/StopAnalyzer, but may affect 
StandardAnalyzer.

The reason is that StandardTokenizer contains hardcoded (sometimes oversized) 
ranges that may include some characters that were previously unassigned in 
Unicode 3.
If they are assigned in Unicode 4, with a casing property, then this means for 
lucene, they were indexed in uppercase in 1.4 but lowercase in 1.5... i hope 
this makes sense.

> Document issues involved in building your index with one jdk version and then 
> searching/updating with another
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-2073
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2073
>             Project: Lucene - Java
>          Issue Type: Task
>            Reporter: Mark Miller
>         Attachments: LUCENE-2073.patch, LUCENE-2073.patch
>
>
> I think this needs to go in something of a permenant spot - this isn't a one 
> time release type issues - its going to present over multiple release.
> {quote}
> If there is nothing we can do here, then we just have to do the best we can -
> such as a prominent notice alerting that if you transition JVM's between 
> building and searching the index and you are using or doing X, things will 
> break.
> We should put this in a spot that is always pretty visible - perhaps even a 
> new readme file titlted something like IndexBackwardCompatibility or 
> something, to which we can add other tips and gotchyas as they come up. Or 
> MaintainingIndicesAcrossVersions, or 
> FancyWhateverGetsYourAttentionAboutUpgradingStuff. Or a permanent 
> entry/sticky entry at the top of Changes.
> {quote}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to