[ https://issues.apache.org/jira/browse/LUCENE-2554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated LUCENE-2554:
---------------------------------------

    Attachment: LUCENE-2554.patch

Attached VERY rough patch.  Most tests pass but at least 2 don't.

I think the fixes to the surrogate dance are close to correct.  The new random test (TestSurrogates) is good at finding bugs there...

The hack I have to forcefully use the test-only preflexrw codec needs to be made more generic, so that we can randomly pick the default codec to use for each test.

> preflex codec doesn't order terms correctly
> -------------------------------------------
>
>                 Key: LUCENE-2554
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2554
>             Project: Lucene - Java
>          Issue Type: Test
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: 4.0
>
>         Attachments: LUCENE-2554.patch
>
>
> The surrogate dance in the preflex codec (which must dynamically remap terms
> from UTF-16 order to Unicode code point order) is buggy.
> To better test it, I want to add a test-only codec, preflexrw, that is able
> to write indices in the pre-flex format.  Then we should also fix tests to
> randomly pick codecs (including preflexrw) so we better test all of our
> codecs.
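
For readers unfamiliar with why the two orders differ: a supplementary character (U+10000 and above) is stored in UTF-16 as a surrogate pair whose code units fall in 0xD800-0xDFFF, so it sorts before any BMP character in U+E000-U+FFFF when compared code unit by code unit, yet after it when compared by code point. The sketch below only illustrates that mismatch. It is not the preflex codec's remapping code, and the class and method names (TermOrderDemo, compareUtf16, compareCodePoints) are hypothetical, chosen for this example.

```java
class TermOrderDemo {

    /** Compares by UTF-16 code unit, which is what String.compareTo does. */
    static int compareUtf16(String a, String b) {
        return a.compareTo(b);
    }

    /** Compares by Unicode code point, decoding surrogate pairs as it goes. */
    static int compareCodePoints(String a, String b) {
        int i = 0;
        int j = 0;
        while (i < a.length() && j < b.length()) {
            int cpA = a.codePointAt(i);
            int cpB = b.codePointAt(j);
            if (cpA != cpB) {
                return Integer.compare(cpA, cpB);
            }
            i += Character.charCount(cpA);
            j += Character.charCount(cpB);
        }
        // One string is a prefix of the other: the shorter one sorts first.
        return Integer.compare(a.length() - i, b.length() - j);
    }

    public static void main(String[] args) {
        // U+FF5E: a single code unit that lies above the surrogate range.
        String bmpHigh = "\uFF5E";
        // U+10000: the first supplementary code point, stored as a surrogate pair.
        String supplementary = "\uD800\uDC00";

        // UTF-16 order puts the supplementary character first...
        System.out.println(compareUtf16(bmpHigh, supplementary) > 0);      // prints true
        // ...while code point order puts the BMP character first.
        System.out.println(compareCodePoints(bmpHigh, supplementary) < 0); // prints true
    }
}
```

Any pair of terms that first differs at such a position comes out in opposite order under the two comparisons, which is the case a randomized test over surrogate-heavy terms is meant to exercise.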