[ https://issues.apache.org/jira/browse/LUCENE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated LUCENE-1241: --------------------------------------- Attachment: LUCENE-1241.take2.patch Attached take2 patch. I fixed it to apply to trunk, and I removed 0xffff entirely. All tests pass, but... Unfortunately, this change causes a significant net slowdown (5.9%) in indexing throughput. I ran this alg: analyzer=org.apache.lucene.analysis.standard.StandardAnalyzer doc.maker=org.apache.lucene.benchmark.byTask.feeds.LineDocMaker docs.file=/Volumes/External/lucene/wiki.txt doc.stored = true doc.term.vector = true doc.add.log.step=2000 directory=FSDirectory autocommit=false compound=false ram.flush.mb=64 { "Rounds" ResetSystemErase { "BuildIndex" - CreateIndex { "AddDocs" AddDoc > : 200000 - CloseIndex } NewRound } : 5 RepSumByPrefRound BuildIndex I ran the test on an Intel quad core Mac Pro with 4-drive RAID 0. JVM is 1.5 and I run with "-Xms1024M -Xmx1024M -Xbatch -server". Trunk gets 897.3 rec/s and the patch gets 844.3 rec/s, best of 5 = 5.9% slower. I don't think we should commit this. > 0xffff char is not a string terminator > -------------------------------------- > > Key: LUCENE-1241 > URL: https://issues.apache.org/jira/browse/LUCENE-1241 > Project: Lucene - Java > Issue Type: Improvement > Components: Index > Reporter: Hiroaki Kawai > Assignee: Michael McCandless > Attachments: ComparableCharSequence.java, LUCENE-1241.patch, > LUCENE-1241.take2.patch > > > Current trunk index.DocumentWriter uses "\uffff" as a string terminator, but > it should not to be for some reasons. \uffff is not a terminator char itself > and we can't handle a string that really contains \uffff. And also, we can > calculate the end char position in a character sequence from the string > length that we already know. > However, I agree with the usage for assertion, that "\uffff" is placed after > at the end of a string in a char sequence. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]