I think your on the right tack Ken. Don't know enough about Unicode myself, but I was looking at this this morning, and what you say somewhat jives with what I saw.

I don't think you can just flip that switch though - the index format will not match what its trying to read (having been written in the new format). Which is why that couldn't have been intended to test reading the old format, unless it was mistake (it was also added when the format changed, so its not like it was left around).

Possibly a mistake that works when your Unicode support is older? (Im on Ubuntu 8.10 - not sure what that means to my Unicode level) That unicode version comment looks very interesting - no one really noticing this problem in America (thats mentioned it), and I think Sami is in Europe.

McCandless knows whats wrong I sure (he did that patch), but hes either busy fixing it, or ordering another margarita in tijuana.

In either case, I'm sure the issue will be resolved soon.

- Mark

Ken Krugler wrote:
Ok, it's not a java 1.6 thing it's something else. I also found a box that runs that test ok.

From what I can tell, this is the test that's failing:

http://www.krugle.org/kse/entfiles/lucene/apache.org/java/trunk/src/test/org/apache/lucene/index/TestIndexInput.java#89

This is verifying that the "Modified UTF-8 null bytes" sequence is handled properly, from line 63 in the same file.

I think this is the old, deprecated format for pre-2.4 indexes.

So shouldn't there be a call to setModifiedUTF8StringsMode()? And since this is a one-way setting of the preUTF8Strings flag, It feels like this should be in a separate test.

Without this call, you'll get the result of calling the String class's default constructor with an ill-formed UTF-8 sequence (for Unicode 3.1 or later), since 0xC0 0x80 isn't the shortest form for the u0000 code point.

-- Ken


Mark Miller wrote:
Hey Sami, I've been running tests quite a bit recently with Ubuntu 8.10 and OpenJDK 6 on a 64-bit machine, and I have not seen it once. Just tried again with Sun JDK 6 and 5 32-bit as well, and I am still not seeing it.

Odd.

- Mark

Sami Siren wrote:
I am constantly seeing following error when running "ant test":

[junit] Testcase: testRead(org.apache.lucene.index.TestIndexInput): FAILED
   [junit] expected:<[]> but was:<[??]>
[junit] junit.framework.ComparisonFailure: expected:<[]> but was:<[??]> [junit] at org.apache.lucene.index.TestIndexInput.testRead(TestIndexInput.java:89)

on both intel and amd architectures running linux.

java on AMD:
java version "1.6.0_11"
Java(TM) SE Runtime Environment (build 1.6.0_11-b03)
Java HotSpot(TM) 64-Bit Server VM (build 11.0-b16, mixed mode)

java on Intel:
java version "1.6.0_0"
IcedTea6 1.4 (fedora-7.b12.fc10-x86_64) Runtime Environment (build 1.6.0_0-b12)
OpenJDK 64-Bit Server VM (build 10.0-b19, mixed mode)

java version "1.6.0_11"
Java(TM) SE Runtime Environment (build 1.6.0_11-b03)
Java HotSpot(TM) 64-Bit Server VM (build 11.0-b16, mixed mode)

java version "1.6.0_11"
Java(TM) SE Runtime Environment (build 1.6.0_11-b03)
Java HotSpot(TM) Server VM (build 11.0-b16, mixed mode)

Anyone else seeing this?

--
Sami Siren

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org




---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to