Hi,
Could you run org.apache.lucene.index.CheckIndex on your index and
post the result?
Are these exceptions easily reproduced starting from scratch (new
index)?
More responses/questions below:
crspan wrote:
-- OS: Linux lg99 2.6.5-7.276-smp #1 SMP Fri Sep 28 20:33:22 AKDT
2007 x86_64 x86_64 x86_64 GNU/Linux
-- Lucene: 2.3.2 (tried 2.2.0 as well, since the index was built
around 2.2.0, jdk1.6.0_01 )
Do you see these same exceptions when you run on Lucene 2.2.0?
-- JDK: Sun jdk1.6.0_06 ( from jdk-6u6-linux-x64.bin ) & Sun
jdk1.5.0_15 ( from jdk-1_5_0_15-linux-amd64.bin)
( both installed locally in the user's home directory WITHOUT root
privilege.)
-- Souce code:
System.out.print("\n\n Range = "+range+"\nQuery = "+q.toString()
+"\n") ;
tds = is.search( q, (Filter)null, range );
-- Stack traces (1):
Range = 500
Query = TEXT:illeg^30.820824 TEXT:technolog^22.290413
TEXT:transfer^33.307804 TEXT:bipartisan^20.942562
TEXT:laboratori^18.500801 TEXT:norm^21.193087
TEXT:counterintellig^29.724474 TEXT:spi^19.285275
TEXT:lab^20.497044 TEXT:american^11.090684 TEXT:question^11.929131
TEXT:review^14.588552 TEXT:obtain^17.56319 TEXT:commun^12.5947275
TEXT:nation^10.737445 TEXT:offici^11.375352 TEXT:rep^17.646774
TEXT:contribut^15.35846 TEXT:report^11.633566
TEXT:congress^14.976282 TEXT:justic^16.433678 TEXT:govern^12.003913
TEXT:declassifi^31.553194 TEXT:campaign^14.959521
TEXT:inform^14.187338 TEXT:compani^13.717714
TEXT:classifi^23.613848 TEXT:washington^13.995003
TEXT:hugh^23.138725 TEXT:issu^14.177698 TEXT:space^18.239595 TEXT:
1996^16.198292 TEXT:rocket^21.983511 TEXT:administr^17.11987
TEXT:satellit^21.777317 TEXT:nuclear^20.927034
TEXT:republican^18.929497 TEXT:committe^18.195517
TEXT:intellig^21.868582 TEXT:hous^17.309698 TEXT:democrat^18.528954
TEXT:investig^19.492653 TEXT:panel^22.208527 TEXT:senat^20.456139
TEXT:chines^19.726551 TEXT:sensit^23.52441 TEXT:secur^20.280426
TEXT:depart^21.874023 TEXT:missil^25.32581 TEXT:illeg^27.417799
TEXT:loral^41.9551 TEXT:transfer^33.933247
QueryString:
illeg^30.820824 technolog^22.290413 transfer^33.307804
Error: java.lang.ArrayIndexOutOfBoundsException:
132704java.lang.ArrayIndexOutOfBoundsException: 132704
at org.apache.lucene.search.BooleanScorer2
$Coordinator.coordFactor(BooleanScorer2.java:55)
at org.apache.lucene.search.BooleanScorer2.score
(BooleanScorer2.java:358)
at org.apache.lucene.search.BooleanScorer2.score
(BooleanScorer2.java:320)
at org.apache.lucene.search.IndexSearcher.search
(IndexSearcher.java:146)
at org.apache.lucene.search.IndexSearcher.search
(IndexSearcher.java:113)
at org.apache.lucene.search.Searcher.search(Searcher.java:132)
at org.cr.search.TrecQueryRelevanceFeedback.main
(TrecQueryRelevanceFeedback.java:785)
I'm not sure what could cause this one.
-- Souce code:
TermFreqVector[] termsV = reader.getTermFreqVectors
(docID);
-- Stack traces (2):
Range = 500
Query = TEXT:oceanograph^68.48028 TEXT:vessel^43.191563
QueryString:
oceanograph^68.48028 vessel^43.191563
Error:
java.lang.ArrayIndexOutOfBoundsExceptionjava.lang.ArrayIndexOutOfBound
sException
at org.apache.lucene.index.TermVectorsReader.readTermVector
(TermVectorsReader.java:353)
at org.apache.lucene.index.TermVectorsReader.readTermVectors
(TermVectorsReader.java:287)
at org.apache.lucene.index.TermVectorsReader.get
(TermVectorsReader.java:232)
at org.apache.lucene.index.SegmentReader.getTermFreqVectors
(SegmentReader.java:981)
at org.cr.rf.RelevanceFeedback.RelFeedbackWeight
(RelevanceFeedback.java:145)
at org.cr.search.TrecQueryRelevanceFeedback.main
(TrecQueryRelevanceFeedback.java:789)
This looks like index corruption. If you run CheckIndex it should
detect this. Did you hit any previous exceptions when writing to
this index? Is it possible to send me a copy of the index?
-- Other Info:
* This index can be searched in other programs in the same
environment.
* The same program runs just fine in Window (1.6.0-b105 & 1.5.0_03-
b07), and a HP-UX (1.5.0.05) without those runtime errors.
This is very strange. These same programs that create the above two
exceptions, run fine on Windows & HP-UX? Did you copy the index
between these machines, or is it the very same index in a shared mount?
-- My questions:
---------? What is your reading of those two stack traces?
---------? Where
QueryString:
illeg^30.820824 technolog^22.290413 transfer^33.307804
got printed? It is NOT from my code, so is it part of Lucene's
error message? Which line in Lucene is for this print out?
I don't know!
---------?? As you can see, from the line
System.out.print("\n\n Range = "+range+"\nQuery = "+q.toString()
+"\n") ;
it just printed the 50 terms in the query:
Range = 500
Query = TEXT:illeg^30.820824 TEXT:technolog^22.290413
TEXT:transfer^33.307804 TEXT:bipartisan^20.942562
TEXT:laboratori^18.500801 TEXT:norm^21.193087
TEXT:counterintellig^29.724474 TEXT:spi^19.285275
TEXT:lab^20.497044 TEXT:american^11.090684 TEXT:question^11.929131
TEXT:review^14.588552 TEXT:obtain^17.56319 TEXT:commun^12.5947275
TEXT:nation^10.737445 TEXT:offici^11.375352 TEXT:rep^17.646774
TEXT:contribut^15.35846 TEXT:report^11.633566
TEXT:congress^14.976282 TEXT:justic^16.433678 TEXT:govern^12.003913
TEXT:declassifi^31.553194 TEXT:campaign^14.959521
TEXT:inform^14.187338 TEXT:compani^13.717714
TEXT:classifi^23.613848 TEXT:washington^13.995003
TEXT:hugh^23.138725 TEXT:issu^14.177698 TEXT:space^18.239595 TEXT:
1996^16.198292 TEXT:rocket^21.983511 TEXT:administr^17.11987
TEXT:satellit^21.777317 TEXT:nuclear^20.927034
TEXT:republican^18.929497 TEXT:committe^18.195517
TEXT:intellig^21.868582 TEXT:hous^17.309698 TEXT:democrat^18.528954
TEXT:investig^19.492653 TEXT:panel^22.208527 TEXT:senat^20.456139
TEXT:chines^19.726551 TEXT:sensit^23.52441 TEXT:secur^20.280426
TEXT:depart^21.874023 TEXT:missil^25.32581 TEXT:illeg^27.417799
TEXT:loral^41.9551 TEXT:transfer^33.933247
But how comes it became:
QueryString:
illeg^30.820824 technolog^22.290413 transfer^33.307804
---------? Overall, is this related to the JDK installation? wrong
bin file? without root? etc? Any advice to fix the problem?
Mike
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]