> > Another thing to try is turning on the infoStream
> > (IndexWriter.setInfoStream(...)) and capture & post the resulting log.
> > It will be very large since it takes quite a while for the error to
> > occur...
>
> I can do that.
Here's a more complete dump. I've modified the code so that I now
remove any existing versions of the document before re-indexing it and
its pages.
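
For readers following along, here is a toy sketch of the delete-then-re-add pattern described above. It is an in-memory stand-in, not Lucene and not the actual LuceneIndexing code; the class and method names (ToyIndex, deleteById, reindex) are hypothetical. In Lucene 2.2 the corresponding calls would presumably be IndexWriter.deleteDocuments(Term) on the id field followed by addDocument(Document), but that mapping is an assumption, since the real code is not shown here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Toy in-memory stand-in for an index; NOT Lucene. All names are hypothetical.
class ToyIndex {
    // Each entry is {id, type, text}, where type is "whole" or "page".
    private final List<String[]> docs = new ArrayList<>();

    // Remove every entry carrying this id; returns how many were removed.
    int deleteById(String id) {
        int removed = 0;
        for (Iterator<String[]> it = docs.iterator(); it.hasNext();) {
            if (it.next()[0].equals(id)) {
                it.remove();
                removed++;
            }
        }
        return removed;
    }

    // Re-index a document: drop any stale versions first, then add the
    // whole-document entry plus one entry per page.
    int reindex(String id, String whole, List<String> pages) {
        int removed = deleteById(id);
        docs.add(new String[] {id, "whole", whole});
        for (String page : pages) {
            docs.add(new String[] {id, "page", page});
        }
        return removed;
    }

    int size() {
        return docs.size();
    }

    public static void main(String[] args) {
        ToyIndex index = new ToyIndex();
        List<String> pages = Arrays.asList("page 0 text", "page 1 text");
        index.reindex("01174-15-2815-270", "full text", pages);
        // Indexing the same id again must not leave duplicates behind.
        int removed = index.reindex("01174-15-2815-270", "full text", pages);
        System.out.println("removed " + removed + ", index now holds " + index.size());
    }
}
```

A two-page document ends up as three entries (the whole document plus its pages), which seems to match the "(3 versions)" count reported in the log below for a document with pages 0 and 1.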
Bill
/Library/Java/Home/bin/java '-Dcom.parc.uplib.indexing.debugMode=true'
'-Dcom.parc.uplib.indexing.indexProperties=contents:title:categories$,*:date@:apparent-mime-type*:authors$\sand\s:comment:abstract:email-message-id*:email-guid*:email-subject:email-from-name:email-from-address*:email-attachment-to*:email-thread-index*:email-references$,*:email-in-reply-to$,*:keywords$,*:album:performer:composer:music-genre*:audio-length:accompaniment:paragraph-ids$,*:sha-hash*'
-classpath
"/local/uplib/share/UpLib-1.7/code/lucene-core-2.2.0.jar:/local/uplib/share/UpLib-1.7/code/LuceneIndexing.jar"
-Dorg.apache.lucene.writeLockTimeout=20000
com.parc.uplib.indexing.LuceneIndexing "/local/janssen-uplib/index" update
/local/janssen-uplib/docs 01174-15-2815-270 01174-15-2552-042 01173-98-5675-575
01173-98-4457-188 01173-83-8266-533 01173-80-8759-205
updating
doc_root_dir is /local/janssen-uplib/docs
Working on document /local/janssen-uplib/docs/01174-15-2815-270
Adding header 'apparent-mime-type' I to 01174-15-2815-270
Adding header 'authors' IT to 01174-15-2815-270
Adding header 'categories' I (article) to 01174-15-2815-270
Adding header 'date' I (20070317) to 01174-15-2815-270
Adding header 'sha-hash' I to 01174-15-2815-270
Created empty doc Document<stored/uncompressed,indexed<id:01174-15-2815-270>
stored/uncompressed,indexed<uplibdate:20070317>
stored/uncompressed,indexed<uplibtype:whole>>
Using charset utf8 for contents.txt
Using language en for contents.txt
page 0 (3566): human nature Full-Mental Nudit
page 1 (3100): I know what you're thinking: W
Using charset utf8 for contents.txt
Using language en for contents.txt
Added 01174-15-2815-270 (3 versions)
Working on document /local/janssen-uplib/docs/01174-15-2552-042
Adding header 'abstract' IT to 01174-15-2552-042
Adding header 'apparent-mime-type' I to 01174-15-2552-042
Adding header 'categories' I (photo) to 01174-15-2552-042
Adding header 'date' I (20070316) to 01174-15-2552-042
Adding header 'sha-hash' I to 01174-15-2552-042
Created empty doc Document<stored/uncompressed,indexed<id:01174-15-2552-042>
stored/uncompressed,indexed<uplibdate:20070317>
stored/uncompressed,indexed<uplibtype:whole>>
Added 01174-15-2552-042 (1 versions)
Working on document /local/janssen-uplib/docs/01173-98-5675-575
Adding header 'apparent-mime-type' I to 01173-98-5675-575
Adding header 'authors' IT to 01173-98-5675-575
Adding header 'categories' I (article) to 01173-98-5675-575
Adding header 'categories' I (medical) to 01173-98-5675-575
Adding header 'date' I (20070313) to 01173-98-5675-575
Adding header 'sha-hash' I to 01173-98-5675-575
Created empty doc Document<stored/uncompressed,indexed<id:01173-98-5675-575>
stored/uncompressed,indexed<uplibdate:20070315>
stored/uncompressed,indexed<uplibtype:whole>>
Using charset utf8 for contents.txt
Using language en for contents.txt
page 0 (2730): March 13, 2007 DOW JONES REPRI
page 1 (4445): But just how far -- and how fa
page 2 (2638): "We don't sell snow tires," sa
page 3 (981): A spokeswoman for Rite Aid say
Using charset utf8 for contents.txt
Using language en for contents.txt
Added 01173-98-5675-575 (5 versions)
Working on document /local/janssen-uplib/docs/01173-98-4457-188
Adding header 'apparent-mime-type' I to 01173-98-4457-188
Adding header 'authors' IT to 01173-98-4457-188
Adding header 'categories' I (article) to 01173-98-4457-188
Adding header 'date' I (19911006) to 01173-98-4457-188
Adding header 'sha-hash' I to 01173-98-4457-188
Created empty doc Document<stored/uncompressed,indexed<id:01173-98-4457-188>
stored/uncompressed,indexed<uplibdate:20070315>
stored/uncompressed,indexed<uplibtype:whole>>
Using charset utf8 for contents.txt
Using language en for contents.txt
page 0 (2897): The Economics of the Colonial
merging segments _ram_0 (1 docs) _ram_1 (1 docs) _ram_2 (1 docs) _ram_3 (1
docs) _ram_4 (1 docs) _ram_5 (1 docs) _ram_6 (1 docs) _ram_7 (1 docs) _ram_8 (1
docs) _ram_9 (1 docs) into _1v9 (10 docs)
flush 6 buffered deleted terms on 6 segments.
[EMAIL PROTECTED] main: now checkpoint "segments_3qj" [isCommit = true]
[EMAIL PROTECTED] main: IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7_1.del": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8_1.del": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.fnm": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.fdx": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.fdt": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.tii": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.tis": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.frq": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.prx": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1v9.nrm": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3qi"
[EMAIL PROTECTED] main: DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "segments_3qi": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3qi"
[EMAIL PROTECTED] main: now checkpoint "segments_3qk" [isCommit = true]
[EMAIL PROTECTED] main: IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v9.cfs": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3qj"
[EMAIL PROTECTED] main: DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v9.fnm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.fnm"
[EMAIL PROTECTED] main: DecRef "_1v9.fdx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.fdx"
[EMAIL PROTECTED] main: DecRef "_1v9.fdt": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.fdt"
[EMAIL PROTECTED] main: DecRef "_1v9.tii": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.tii"
[EMAIL PROTECTED] main: DecRef "_1v9.tis": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.tis"
[EMAIL PROTECTED] main: DecRef "_1v9.frq": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.frq"
[EMAIL PROTECTED] main: DecRef "_1v9.prx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.prx"
[EMAIL PROTECTED] main: DecRef "_1v9.nrm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.nrm"
[EMAIL PROTECTED] main: DecRef "segments_3qj": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3qj"
page 1 (3372): Newsweek. I rely on The Econom
page 2 (3395): print, “I rely on the Wall Str
page 3 (3032): British condescension toward t
page 4 (1100): The “that being so” style of d
Using charset utf8 for contents.txt
Using language en for contents.txt
Added 01173-98-4457-188 (6 versions)
Working on document /local/janssen-uplib/docs/01173-83-8266-533
Adding header 'apparent-mime-type' I to 01173-83-8266-533
Adding header 'categories' I (article) to 01173-83-8266-533
Adding header 'categories' I (medical) to 01173-83-8266-533
Adding header 'categories' I (medical/aging) to 01173-83-8266-533
Adding header 'categories' I (medical) to 01173-83-8266-533
Adding header 'categories' I (medical/exercise) to 01173-83-8266-533
Adding header 'categories' I (medical) to 01173-83-8266-533
Adding header 'date' I (20070313) to 01173-83-8266-533
Adding header 'sha-hash' I to 01173-83-8266-533
Adding header 'source' IT to 01173-83-8266-533
Adding header 'title' IT (Study shows why exercise boosts brainpower) to
01173-83-8266-533
Created empty doc Document<stored/uncompressed,indexed<id:01173-83-8266-533>
stored/uncompressed,indexed<uplibdate:20070313>
stored/uncompressed,indexed<uplibtype:whole>>
Using charset utf8 for contents.txt
Using language en for contents.txt
page 0 (2282): Powered by SAVE THIS | EMAIL T
page 1 (1040): Exercise generated blood flow
Using charset utf8 for contents.txt
Using language en for contents.txt
Added 01173-83-8266-533 (3 versions)
Working on document /local/janssen-uplib/docs/01173-80-8759-205
Adding header 'apparent-mime-type' I to 01173-80-8759-205
Adding header 'categories' I (photo) to 01173-80-8759-205
Adding header 'categories' I (apple) to 01173-80-8759-205
Adding header 'categories' I (apple) to 01173-80-8759-205
Adding header 'date' I (20070312) to 01173-80-8759-205
Adding header 'sha-hash' I to 01173-80-8759-205
Adding header 'title' IT (Apple Store New York showing iPhone ad on March
12th, 2007) to 01173-80-8759-205
Created empty doc Document<stored/uncompressed,indexed<id:01173-80-8759-205>
stored/uncompressed,indexed<uplibdate:20070313>
stored/uncompressed,indexed<uplibtype:whole>>
Added 01173-80-8759-205 (1 versions)
Optimizing...
merging segments _ram_a (1 docs) _ram_b (1 docs) _ram_c (1 docs) _ram_d (1
docs) _ram_e (1 docs) _ram_f (1 docs) _ram_g (1 docs) _ram_h (1 docs) _ram_i (1
docs) into _1va (9 docs)
[EMAIL PROTECTED] main: now checkpoint "segments_3ql" [isCommit = true]
[EMAIL PROTECTED] main: IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v9.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1va.fnm": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.fdx": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.fdt": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.tii": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.tis": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.frq": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.prx": pre-incr count is 0
[EMAIL PROTECTED] main: IncRef "_1va.nrm": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3qk"
[EMAIL PROTECTED] main: DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v9.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "segments_3qk": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3qk"
[EMAIL PROTECTED] main: now checkpoint "segments_3qm" [isCommit = true]
[EMAIL PROTECTED] main: IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v7_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v8_1.del": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1v9.cfs": pre-incr count is 1
[EMAIL PROTECTED] main: IncRef "_1va.cfs": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3ql"
[EMAIL PROTECTED] main: DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v7_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v8_1.del": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1v9.cfs": pre-decr count is 2
[EMAIL PROTECTED] main: DecRef "_1va.fnm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.fnm"
[EMAIL PROTECTED] main: DecRef "_1va.fdx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.fdx"
[EMAIL PROTECTED] main: DecRef "_1va.fdt": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.fdt"
[EMAIL PROTECTED] main: DecRef "_1va.tii": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.tii"
[EMAIL PROTECTED] main: DecRef "_1va.tis": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.tis"
[EMAIL PROTECTED] main: DecRef "_1va.frq": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.frq"
[EMAIL PROTECTED] main: DecRef "_1va.prx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.prx"
[EMAIL PROTECTED] main: DecRef "_1va.nrm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.nrm"
[EMAIL PROTECTED] main: DecRef "segments_3ql": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3ql"
merging segments _1v4 (19680 docs) _1v5 (10 docs) _1v6 (9 docs) _1v7 (10 docs)
_1v8 (9 docs) _1v9 (10 docs) _1va (9 docs)
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.fdt"
[EMAIL PROTECTED] main: delete "_1vb.fdt"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.fdx"
[EMAIL PROTECTED] main: delete "_1vb.fdx"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.fnm"
[EMAIL PROTECTED] main: delete "_1vb.fnm"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.frq"
[EMAIL PROTECTED] main: delete "_1vb.frq"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.prx"
[EMAIL PROTECTED] main: delete "_1vb.prx"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.tii"
[EMAIL PROTECTED] main: delete "_1vb.tii"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file "_1vb.tis"
[EMAIL PROTECTED] main: delete "_1vb.tis"
java.lang.ArrayIndexOutOfBoundsException: Array index out of range: 23754
    at org.apache.lucene.util.BitVector.get(BitVector.java:72)
    at org.apache.lucene.index.SegmentTermDocs.next(SegmentTermDocs.java:118)
    at org.apache.lucene.index.SegmentTermPositions.next(SegmentTermPositions.java:98)
    at org.apache.lucene.index.SegmentMerger.appendPostings(SegmentMerger.java:361)
    at org.apache.lucene.index.SegmentMerger.mergeTermInfo(SegmentMerger.java:325)
    at org.apache.lucene.index.SegmentMerger.mergeTermInfos(SegmentMerger.java:297)
    at org.apache.lucene.index.SegmentMerger.mergeTerms(SegmentMerger.java:261)
    at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:98)
    at org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:1883)
    at org.apache.lucene.index.IndexWriter.optimize(IndexWriter.java:1231)
    at com.parc.uplib.indexing.LuceneIndexing.update(LuceneIndexing.java:419)
    at com.parc.uplib.indexing.LuceneIndexing.main(LuceneIndexing.java:664)
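
One way to read the top frame: BitVector.get was asked about bit 23754, which is larger than the doc count of any segment in the merge (the biggest, _1v4, is listed with 19680 docs). The sketch below is a toy bounds-checked bit set that fails the same way. It is a stand-in for illustration only, not Lucene's BitVector, and sizing it from _1v4 is an assumption; the log does not say which segment's deletions were being read.

```java
// Toy bounds-checked bit set; a stand-in for illustration, NOT Lucene's BitVector.
class ToyBitVector {
    private final byte[] bits;
    private final int size;

    ToyBitVector(int size) {
        this.size = size;
        this.bits = new byte[(size >> 3) + 1];
    }

    // Mark one bit, e.g. "this document number is deleted".
    void set(int bit) {
        if (bit < 0 || bit >= size) {
            throw new ArrayIndexOutOfBoundsException(bit);
        }
        bits[bit >> 3] |= (byte) (1 << (bit & 7));
    }

    // Query one bit; anything at or beyond size is rejected.
    boolean get(int bit) {
        if (bit < 0 || bit >= size) {
            throw new ArrayIndexOutOfBoundsException(bit);
        }
        return (bits[bit >> 3] & (1 << (bit & 7))) != 0;
    }

    public static void main(String[] args) {
        // Assumption: sized like the largest segment in the log (_1v4, 19680 docs).
        ToyBitVector deleted = new ToyBitVector(19680);
        deleted.set(42);
        System.out.println("doc 42 deleted? " + deleted.get(42));
        try {
            deleted.get(23754); // the index reported in the trace
        } catch (ArrayIndexOutOfBoundsException e) {
            System.out.println("caught: " + e.getMessage());
        }
    }
}
```

If that reading is right, the question is how a posting for document number 23754 ended up being checked against a deletions vector that small. The sketch only shows the failure mode, not its cause.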