> > Another thing to try is turning on the infoStream
> > (IndexWriter.setInfoStream(...)) and capture & post the resulting log.
> > It will be very large since it takes quite a while for the error to
> > occur...
> 
> I can do that.

Here's a more complete dump.  I've modified the code so that I now
remove any existing versions of the document before re-indexing it and
its pages.

Bill

/Library/Java/Home/bin/java '-Dcom.parc.uplib.indexing.debugMode=true' 
'-Dcom.parc.uplib.indexing.indexProperties=contents:title:categories$,*:date@:apparent-mime-type*:authors$\sand\s:comment:abstract:email-message-id*:email-guid*:email-subject:email-from-name:email-from-address*:email-attachment-to*:email-thread-index*:email-references$,*:email-in-reply-to$,*:keywords$,*:album:performer:composer:music-genre*:audio-length:accompaniment:paragraph-ids$,*:sha-hash*'
 -classpath 
"/local/uplib/share/UpLib-1.7/code/lucene-core-2.2.0.jar:/local/uplib/share/UpLib-1.7/code/LuceneIndexing.jar"
 -Dorg.apache.lucene.writeLockTimeout=20000 
com.parc.uplib.indexing.LuceneIndexing "/local/janssen-uplib/index" update 
/local/janssen-uplib/docs 01174-15-2815-270 01174-15-2552-042 01173-98-5675-575 
01173-98-4457-188 01173-83-8266-533 01173-80-8759-205
updating
doc_root_dir is /local/janssen-uplib/docs
Working on document /local/janssen-uplib/docs/01174-15-2815-270
  Adding header 'apparent-mime-type' I to 01174-15-2815-270
  Adding header 'authors' IT to 01174-15-2815-270
  Adding header 'categories' I (article) to 01174-15-2815-270
  Adding header 'date' I (20070317) to 01174-15-2815-270
  Adding header 'sha-hash' I to 01174-15-2815-270
  Created empty doc Document<stored/uncompressed,indexed<id:01174-15-2815-270> 
stored/uncompressed,indexed<uplibdate:20070317> 
stored/uncompressed,indexed<uplibtype:whole>>
  Using charset utf8 for contents.txt
  Using language en for contents.txt
    page 0 (3566):  human nature Full-Mental Nudit
    page 1 (3100):  I know what you're thinking: W
  Using charset utf8 for contents.txt
  Using language en for contents.txt
Added 01174-15-2815-270 (3 versions)
Working on document /local/janssen-uplib/docs/01174-15-2552-042
  Adding header 'abstract' IT to 01174-15-2552-042
  Adding header 'apparent-mime-type' I to 01174-15-2552-042
  Adding header 'categories' I (photo) to 01174-15-2552-042
  Adding header 'date' I (20070316) to 01174-15-2552-042
  Adding header 'sha-hash' I to 01174-15-2552-042
  Created empty doc Document<stored/uncompressed,indexed<id:01174-15-2552-042> 
stored/uncompressed,indexed<uplibdate:20070317> 
stored/uncompressed,indexed<uplibtype:whole>>
Added 01174-15-2552-042 (1 versions)
Working on document /local/janssen-uplib/docs/01173-98-5675-575
  Adding header 'apparent-mime-type' I to 01173-98-5675-575
  Adding header 'authors' IT to 01173-98-5675-575
  Adding header 'categories' I (article) to 01173-98-5675-575
  Adding header 'categories' I (medical) to 01173-98-5675-575
  Adding header 'date' I (20070313) to 01173-98-5675-575
  Adding header 'sha-hash' I to 01173-98-5675-575
  Created empty doc Document<stored/uncompressed,indexed<id:01173-98-5675-575> 
stored/uncompressed,indexed<uplibdate:20070315> 
stored/uncompressed,indexed<uplibtype:whole>>
  Using charset utf8 for contents.txt
  Using language en for contents.txt
    page 0 (2730):  March 13, 2007 DOW JONES REPRI
    page 1 (4445):  But just how far -- and how fa
    page 2 (2638):  "We don't sell snow tires," sa
    page 3 (981):  A spokeswoman for Rite Aid say
  Using charset utf8 for contents.txt
  Using language en for contents.txt
Added 01173-98-5675-575 (5 versions)
Working on document /local/janssen-uplib/docs/01173-98-4457-188
  Adding header 'apparent-mime-type' I to 01173-98-4457-188
  Adding header 'authors' IT to 01173-98-4457-188
  Adding header 'categories' I (article) to 01173-98-4457-188
  Adding header 'date' I (19911006) to 01173-98-4457-188
  Adding header 'sha-hash' I to 01173-98-4457-188
  Created empty doc Document<stored/uncompressed,indexed<id:01173-98-4457-188> 
stored/uncompressed,indexed<uplibdate:20070315> 
stored/uncompressed,indexed<uplibtype:whole>>
  Using charset utf8 for contents.txt
  Using language en for contents.txt
    page 0 (2897):  The Economics of the Colonial 
merging segments _ram_0 (1 docs) _ram_1 (1 docs) _ram_2 (1 docs) _ram_3 (1 
docs) _ram_4 (1 docs) _ram_5 (1 docs) _ram_6 (1 docs) _ram_7 (1 docs) _ram_8 (1 
docs) _ram_9 (1 docs) into _1v9 (10 docs)
flush 6 buffered deleted terms on 6 segments.
[EMAIL PROTECTED] main: now checkpoint "segments_3qj" [isCommit = true]
[EMAIL PROTECTED] main:   IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7_1.del": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8_1.del": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.fnm": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.fdx": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.fdt": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.tii": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.tis": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.frq": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.prx": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1v9.nrm": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3qi"
[EMAIL PROTECTED] main:   DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "segments_3qi": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3qi"
[EMAIL PROTECTED] main: now checkpoint "segments_3qk" [isCommit = true]
[EMAIL PROTECTED] main:   IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v9.cfs": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3qj"
[EMAIL PROTECTED] main:   DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v9.fnm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.fnm"
[EMAIL PROTECTED] main:   DecRef "_1v9.fdx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.fdx"
[EMAIL PROTECTED] main:   DecRef "_1v9.fdt": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.fdt"
[EMAIL PROTECTED] main:   DecRef "_1v9.tii": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.tii"
[EMAIL PROTECTED] main:   DecRef "_1v9.tis": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.tis"
[EMAIL PROTECTED] main:   DecRef "_1v9.frq": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.frq"
[EMAIL PROTECTED] main:   DecRef "_1v9.prx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.prx"
[EMAIL PROTECTED] main:   DecRef "_1v9.nrm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1v9.nrm"
[EMAIL PROTECTED] main:   DecRef "segments_3qj": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3qj"
    page 1 (3372):  Newsweek. I rely on The Econom
    page 2 (3395):  print, ÒI rely on the Wall Str
    page 3 (3032):  British condescension toward t
    page 4 (1100):  The Òthat being soÓ style of d
  Using charset utf8 for contents.txt
  Using language en for contents.txt
Added 01173-98-4457-188 (6 versions)
Working on document /local/janssen-uplib/docs/01173-83-8266-533
  Adding header 'apparent-mime-type' I to 01173-83-8266-533
  Adding header 'categories' I (article) to 01173-83-8266-533
  Adding header 'categories' I (medical) to 01173-83-8266-533
  Adding header 'categories' I (medical/aging) to 01173-83-8266-533
  Adding header 'categories' I (medical) to 01173-83-8266-533
  Adding header 'categories' I (medical/exercise) to 01173-83-8266-533
  Adding header 'categories' I (medical) to 01173-83-8266-533
  Adding header 'date' I (20070313) to 01173-83-8266-533
  Adding header 'sha-hash' I to 01173-83-8266-533
  Adding header 'source' IT to 01173-83-8266-533
  Adding header 'title' IT (Study shows why exercise boosts brainpower) to 
01173-83-8266-533
  Created empty doc Document<stored/uncompressed,indexed<id:01173-83-8266-533> 
stored/uncompressed,indexed<uplibdate:20070313> 
stored/uncompressed,indexed<uplibtype:whole>>
  Using charset utf8 for contents.txt
  Using language en for contents.txt
    page 0 (2282):  Powered by SAVE THIS | EMAIL T
    page 1 (1040):  Exercise generated blood flow 
  Using charset utf8 for contents.txt
  Using language en for contents.txt
Added 01173-83-8266-533 (3 versions)
Working on document /local/janssen-uplib/docs/01173-80-8759-205
  Adding header 'apparent-mime-type' I to 01173-80-8759-205
  Adding header 'categories' I (photo) to 01173-80-8759-205
  Adding header 'categories' I (apple) to 01173-80-8759-205
  Adding header 'categories' I (apple) to 01173-80-8759-205
  Adding header 'date' I (20070312) to 01173-80-8759-205
  Adding header 'sha-hash' I to 01173-80-8759-205
  Adding header 'title' IT (Apple Store New York showing iPhone ad on March 
12th, 2007) to 01173-80-8759-205
  Created empty doc Document<stored/uncompressed,indexed<id:01173-80-8759-205> 
stored/uncompressed,indexed<uplibdate:20070313> 
stored/uncompressed,indexed<uplibtype:whole>>
Added 01173-80-8759-205 (1 versions)
Optimizing...
merging segments _ram_a (1 docs) _ram_b (1 docs) _ram_c (1 docs) _ram_d (1 
docs) _ram_e (1 docs) _ram_f (1 docs) _ram_g (1 docs) _ram_h (1 docs) _ram_i (1 
docs) into _1va (9 docs)
[EMAIL PROTECTED] main: now checkpoint "segments_3ql" [isCommit = true]
[EMAIL PROTECTED] main:   IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v9.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1va.fnm": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.fdx": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.fdt": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.tii": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.tis": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.frq": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.prx": pre-incr count is 0
[EMAIL PROTECTED] main:   IncRef "_1va.nrm": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3qk"
[EMAIL PROTECTED] main:   DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v9.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "segments_3qk": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3qk"
[EMAIL PROTECTED] main: now checkpoint "segments_3qm" [isCommit = true]
[EMAIL PROTECTED] main:   IncRef "_1v4.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v4_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v5_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v6_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v7_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v8_1.del": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1v9.cfs": pre-incr count is 1
[EMAIL PROTECTED] main:   IncRef "_1va.cfs": pre-incr count is 0
[EMAIL PROTECTED] main: deleteCommits: now remove commit "segments_3ql"
[EMAIL PROTECTED] main:   DecRef "_1v4.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v4_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v5_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v6_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v7_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v8_1.del": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1v9.cfs": pre-decr count is 2
[EMAIL PROTECTED] main:   DecRef "_1va.fnm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.fnm"
[EMAIL PROTECTED] main:   DecRef "_1va.fdx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.fdx"
[EMAIL PROTECTED] main:   DecRef "_1va.fdt": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.fdt"
[EMAIL PROTECTED] main:   DecRef "_1va.tii": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.tii"
[EMAIL PROTECTED] main:   DecRef "_1va.tis": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.tis"
[EMAIL PROTECTED] main:   DecRef "_1va.frq": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.frq"
[EMAIL PROTECTED] main:   DecRef "_1va.prx": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.prx"
[EMAIL PROTECTED] main:   DecRef "_1va.nrm": pre-decr count is 1
[EMAIL PROTECTED] main: delete "_1va.nrm"
[EMAIL PROTECTED] main:   DecRef "segments_3ql": pre-decr count is 1
[EMAIL PROTECTED] main: delete "segments_3ql"
merging segments _1v4 (19680 docs) _1v5 (10 docs) _1v6 (9 docs) _1v7 (10 docs) 
_1v8 (9 docs) _1v9 (10 docs) _1va (9 docs)[EMAIL PROTECTED] main: refresh: 
removing newly created unreferenced file "_1vb.fdt"
[EMAIL PROTECTED] main: delete "_1vb.fdt"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file 
"_1vb.fdx"
[EMAIL PROTECTED] main: delete "_1vb.fdx"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file 
"_1vb.fnm"
[EMAIL PROTECTED] main: delete "_1vb.fnm"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file 
"_1vb.frq"
[EMAIL PROTECTED] main: delete "_1vb.frq"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file 
"_1vb.prx"
[EMAIL PROTECTED] main: delete "_1vb.prx"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file 
"_1vb.tii"
[EMAIL PROTECTED] main: delete "_1vb.tii"
[EMAIL PROTECTED] main: refresh: removing newly created unreferenced file 
"_1vb.tis"
[EMAIL PROTECTED] main: delete "_1vb.tis"
java.lang.ArrayIndexOutOfBoundsException: Array index out of range: 23754
        at org.apache.lucene.util.BitVector.get(BitVector.java:72)
        at 
org.apache.lucene.index.SegmentTermDocs.next(SegmentTermDocs.java:118)
        at 
org.apache.lucene.index.SegmentTermPositions.next(SegmentTermPositions.java:98)
        at 
org.apache.lucene.index.SegmentMerger.appendPostings(SegmentMerger.java:361)
        at 
org.apache.lucene.index.SegmentMerger.mergeTermInfo(SegmentMerger.java:325)
        at 
org.apache.lucene.index.SegmentMerger.mergeTermInfos(SegmentMerger.java:297)
        at 
org.apache.lucene.index.SegmentMerger.mergeTerms(SegmentMerger.java:261)
        at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:98)
        at 
org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:1883)
        at org.apache.lucene.index.IndexWriter.optimize(IndexWriter.java:1231)
        at 
com.parc.uplib.indexing.LuceneIndexing.update(LuceneIndexing.java:419)
        at com.parc.uplib.indexing.LuceneIndexing.main(LuceneIndexing.java:664)

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to