: There is an inconsistency between the files format page (from Lucene : website) and the source code. It concerns the positions and offsets of term : vectors. It seems that documentation (website) is not up to date. According : to the file format page, offsets and positions are not stored! Is that : correct?
can you cite exactly what about the fileformats doc leads you to believe this? ... a quick search for "offsets" and "positions" finds these lines for me... If the third lowest-order bit is set (0x04), term positions are stored with the term vectors. If the fourth lowest-order bit is set (0x08), term offsets are stored with the term vectors. ...and that's just to start with. -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]