Lucene files

Tremont romain Mon, 28 Oct 2002 04:00:43 -0800

Hi folk,

        I was looking in the structure of the index files and here is what 
I have found... I'd like to know if what I have seen is right and if I
didn't forget something:


*.fdt:
        - Start with the number of record
        - for each record
                - record number
                - if it's tokenized
                - record value (which field stored is true, this is for short 
                  field)

*.fnm:
        -contain fields information:
        for each fields:
                - field name
                - if field is indexed
segment:
        - contain the base name of the segment file

*.prx:
        - contain proximity offset for each indexed word

*.freq: 
if the document is not optimize
        - contain document number and frequence of the word in the document
if the document is optimized 
        - contain the frequence of the word in the document 

.f([1-9])+:

        I don't know (here I need your help)

.tii or .tis:
        I m really not sure about this

                - contain word
                - proximity
                - frequence


Thanks in advance :)
                
-- 

Tr�mont romain 
EPITA promotion 2004
Stagiaire chez AIS

--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@;jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@;jakarta.apache.org>

Lucene files

Reply via email to