Please ask these type of questions on the user mailing list, you will get much better responses. The dev list is for developers of Lucene.

To answer your question, yes you can do this. You may find the FieldSelector API additions and Lazy Field Loading to be helpful performance wise, as well.

-Grant

On Mar 17, 2007, at 8:36 AM, jafarim wrote:

Hello
It's a whil that I am using lucene and as most of people seemingly do, I used to save only some important fields of a docuemnt in the index. But recently I thought why not store the whole document bytes as an untokenized
field in the index in order to ease the retrieval process? For example
serialize the pdf file into a byte[] and then save the bytes as a field in the index.(some gzip and base64 encodings may be needed as glue logic). Then I can delete the original file from the system. Is there any reason against
this idea? Can lucene bear this large volume of input streamed data?

--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org

Read the Lucene Java FAQ at http://wiki.apache.org/jakarta-lucene/ LuceneFAQ



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to