Are these super large files supposed to be searchable?
Can the binary files be stored somewhere else and just pointed to?
Can the text files be broken up?


-----Original Message-----
From: George Washington [mailto:[EMAIL PROTECTED] 
Sent: Thursday, January 19, 2006 10:52 PM
To: java-user@lucene.apache.org
Subject: Storing large text or binary source documents in the index and
memory usage

I would like to store large source documents (>10MB) in the index in
their 
original form, i.e. as text for text documents or as byte[] for binary 
documents.
I have no difficulty adding the source document as a field to the Lucene

index document, but when I write the index document to the index I 
consistently get out-of-memory errors for documents larger than approx
9MB.
Is there a formula that can help calculate the max size of a document
which 
can be added to the index?
Is there an alternative way to store such large documents that you can 
suggest?
I have 512MB memory under WinXP. Increasing the VM heap size does not
help.
Many thanks

_________________________________________________________________
Make your dream car a reality 
http://a.ninemsn.com.au/b.aspx?URL=http%3A%2F%2Fcarpoint%2Eninemsn%2Ecom
%2Eau&_t=12345&_r=emailtagline&_m=EXT


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to