Reconstructing a field from an index is 1> slow 2> lossy (what about stemmed words? stopwords? )
UNLESS you have stored the data (Field.Store.YES/COMPRESS), in which case you can just get the field from each index and put it in the new one. Tokenization has little to do with this although you could get a similar effect with untokenized fields but why would you want to? I assume you have a way to uniquely identify the documents that you want to combine, relying on the Lucene doc ID is fragile.... If possible, your best bet would be to reconstruct the new index from the source you used to create the original indexes. Maybe a higher-level problem statement would help generate more suggestions. Best Erick On Thu, Feb 26, 2009 at 7:07 AM, liat oren <oren.l...@gmail.com> wrote: > Hi, > > I have two indexes, each has a tokenized field and I would like to combine > them both into one field in a new index. > How can it be done? > (Is it a good approach or is it better to hold them as untokenized text and > only when I create the new index, then to tokenize it?) > > Many thanks, > Liat >