Duplicated Keyword

2008-01-04 Thread Jae Joo
Hi, Is there any way to dedup the keyword cross the document? Ex. "china" keyword is in doc1 and doc2. Will Solr index have only 1 "china" keyword for both document? Thanks, Jae Joo

Re: Duplicated Keyword

2008-01-04 Thread Robert Young
I don't quite understand what you're getting at. What is the problem you're encountering or what are you trying to achieve? Cheers Rob On Jan 4, 2008 3:26 PM, Jae Joo <[EMAIL PROTECTED]> wrote: > Hi, > > Is there any way to dedup the keyword cross the document? > > Ex. > > "china" keyword is in d

Re: Duplicated Keyword

2008-01-04 Thread Jae Joo
title of Document 1 - "This is document 1 regarding china" - fieldtype = text title of Document 2 - "This is document 2 regarding china" fieldtype=text Once it is indexed, will index hold 2 "china" text fields or just 1 china word which is pointing document1 and document2? Jae On Jan 4, 2008

Re: Duplicated Keyword

2008-01-04 Thread Robert Young
You can think of it as the latter but it's quite a bit more complicated than that. For details on how lucene stores it's index check out the file formats page on lucene. http://lucene.apache.org/java/docs/fileformats.html Cheers Rob On Jan 4, 2008 4:59 PM, Jae Joo <[EMAIL PROTECTED]> wrote: > ti