You can think of it as the latter but it's quite a bit more complicated than that. For details on how lucene stores it's index check out the file formats page on lucene. http://lucene.apache.org/java/docs/fileformats.html
Cheers Rob On Jan 4, 2008 4:59 PM, Jae Joo <[EMAIL PROTECTED]> wrote: > title of Document 1 - "This is document 1 regarding china" - fieldtype = > text > title of Document 2 - "This is document 2 regarding china" fieldtype=text > > Once it is indexed, will index hold 2 "china" text fields or just 1 china > word which is pointing document1 and document2? > > Jae > > > On Jan 4, 2008 10:54 AM, Robert Young <[EMAIL PROTECTED]> wrote: > > > I don't quite understand what you're getting at. What is the problem > > you're encountering or what are you trying to achieve? > > > > Cheers > > Rob > > > > On Jan 4, 2008 3:26 PM, Jae Joo <[EMAIL PROTECTED]> wrote: > > > Hi, > > > > > > Is there any way to dedup the keyword cross the document? > > > > > > Ex. > > > > > > "china" keyword is in doc1 and doc2. Will Solr index have only 1 "china" > > > keyword for both document? > > > > > > Thanks, > > > > > > Jae Joo > > > > > >