You can think of it as the latter but it's quite a bit more
complicated than that. For details on how lucene stores it's index
check out the file formats page on lucene.
http://lucene.apache.org/java/docs/fileformats.html

Cheers
Rob


On Jan 4, 2008 4:59 PM, Jae Joo <[EMAIL PROTECTED]> wrote:
> title of Document 1 - "This is document 1 regarding china" - fieldtype =
> text
> title of Document 2 - "This is document 2 regarding china"  fieldtype=text
>
> Once it is indexed, will index hold  2 "china"  text fields  or just 1 china
> word which is pointing document1 and document2?
>
> Jae
>
>
> On Jan 4, 2008 10:54 AM, Robert Young <[EMAIL PROTECTED]> wrote:
>
> > I don't quite understand what you're getting at. What is the problem
> > you're encountering or what are you trying to achieve?
> >
> > Cheers
> > Rob
> >
> > On Jan 4, 2008 3:26 PM, Jae Joo <[EMAIL PROTECTED]> wrote:
> > > Hi,
> > >
> > > Is there any way to dedup the keyword cross the document?
> > >
> > > Ex.
> > >
> > > "china" keyword is in doc1 and doc2. Will Solr index have only 1 "china"
> > > keyword for both document?
> > >
> > > Thanks,
> > >
> > > Jae Joo
> > >
> >
>

Reply via email to