I'm just a novice but I had to do this recently to store items and attached
files. There is a many-to-many relationship between items and attached
files. If the relationships change I don't want to reindex the
items/attachments.

So I added the item documents (with unique key in ID), I added the
attachment documents (with unique key in ID) and then I added extra
documents to represent the link with the item id stored in the ID field and
the attachment id stored in the ATTID field.

When I have found a matching attachment and I want to look up the items that
contains it I can find all documents that have an ATTID equal to the
attachmentid and then use the list of IDs from the retrieved documents to go
get the actual items if that's what I want to do

I couldn't think of a better way so I look forward to seeing the responses
you get!

-Gwyn

-----Original Message-----
From: Mario Alejandro M. [mailto:[EMAIL PROTECTED] 
Sent: 23 January 2006 15:58
To: Otis Gospodnetic
Cc: [email protected]
Subject: Re: Indexing Urls pointing to same content

I know Lucene is not a web indexer... maybe I explain this bad.

I'm asking in how STORE the data, not in how locate it. If two files are the
same, using MD5 is my actual approach, then I plan to STORE the content once
but is necesary add the two locations.

Example:

c:\file1 Content: One
c:\file2 Content: One

In the index:

Content:One
    Location: C:\File1
    Location: C:\File2

So, or I put the locations and the content in separate Lucene index or I put
it in the same, but I don't know what can be the best, this is the advice
I'm asking for...

>
> Mario Alejandro Montoya
> http://sourceforge.net/projects/mutis
> MUTIS: The Open source Delphi search engine
> AnyNET: Convert from ANY .NET assembly to Delphi code
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to