I'm just a novice but I had to do this recently to store items and attached files. There is a many-to-many relationship between items and attached files. If the relationships change I don't want to reindex the items/attachments.
So I added the item documents (with unique key in ID), I added the attachment documents (with unique key in ID) and then I added extra documents to represent the link with the item id stored in the ID field and the attachment id stored in the ATTID field. When I have found a matching attachment and I want to look up the items that contains it I can find all documents that have an ATTID equal to the attachmentid and then use the list of IDs from the retrieved documents to go get the actual items if that's what I want to do I couldn't think of a better way so I look forward to seeing the responses you get! -Gwyn -----Original Message----- From: Mario Alejandro M. [mailto:[EMAIL PROTECTED] Sent: 23 January 2006 15:58 To: Otis Gospodnetic Cc: [email protected] Subject: Re: Indexing Urls pointing to same content I know Lucene is not a web indexer... maybe I explain this bad. I'm asking in how STORE the data, not in how locate it. If two files are the same, using MD5 is my actual approach, then I plan to STORE the content once but is necesary add the two locations. Example: c:\file1 Content: One c:\file2 Content: One In the index: Content:One Location: C:\File1 Location: C:\File2 So, or I put the locations and the content in separate Lucene index or I put it in the same, but I don't know what can be the best, this is the advice I'm asking for... > > Mario Alejandro Montoya > http://sourceforge.net/projects/mutis > MUTIS: The Open source Delphi search engine > AnyNET: Convert from ANY .NET assembly to Delphi code > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
