Does Lucene support partition-by-keyword indexing?

2008-03-01 Thread Yin Qiu
Hi, I'm planning to implement a search infrastructure on a P2P overlay. To achieve this, I want to first distribute the indices to various nodes connected by this overlay. My approach is to partition the indices by keyword, that is, one node takes care of certain keywords (or terms). When a simple

Re: Does Lucene support partition-by-keyword indexing?

2008-03-01 Thread Mathieu Lecarme
The easiest way is to split index by Document. In Lucene, index contains Document and inverse index of Term. If you wont to put Term in different place, Document will be duplicated on each index, with only a part of their Term. How will you manage node failure in your network? They were so

Re: Does Lucene support partition-by-keyword indexing?

2008-03-01 Thread 仇寅
Hi, I agree with your point that it is easier to partition index by document. But the partition-by-keyword approach has much greater scalability over the partition-by-document approach. Each query involves communicating with constant number of nodes; while partition-by-doc requires spreading the q

AW: Does Lucene support partition-by-keyword indexing?

2008-03-01 Thread Uwe Goetzke
or your approach? How many queries and how many documents do you expect? Regards Uwe -Ursprüngliche Nachricht- Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Im Auftrag von ?? Gesendet: Sonntag, 2. März 2008 03:05 An: java-user@lucene.apache.org Betreff: Re: Does Lucene support par

Re: Does Lucene support partition-by-keyword indexing?

2008-03-02 Thread eks dev
:05 AM Subject: Re: Does Lucene support partition-by-keyword indexing? Hi, I agree with your point that it is easier to partition index by document. But the partition-by-keyword approach has much greater scalability over the partition-by-document approach. Each query involves communicating with co

Re: Does Lucene support partition-by-keyword indexing?

2008-03-02 Thread Mathieu Lecarme
Le 2 mars 08 à 03:05, 仇寅 a écrit : Hi, I agree with your point that it is easier to partition index by document. But the partition-by-keyword approach has much greater scalability over the partition-by-document approach. Each query involves communicating with constant number of nodes; whi

Re: Does Lucene support partition-by-keyword indexing?

2008-03-02 Thread 仇寅
Hi Mathieu, I hope my previous mail has explained something. My objective is just to do some simple research and to demonstrate the feasibility, so we can leave other options alone. And you talked about caching. Yes, that will also be my concern. Thanks for your advice though. On Sun, Mar 2, 2008

Re: Does Lucene support partition-by-keyword indexing?

2008-03-02 Thread 仇寅
t;evil" query can bring a node "down" but the network is > still working. > > Do you have any results using lucene on a single node for your approach? How > many queries and how many documents do you expect? > > Regards > > Uwe > > -Ursprüngliche N

Re: Does Lucene support partition-by-keyword indexing?

2008-03-02 Thread Mathieu Lecarme
[mailto:[EMAIL PROTECTED] Im Auftrag von ?? Gesendet: Sonntag, 2. März 2008 03:05 An: java-user@lucene.apache.org Betreff: Re: Does Lucene support partition-by-keyword indexing? Hi, I agree with your point that it is easier to partition index by document. But the partition-by-keyword appro

Re: Does Lucene support partition-by-keyword indexing?

2008-03-02 Thread 仇寅
e points of failure. And the scalability > >> seems limited because you would need to recalibrate the whole > >> network when a adding a new node. Why don't you distribute the > >> complete index (we do this after getting it locally zipped and > >> lat

Re: Does Lucene support partition-by-keyword indexing?

2008-03-03 Thread Mathieu Lecarme
ow many documents do you expect? Regards Uwe -Ursprüngliche Nachricht- Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Im Auftrag von ?? Gesendet: Sonntag, 2. März 2008 03:05 An: java-user@lucene.apache.org Betreff: Re: Does Lucene support partition-by-keyword indexing? Hi, I agree wit