Lucene + Hadoop

2009-11-10 Thread Hrishikesh Agashe
Hi, I am trying to use Hadoop for Lucene index creation. I have to create multiple indexes based on contents of the files (i.e. if author is "hrishikesh", it should be added to a index for "hrishikesh". There has to be a separate index for every author). For this, I am keeping multiple IndexWri

Re: Lucene + Hadoop

2009-11-10 Thread Otis Gospodnetic
UIMA, NLP, NER, IR - Original Message > From: Hrishikesh Agashe > To: "common-user@hadoop.apache.org" > Sent: Tue, November 10, 2009 4:56:33 AM > Subject: Lucene + Hadoop > > Hi, > > I am trying to use Hadoop for Lucene index creation. I have to cre

Re: Lucene + Hadoop

2009-11-10 Thread Eason.Lee
ikesh Agashe > > To: "common-user@hadoop.apache.org" > > Sent: Tue, November 10, 2009 4:56:33 AM > > Subject: Lucene + Hadoop > > > > Hi, > > > > I am trying to use Hadoop for Lucene index creation. I have to create > multiple > > indexes

Re: Lucene + Hadoop

2009-11-11 Thread Sagar
Checkout MultipleOutputFormat (it is same as per u r implementation ) Having separate index for author may not be a good idea. U can have one index for all authors and query it per author But, I m not sure of requirements -Sagar Hrishikesh Agashe wrote: Hi, I am trying to use Hadoop for Lucene

Server sizing Lucene + Hadoop

2012-05-31 Thread jcuencaa
Hello! I need to do a capacity planning or a server sizing for a Lucene + Hadoop server, it means, plan how many servers and hardware (CPU, memory, etc.) do I need to accomplish with the maximum amount of work that my organization requires in a given period. I haven’t found documentation