Hi,
I am trying to use Hadoop for Lucene index creation. I have to create multiple
indexes based on contents of the files (i.e. if author is "hrishikesh", it
should be added to a index for "hrishikesh". There has to be a separate index
for every author). For this, I am keeping multiple IndexWri
UIMA, NLP, NER, IR
- Original Message
> From: Hrishikesh Agashe
> To: "common-user@hadoop.apache.org"
> Sent: Tue, November 10, 2009 4:56:33 AM
> Subject: Lucene + Hadoop
>
> Hi,
>
> I am trying to use Hadoop for Lucene index creation. I have to cre
ikesh Agashe
> > To: "common-user@hadoop.apache.org"
> > Sent: Tue, November 10, 2009 4:56:33 AM
> > Subject: Lucene + Hadoop
> >
> > Hi,
> >
> > I am trying to use Hadoop for Lucene index creation. I have to create
> multiple
> > indexes
Checkout MultipleOutputFormat
(it is same as per u r implementation )
Having separate index for author may not be a good idea.
U can have one index for all authors and query it per author
But, I m not sure of requirements
-Sagar
Hrishikesh Agashe wrote:
Hi,
I am trying to use Hadoop for Lucene
Hello!
I need to do a capacity planning or a server sizing for a Lucene + Hadoop
server, it means, plan how many servers and hardware (CPU, memory, etc.) do
I need to accomplish with the maximum amount of work that my organization
requires in a given period.
I haven’t found documentation