Here is my source code where I convert pdf files to text for indexing, I
got this source code from lucene in action examples and adapted it for my
convenience, I hop you could help me to fix this problem, anyway if you know
another more efficient way to do it please tell me how to:
import java.i
Hi every body:
I am getting a problem during the indexing process, I am indexing big
amounts of texts most of them in pdf format I am using pdf box 0.6 version.
The space in hard disk before that the indexing process begin is around 120
Gb but incredibly even when my lucene index doesn't have y
Here are pieces of my source code:
First of all, I search in all the indexes given a query String with a
parallel searcher. As you can see I make a multi field query. Then you can
see the index format I use, I store in the index all the fields. My index is
optimized.
public Hits search
f this helps much, but from your description, I think
there's
a fundamental, correctable problem because nobody would use the product if
it gave such poor search times. And lots of people use it.
Best
Erick
On 10/11/06, Ariel Isaac Romero Cartaya <[EMAIL PROTECTED]> wrote:
>
> Hi e
Hi everybody:
I have a big problem making prallel searches in big indexes.
I have indexed with lucene over 60 000 articles, I have distributed the
indexes in 10 computers nodes so each index not exceed the 60 MB of size. I
makes parallel searches in those indexes but I get the search resu
ged into master index before, that's why I
can't put the boolean parameter true.
I really need help, please.
I'm open to any suggestion.
On 5/15/06, Daniel Naber <[EMAIL PROTECTED]> wrote:
On Montag 15 Mai 2006 19:51, Ariel Isaac Romero wrote:
> IndexReader indexTo
Hi every body:
I need to know how to merge an index into another.
I have a master index whose another indexes are added to it from others
nodes . I want to merge indexes from the
others nodes to master index, I made this method:
public void merge(String MasterIndexDir, String IndexTo
Hi every body:
How do I know the memory size of my RAMDirectory ?
I need to control the memory size of my RAM directory to serialized the
index to disk when ram directory memory get the 100 MB size.
I have a distributed enviroment
I really need to find the way, I must control the size of the inde