Re: AND query in SHOULD

2007-11-24 Thread Rapthor
Thanks for this example. I am uncertain about one detail: How do I achieve a search for multiple keywords. Not just green tree but also short road, sky, bird? Is there a chance to add those keywords to the Query q = qp.parse(\green tree\); command? Shai Erera wrote: How about using

Re: Help on FuzzyLikeThisQuery

2007-11-24 Thread markharw00d
Cool Coder wrote: Is there anyway I can specify which terms are MUST, I mean they have to appear in the result and some terms are optional, One hands off approach you could try with this is to rewrite the fuzzyQuery and then set the minimum number of terms you want a match on. e.g.

Re: AND query in SHOULD

2007-11-24 Thread Shai Erera
Hi Not sure I understand the question. You can add as many keywords as you want to the query (like \green tree\ \short road\ sky bird) and it should behave the same (i.e., search in each field. Shai On Nov 24, 2007 10:26 AM, Rapthor [EMAIL PROTECTED] wrote: Thanks for this example. I am

Index: mixing the structure of persistence

2007-11-24 Thread Haroldo Nascimento
Hi, I have a question ? Lucene offers a mixing structure of storage of index, that is, first do search in memoria (ARMDirectory) and in case of not found do search in index file automatically ? For example: Load part of index in memory for do the search fastest. Thnaks

Why exactly are fuzzy queries so slow?

2007-11-24 Thread Timo Nentwig
Hi! I search an 1.5 gig index and fuzzy queries are really slow; something like avg. ~500ms (IndexSearcher.search(Query, HitCollector)). When performing exact queries I archieve response times 25ms. What is it that makes fuzzy queries so slow? Increased index access due to more terms, i.e.

Re: Why exactly are fuzzy queries so slow?

2007-11-24 Thread Mathieu Lecarme
fuzzy are simply not indexed. If you wont to search quickly with fuzzy search, you should index word and their ngrams, it's the do you mean pattern. you first select used word wich share ngram with the query word, the distance is computed with levenstein, and you use this word as a

Problem with indexing

2007-11-24 Thread Liaqat Ali
Hi All, I have one question. I have a small demonstration program in Java to index a single fine stored on the disk. When i index a file with .txt extension containing English text and then use LUKE to retrieve information from Index it retrieves a little bit data. But the problem occurs when

Re: Help on FuzzyLikeThisQuery

2007-11-24 Thread Cool Coder
Now I can see lot improvement on my related help search result. Let me tell you that I have a non-token list which removes all irrelevant tokens from selected help topic. So after filtering all non tokens from the selected help topic, I search help system and show all results. But I am somehow

Re: Why exactly are fuzzy queries so slow?

2007-11-24 Thread markharw00d
The added IO is one factor. Another is the CPU load from doing many edit-distance comparisons between index terms and the provided search term. You can limit the number of edit distance comparisons conducted by setting the minimum prefix length. This is a property of the QueryParser if parsing

How to delete old index

2007-11-24 Thread Cool Coder
Hi, I used have index refreshed in every 4 hr. However after each refresh, I can see two index. I am not sure how can I delete old index. On starting of indexing process, I create Writer writer = new IndexWriter(luceneDirectory,false,luceneAnalyzer,true); At the end of