What is the difference between the AND and + operator?

2010-11-29 Thread yang Yang
What is the difference between the AND and + operator? ALso,what is the difference between a query and a filter? For example String[] fields={name,address,classId}; If I want to search the document whose classId is '4 and whose name or address contain Zhongzhou Road No 200,I can use two manner:

Re: What is the difference between the AND and + operator?

2010-11-29 Thread Anshum
Hi Yang, About the difference between a filter and a query, the essential one is, filter does not calculate score/relevance and so the sort would vary. Also, having No to be escaped while query formation is, by using appropriate query parser. You may also use an analyzer and search for all terms

Re: best practice: 1.4 billions documents

2010-11-29 Thread Ganesh
I am using ParallelMultiSearcher and querying more than 50 shards. I get wrong results for query A NOT B. I am getting correct results if i query A -B. I am also getting correct results for wildcard and fuzzy. What is the solution? I need to use IndexSearcher with MultiReader? Regards Ganesh

Re: What is the difference between the AND and + operator?

2010-11-29 Thread maven apache
2010/11/29 Anshum ansh...@gmail.com Hi Yang, About the difference between a filter and a query, the essential one is, filter does not calculate score/relevance and so the sort would vary. Thanks,got it. Is any performance between them? Also, having No to be escaped while query formation

Analyzer

2010-11-29 Thread manjula wijewickrema
Hi, In my work, I am using Lucene and two java classes. In the first one, I index a document and in the second one, I try to search the most relevant document for the indexed document in the first one. In the first java class, I use the SnowballAnalyzer in the createIndex method and

Re: How to Cache Filter Results between Servers

2010-11-29 Thread Michael McCandless
IndexReader does not implement hashCode, so it falls back to Object.hashCode which means even if you open an IndexReader on the same index, each one will get a different hashCode. Note that modern versions of Lucene do per-segment searching, which means the toplevel MultiReader is never sent to

Re: IndexWriter Class

2010-11-29 Thread Michael McCandless
Please don't hijack threads; start a new thread instead. Mike 2010/11/28 jiandong yang jdyangch...@gmail.com: hello, all, here comes my question: as far as I know, lucene now offer the feature that ones can search some doc while the index which contains that specific doc is modifying. for

Re: How to Cache Filter Results between Servers

2010-11-29 Thread mark harwood
1. why ir.hashCode() returns different value every time I run this code? Presumably because it is a different object instance in a different JVM? IndexReader.hashCode() and IndexReader.equals() are not designed to represent/summarise the physical contents of an index. They are

Re: best practice: 1.4 billions documents

2010-11-29 Thread Ian Lea
Yes, try a searcher on top of a MultiReader. But I would have thought that A NOT B and A -B were equivalent so you may other issues. If so, post again in a new thread - with details, after switching to MultiReader, if the problem persists. -- Ian. On Mon, Nov 29, 2010 at 9:25 AM, Ganesh

Attributes and scoring

2010-11-29 Thread Nicolas ThiƩbaud
Hi everybody, I'm a bit confused and didn't find a clearly stated answer in the documentation. While analyzing text, Token filters can add data, or custom data to token by the means of Attributes (ex: TypeAttribute, OffsetAttribute, etc..). This information is available to the following filters,

precision and recall in lucene

2010-11-29 Thread Yakob
hello all I was wondering, if I want to measure precision and recall in lucene then what's the best way for me to do it? is there any sample cource code that I can use? thanks though -- http://jacobian.web.id - To unsubscribe,

Re: precision and recall in lucene

2010-11-29 Thread Erick Erickson
Define precision. Define recall. Define measure G Sorry to give in to my impulses, but this question is so broad it's unanswerable. Try looking at the Text REtrieval Conference for instance. Lots of very bright people spend significant amounts of their careers trying to just define what these

RE: Analyzer

2010-11-29 Thread Steven A Rowe
Hi Manjula, It's not terribly clear what you're doing here - I got lost in your description of your (two? or maybe four?) classes. Sometimes things are easier to understand if you provide more concrete detail. I suspect that you could benefit from reading the book Lucene in Action, 2nd

Re: precision and recall in lucene

2010-11-29 Thread Erick Erickson
Well, I guess I can answer your original question with no. There's no Lucene method that will give you these because they aren't defined. If you can answer the question given a corpus and a set of queries and the correct ordering of the relevant documents, how close does Lucene come to that

Re: Custom token attributes and payload. XML analyzing.

2010-11-29 Thread Erick Erickson
Sounds like payloads, have you investigated them? Here's a place to start: http://www.lucidimagination.com/blog/2009/08/05/getting-started-with-payloads/ http://www.lucidimagination.com/blog/2009/08/05/getting-started-with-payloads/ Best Erick On Mon, Nov 29, 2010 at 2:49 PM, Fabiano Nunes

Directory objects for index

2010-11-29 Thread Mark Kristensson
While implementing a solution for keeping warmed indexReaders around for our various indexes (so users don't have to wait while we open an indexReader for our one slow index), I've run into some serious problems trying to track down all of the outstanding file handles to my indexes. For a

Re: Analyzer

2010-11-29 Thread manjula wijewickrema
Hi Steve, Thanx a lot for your reply. Yes there are only two classes and it's corrcet that the way you have realized the problem. As you have instructed, I checked WhitespaceAnalyzer for querying (instead of StandardAnalyzer) and it seems to me that it gives better results rather than

Re: What is the difference between the AND and + operator?

2010-11-29 Thread Chris Hostetter
: Subject: What is the difference between the AND and + operator? In this query, y is mandatory, but documents that also match x will score higher then documents that only match y... x +y In both of these queries, x and y are both mandatory; a document only matching one of them will