[ANNOUNCE] Apache Lucene 4.9.1 released

2014-09-22 Thread Michael McCandless
September 2014, Apache Luceneā„¢ 4.9.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.9.1 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires

sortedset vs taxonomy

2014-09-22 Thread Yonghui Zhao
If we want to implement simple facet counting feature, it seems we can do it via sortedset or taxonomy writer/reader. Seems sortedset is simpler but doesn't support hierarchical facet count such as A/B/C. I want to know what's advantage/disadvantage of sortedset or taxonomy? Is there any

How to use 'PhraseQuery' with Fuzzy?!

2014-09-22 Thread teko
Hello, Well, I need do a query to locate Phrases like that: Petruz Augusto Petruz Auguzto Petrs Augusto ... Well, in a single word, I can use fuzzy, but, how I can use it in phrases? I think that I need use PhraseQuery, but, I never get any result.. (code below). Thanks [code] IndexReader

phrase query, stop words, and highlighting?

2014-09-22 Thread Rob Nikander
Hi, I just noticed that a search like rooms to go is failing to highlight. (I use FastVectorHighlighter). I know it's caused the stop word (to). Is there a recommended way to fix this? I may just re-index without stop words, and see if that causes any problems. thanks, Rob

How to configure lucene 4.x to read 3.x index files

2014-09-22 Thread Patrick Mi
Hi there, I understood that Lucene V4 could read 3.x index files by configuring Lucene3xCodec but what exactly needs to be done here? I used DEMO code from V4.10.0 to generate v4 index files and could read them without problem. When I tried to read index files generated from V3 I got the

How to rank by name in the same score when using DisjunctionMaxQuery?

2014-09-22 Thread Harry Yu
Hi everyone, I searched an index with 3 its' fields({name, addr, fullname}), and used DisjunctionMaxQuery to rank the results by the max score of 3 fields. When the hits have same score, lucene would rank by doc Id(low doc Id would be front). But I don't want rank by doc Id in that case. I