Re: Phrase search with ComplexPhraseQueryParser/SpanQueryParser.

2014-03-06 Thread Modassar Ather
Hi Ahmet, As per your suggestion, I have posted the request with an example on the LUCENE-5205 Jira ticket. Thanks, Modassar On Wed, Mar 5, 2014 at 8:44 PM, Ahmet Arslan wrote: > Hi Modassar, > > Can you post your request (with an example if possible) to the LUCENE-5205 > Jira ticket too? If you don't ha

Re: codec mismatch

2014-03-06 Thread Jason Wee
Hello Mike, Thank you. You were right in your first comment: the expected field, Lucene46FieldInfos, is within the file _0.cfs. We have taken a closer, more detailed look. The problem was that copying the bytes in hex form from Cassandra to the byte array was wrong, because the source offset was set

[blog post] Comparing Document Classification Functions of Lucene and Mahout

2014-03-06 Thread Koji Sekiguchi
Hello, I just posted an article on Comparing Document Classification Functions of Lucene and Mahout. http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html Comments are welcome. :) Thanks! koji -- http://soleami.com/blog/comparing-document-classification

Square of Idf

2014-03-06 Thread Furkan KAMACI
Hi; The Tf-Idf explanation says that: *idf(t)* appears for *t* in both the query and the document, hence it is squared in the equation. DefaultSimilarity does not square it. What is the explanation for this? Thanks; Furkan KAMACI
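
The squaring mentioned in the question can be illustrated with a small sketch. This is not Lucene code: it is a simplified Python model of classic TF-IDF scoring that assumes tf = sqrt(freq) and idf = 1 + ln(numDocs / (docFreq + 1)), and it leaves out coord and queryNorm. The function names and parameters (`term_score`, `boost`, `norm`) are hypothetical. The point it tries to show is that a similarity's idf function can return the plain, unsquared value, while the square arises because the same idf is multiplied in once on the query side and once on the document side.

```python
import math


def idf(doc_freq, num_docs):
    # Plain (unsquared) inverse document frequency, assumed formula.
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def tf(freq):
    # Assumed term-frequency damping: square root of the raw count.
    return math.sqrt(freq)


def term_score(freq, doc_freq, num_docs, boost=1.0, norm=1.0):
    # Query side: the term weight carries idf once.
    query_weight = idf(doc_freq, num_docs) * boost
    # Document side: the field weight carries idf a second time.
    field_weight = tf(freq) * idf(doc_freq, num_docs) * norm
    # The product therefore contains idf squared.
    return query_weight * field_weight
```

Under these assumptions, `term_score(freq, df, n)` equals `tf(freq) * idf(df, n) ** 2`, even though `idf` itself never squares anything.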

tf/idf similarity with modified document similarity

2014-03-06 Thread Christian Reuschling
Hello, what is the best method to score documents similarly to the default similarity, but with the document frequency calculated per query against the matching result document set, not statically against the whole corpus? I didn't find a good and pe
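
One way to picture the request above is a two-pass approach, sketched here in plain Python rather than with any Lucene API. Everything in it is an assumption made for illustration: the `score_with_local_df` name, the in-memory `docs` mapping, the OR-style matching, and the tf and idf formulas (sqrt of the count, and 1 + ln(n / (df + 1))). The first pass collects the documents that match the query, the second computes document frequency over that matching set only and scores with it.

```python
import math
from collections import Counter


def score_with_local_df(docs, query_terms):
    """docs maps doc_id -> list of tokens; returns doc_id -> score."""
    # Pass 1: keep only documents containing at least one query term.
    matches = {
        doc_id: Counter(tokens)
        for doc_id, tokens in docs.items()
        if any(term in tokens for term in query_terms)
    }
    n = len(matches)

    # Document frequency counted over the matching set, not the corpus.
    local_df = {
        term: sum(1 for counts in matches.values() if counts[term] > 0)
        for term in query_terms
    }

    # Pass 2: TF-IDF-style score using the per-query document frequency.
    scores = {}
    for doc_id, counts in matches.items():
        total = 0.0
        for term in query_terms:
            if counts[term] > 0:
                idf = 1.0 + math.log(n / (local_df[term] + 1))
                total += math.sqrt(counts[term]) * idf * idf
        scores[doc_id] = total
    return scores
```

The cost of this design is that nothing can be scored until the full matching set is known, which is why it needs two passes over the results instead of one.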