Re: Filtering query results based on relevance/accuracy

2009-09-29 Thread Alex
Can anybody help? On Sat, Sep 26, 2009 at 11:22 PM, Alex wrote: > Hi Otis, and thank you for helping me out. > > Sorry for the late reply. > > > > Although a PhraseQuery or TermQuery would be perfectly suited in some > cases, this will not work in my case. > > Basically my application's searc

Re: Fuzzy phrase matching using SpanQuery?

2009-09-29 Thread Anshum
Hi Viksit, Why don't you try breaking the query and running a boolean boost query. Building something like ("A B C"~1000)^100 OR ("A B"~1000 OR "B C"~1000 OR "A C"~1000)^10 OR (A OR B OR C) Though this is not a fool proof way to do it and a manual merge is the right way. Also, I remember a similar
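A minimal sketch of how a query string of the shape suggested above could be assembled from a list of terms. The class name `BoostedQueryBuilder`, its method names, and the fixed boosts of 100 and 10 are hypothetical choices for illustration. The code only produces a string meant to be handed to a query parser; it does not call Lucene itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: builds a tiered query string, most specific clause first.
class BoostedQueryBuilder {

    // Wraps the terms in a sloppy phrase, e.g. "A B"~1000
    static String sloppyPhrase(List<String> terms, int slop) {
        return "\"" + String.join(" ", terms) + "\"~" + slop;
    }

    static String build(List<String> terms, int slop) {
        if (terms.size() < 2) {
            throw new IllegalArgumentException("need at least two terms");
        }
        // Tier 1: all terms as one sloppy phrase, highest boost.
        String all = "(" + sloppyPhrase(terms, slop) + ")^100";

        // Tier 2: every pair of terms as a sloppy phrase, medium boost.
        List<String> pairs = new ArrayList<>();
        for (int i = 0; i < terms.size(); i++) {
            for (int j = i + 1; j < terms.size(); j++) {
                pairs.add(sloppyPhrase(Arrays.asList(terms.get(i), terms.get(j)), slop));
            }
        }
        String anyPair = "(" + String.join(" OR ", pairs) + ")^10";

        // Tier 3: any single term, no boost.
        String anyTerm = "(" + String.join(" OR ", terms) + ")";

        return all + " OR " + anyPair + " OR " + anyTerm;
    }
}
```

The number of pair clauses grows quadratically with the number of terms, so this approach is probably only practical for short queries.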

[ANN] Luke 0.9.9 release

2009-09-29 Thread Andrzej Bialecki
Hi all, I'm happy to announce the new release of Luke - the Lucene Index Toolbox. Binaries and sources are available for download at the usual place: http://www.getopt.org/luke/ http://www.getopt.org/luke/lukeall-0.9.9.jar http://www.getopt.org/luke/luke-0.9.9/luke-src-0

Re: [ANN] Luke 0.9.9 release

2009-09-29 Thread Erdinc Yilmazel
Thanks! On Tue, Sep 29, 2009 at 4:06 PM, Andrzej Bialecki wrote: > Hi all, > > I'm happy to announce the new release of Luke - the Lucene Index Toolbox. > > Binaries and sources are available for download at the usual place: > >http://www.getopt.org/luke/ >http://www.getopt.org/l

Filtering on two date fields simultaneously

2009-09-29 Thread Dragan Jotanovic
Hi, I have been thinking for a long time about how to implement this kind of functionality but couldn't figure out anything appropriate. In my Lucene document, I have two date fields: start date and end date. As search input I have the current date (NOW). I need to display all documents for which the current date is between
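One possible reading of this requirement is two range constraints that must both hold: the start date is on or before NOW, and the end date is on or after NOW. The sketch below only builds a query string in the classic `field:[low TO high]` range syntax. The field names `start` and `end`, the `yyyyMMdd` encoding, and the sentinel bounds are assumptions, since the message is cut off before those details are given.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;

// Hypothetical helper: expresses "start <= now <= end" as two required range clauses.
class ActiveNowQuery {
    // Assumed sentinels that sort below and above any real yyyyMMdd value.
    static final String MIN = "00000000";
    static final String MAX = "99999999";

    static String build(LocalDate now) {
        // BASIC_ISO_DATE renders a LocalDate as yyyyMMdd, e.g. 20090929.
        String today = now.format(DateTimeFormatter.BASIC_ISO_DATE);
        // '+' marks each clause as required, so both ranges must match.
        return "+start:[" + MIN + " TO " + today + "] "
             + "+end:[" + today + " TO " + MAX + "]";
    }
}
```

This relies on the dates being indexed as untokenized `yyyyMMdd` strings, so that lexicographic order agrees with chronological order.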

Re: Search with whitespaces

2009-09-29 Thread Alex Bredariol Grilo
To use ShingleFilter, I'd like to change its TOKEN_SEPARATOR, but it's final. Furthermore, I tried to compile its source code but the compiler isn't finding some methods like addAttribute. Does anyone know how I could do that? Alex On Fri, Sep 25, 2009 at 2:42 PM, Robert Muir wrote: > i might s

[ANN] Carrot2 version 3.1.0 released

2009-09-29 Thread Stanislaw Osinski
Dear All, [Apologies for cross-posting.] This is just to let you know that we've released version 3.1.0 of Carrot2 Search Results Clustering Engine. The 3.1.0 release comes with: * Experimental support for clustering Chinese Simplified content (based on Lucene's Smart Chinese Analyzer) * Docume

ANN: New release of Lucene Domain Index for Oracle

2009-09-29 Thread Marcelo Ochoa
Hi All: A new binary distribution of Lucene Domain Index (2.9.0.1.0) for Oracle 10g/11g has been released. Lucene Domain Index is an integration of the Lucene project running inside the Oracle JVM and integrated into the SQL layer by adding a new index type. This new version uses the latest Lucene 2.9.

Problem searching non analyzed fields

2009-09-29 Thread Paul Taylor
I use the same Analyzer for both creating an index and searching. However, I'm having a problem with some fields that I added with Field.Index.NOT_ANALYZED: how can I ensure they are also searched without being analysed? I did this for some fields containing GUIDs and product codes because I did

Re: Problem searching non analyzed fields

2009-09-29 Thread Robert Muir
try checking out PerFieldAnalyzerWrapper, so you can specify how each field is handled, i.e. some fields with KeywordAnalyzer, other fields with StandardAnalyzer, etc. On Tue, Sep 29, 2009 at 6:16 PM, Paul Taylor wrote: > I use the same Analyzer for both creating an index and searching however

RE: Problem searching non analyzed fields

2009-09-29 Thread Uwe Schindler
Hello, If you do not analyze fields during indexing, you cannot really use QueryParser on the search side (because the QueryParser itself always analyzes the entered query string). If you added some fields using NOT_ANALYZED, just use a simple "new TermQuery(new Term(field, term))" to search for
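The mismatch described here can be illustrated with a toy model in plain Java. Nothing below is Lucene code: `toyAnalyze` is a hypothetical stand-in for an analyzer (lowercase, split on non-alphanumerics), and a `Set` of exact strings stands in for a field indexed without analysis. It is only a sketch of why an analyzed query string fails to find a verbatim term while an exact lookup succeeds.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class NotAnalyzedDemo {
    // Toy analyzer: lowercases and splits on runs of non-alphanumeric characters.
    static List<String> toyAnalyze(String text) {
        return Arrays.asList(text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+"));
    }

    // Toy index for one field: each value is kept as a single exact term.
    static Set<String> indexVerbatim(String... values) {
        return new HashSet<>(Arrays.asList(values));
    }

    // Analyzed lookup: every token produced by the analyzer must be an indexed term.
    static boolean matchesAnalyzed(Set<String> index, String input) {
        return index.containsAll(toyAnalyze(input));
    }

    // Exact lookup: the input is compared as one unmodified term.
    static boolean matchesExact(Set<String> index, String input) {
        return index.contains(input);
    }
}
```

With a value such as `AB-123` indexed verbatim, the analyzed lookup searches for `ab` and `123` and finds neither, whereas the exact lookup finds the term.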

Re: Search with whitespaces

2009-09-29 Thread Robert Muir
Alex, to fix your compile problem, check to see if you have Lucene 2.9 core in your classpath. The analyzers contrib depends upon this. If you fix it so you can change TOKEN_SEPARATOR, maybe you want to submit a patch? :) On Tue, Sep 29, 2009 at 12:39 PM, Alex Bredariol Grilo wrote: > To use Shi

TopDocCollector limits

2009-09-29 Thread Max Lynch
Hi, I am developing a search system that doesn't do pagination (searches are run in the background and machine analyzed). However, TopDocCollector makes me put a limit on how many results I want back. For my system, each result found is important. How can I make it collect every result found? T

Re: TopDocCollector limits

2009-09-29 Thread Mark Miller
Max Lynch wrote: > Hi, > I am developing a search system that doesn't do pagination (searches are run > in the background and machine analyzed). However, TopDocCollector makes me > put a limit on how many results I want back. For my system, each result > found is important. How can I make it col

Whitespace/Standard Analyzer and punctuation

2009-09-29 Thread Max Lynch
I would like my searches to match "John Smith" when John Smith appears in a document, but not when the two words are separated by punctuation. For example, when I was using StandardAnalyzer, "John. Smith" was matching, which is wrong for me. Right now I am using WhitespaceAnalyzer but instead searching for "John Smith" "J
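The difference between the two behaviours can be sketched with toy tokenizers in plain Java. These are not the Lucene analyzers: `whitespaceTokens` and `strippedTokens` are hypothetical stand-ins that only mimic the one property at issue, namely whether punctuation stays attached to a token.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

class PunctuationDemo {
    // Splits on whitespace only, so "John." keeps its trailing period.
    static List<String> whitespaceTokens(String text) {
        return Arrays.asList(text.trim().split("\\s+"));
    }

    // Splits on any run of characters that are not letters or digits,
    // so punctuation disappears. Assumes the text starts with a letter or digit.
    static List<String> strippedTokens(String text) {
        return Arrays.asList(text.trim().split("[^\\p{L}\\p{N}]+"));
    }

    // Phrase match: the phrase tokens must occur consecutively in the document tokens.
    static boolean containsPhrase(List<String> doc, List<String> phrase) {
        return Collections.indexOfSubList(doc, phrase) >= 0;
    }
}
```

Under the whitespace-style tokenizer, `John. Smith` yields the tokens `John.` and `Smith`, so the phrase `John Smith` does not match. Under the punctuation-stripping one it does, which is the unwanted match described above.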