FW: Searchable Solutions Please

2004-11-02 Thread Karthik N S
Hi Guy's Apologies. I am little Confused with the Search Factor. If the Search Word 'kid' is suppose to return me kid , kid's , kidoos, children 1) Do I need to use Combination of more then one Analysers ??? , If so How. 2) Any Alternate modificatio

Re: How do Lucene applications deal with API changes?

2004-11-02 Thread Bill Janssen
I know, I know, it's bad form to answer your own question. But here's a test program to distinguish between Lucene 1.4.1 and 1.4.2. Bill import java.io.*; import java.lang.Class; import java.lang.reflect.Method; class testLucene { private static boolean compareArgs (Class[] args1, Class[]

How do Lucene applications deal with API changes?

2004-11-02 Thread Bill Janssen
Thanks to Bill Tschumy, who points out that Lucene 1.4.21 *breaks* the API exported by 1.4 by removing a parameter from QueryParser.getFieldQuery(). That means that my NewMultiFieldQueryParser also breaks, since it overrides that method. To fix, just remove the Analyzer parameter from the getField

Re: Search speed

2004-11-02 Thread Doug Cutting
Jeff Munson wrote: Single word searches return pretty fast, but when I try phrases, searching seems to slow considerably. [ ... ] However, if I use this query, contents:"all parts including picture tube guaranteed", it returns hits in 2890 millseconds. Other phrases take longer as well. You coul

Re: Search speed

2004-11-02 Thread Justin Swanhart
If you know all the phrases your are going to search for, you could modify an analyzer to make those phrases into whole terms when you are analyzing. Other than that, you can test the speed of breaking the phrase query up into term queries. You would have to do an AND on all the words in the phra

Re: Search speed

2004-11-02 Thread Paul Elschot
On Tuesday 02 November 2004 17:50, Jeff Munson wrote: > Thanks for the info Paul. The requirements of my search engine are that > I need to search for phrases like "death notice" or "world war ii". You > suggested that I break the phrases into words. Is there a way to break > the phrases into wo

RE: Search speed

2004-11-02 Thread Jeff Munson
Thanks for the info Paul. The requirements of my search engine are that I need to search for phrases like "death notice" or "world war ii". You suggested that I break the phrases into words. Is there a way to break the phrases into words, do the search, and just return the documents with the phr

Content-based similarity search in vector-space for Lucene

2004-11-02 Thread Marcel Hofmann
Hello! For my diploma (available in german), i have written a similarity search, that for an given document (query) returns documents, which content is gradual similar to the query-document. With this functionality, e.g. different versions from an document, plagiats of a publication or related arti

Re: commit lock, graceful handler

2004-11-02 Thread Nader Henein
Graceful, no, I started a discussion on this about two years ago, what I'm doing is a batched indexing so if a crash occurs the next time the application starts up I have an LuceneInit class that goes and ensures that all indecies have no locks on them by simply deleting the lock file and opti

Re: jaspq: dashed numerical values tokenized differently

2004-11-02 Thread Morus Walter
Daniel Taurat writes: > Hi, > I have just another stupid parser question: > There seems to be a special handling of the dash sign "-" different from > Lucene 1.2 at least in Lucene 1.4.RC3 > StandardAnalyzer. > > Examples (1.4RC3): > > A document containing the string "dash-test" is matched by th

Re: Search speed

2004-11-02 Thread Paul Elschot
On Monday 01 November 2004 21:02, Jeff Munson wrote: > I'm looking for tips on speeding up searches since I am a relatively new > user of Lucene. > > I've created a single index with 4.5 million documents. The index has > about 22 fields and one of those fields is the contents of the body tag >