Regarding range queries.

2005-08-08 Thread santo santo
Hi all, I am new user of lucene. This query is posted at least once on alomost all lucene mailing lists. The query being about handling of date fields. In my case I need to find documents with dates older than a particular date. So ideally I am not supposed to specify the lower bound. When using

Re: JDBC proxy implementing Lucene?

2005-08-08 Thread Chris Lu
An reuseable aspect will be extremely useful. And it's much more feasible than the JDBC approach. This will only benefit applications already using ORM tools. But this should be the trend though. On 8/8/05, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi Chris, > > --- Chris Lu <[EMAIL PROTECT

Re: JDBC proxy implementing Lucene?

2005-08-08 Thread Otis Gospodnetic
Hi Chris, --- Chris Lu <[EMAIL PROTECTED]> wrote: > Hi, Just an idea to make Lucene work with databases more easily. > > When I communicated with Shay Banon(Compass' author), it came to me > that maybe Lucene can be wrapped around JDBC drivers. Let's say it's > L-JDBC. > > So whenever an object

JDBC proxy implementing Lucene?

2005-08-08 Thread Chris Lu
Hi, Just an idea to make Lucene work with databases more easily. When I communicated with Shay Banon(Compass' author), it came to me that maybe Lucene can be wrapped around JDBC drivers. Let's say it's L-JDBC. So whenever an object is stored through JDBC, according to some XML configuration file,

Re: New Site Live Using Lucene

2005-08-08 Thread David Spencer
Otis Gospodnetic wrote: --- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote: Open Source C/C++ only? When are you going to include Open Source Java? We demand fair treatmant ;) There are several related sites: http://www.searchmorph.com/ Thanks for ref Otis. I run this site, and primarily inde

Re: Lucene score algorithm details?

2005-08-08 Thread Chris Lu
Document boost is distributed to each field when creating the index. It is not stored anywhere. So it's not in the equation to calculate score. -- Chris Lu Lucene Search RAD on Any Database http://www.dbsight.net On 8/8/05, Andrew Hudson <[EMAIL PROTECTED]> wrote: > Is the docboos

Re: Lucene score algorithm details?

2005-08-08 Thread Andrew Hudson
Is the docboost being used in scoring currently? I haven't been able to see a clear connection between it and the score that lucene calculates both empirically and in the scoring code itself. Andrew On 8/8/05, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > It's in the Javadoc for Similarity class

Re: Lucene score algorithm details?

2005-08-08 Thread Otis Gospodnetic
It's in the Javadoc for Similarity class. You can use that along with Explanation (class) to get a good picture of what's going on with scoring. Otis --- Dan Armbrust <[EMAIL PROTECTED]> wrote: > I know there used to be a webpage that gave the algorithm used by > Lucene > for scoring, along wi

Re: Excel Spreadsheet

2005-08-08 Thread christopher may
I think the easiest way to put it is my first cell will be a term. The following cells are then the summary or definition to the term and so on and so on. My search needs to find the term by letter since my page has that option and then return in two separate columns on the results page the Te

Lucene score algorithm details?

2005-08-08 Thread Dan Armbrust
I know there used to be a webpage that gave the algorithm used by Lucene for scoring, along with some info on what each variable controlled, to some extent... I was looking to brush up on what the idf controls (and what will happen if I override it) but I can't seem to find that page any longer

Re: Search shortly after adding a doc

2005-08-08 Thread Erik Hatcher
If you want to match a single document against a Query rapidly, try out the MemoryIndex as part of the contrib/memory codebase in Subversion. Erik On Aug 4, 2005, at 8:14 PM, [EMAIL PROTECTED] wrote: i want to use lucene to search shortly (within a second) after adding a document. c

RE: merging indexes together

2005-08-08 Thread Mordo, Aviran (EXP N-NANNATEK)
Why don't you just add the new information directly to the main index ? As long as you don't get a new IndexReader you should be able to access the old information. Once your indexing and deletion is done just get a new IndexReader instance to access the new documents. Aviran http://www.aviranspla

merging indexes together

2005-08-08 Thread Volodymyr Bychkoviak
Hello All. In my program I index new information to temporary dir and after then I delete outdated information from main index and add new information by calling indexWriter.addIndexes() method. This works fine when doc number is relatively small but when index size grows, every call to addInd

Re: setBoost(float) in org.apache.lucene.document.Field cannot be applied to (double)???

2005-08-08 Thread Stefan Groschupf
Hi, I run in the same problem some weeks ago as well. You can find following in the java doc: "Note: this value is not stored directly with the document in the index. Documents returned from IndexReader.document(int) and Hits.doc (int) may thus not have the same value present as when this fiel

RE: New Site Live Using Lucene

2005-08-08 Thread Otis Gospodnetic
--- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote: > Open Source C/C++ only? When are you going to include Open Source > Java? We demand fair treatmant ;) There are several related sites: http://www.searchmorph.com/ http://www.koders.com/ http://javadocs.org/ ... Otis . . . . . . . . . . . . . .

Re: Analyzer question

2005-08-08 Thread Erik Hatcher
On Aug 8, 2005, at 10:43 AM, Dan Armbrust wrote: It is my understanding that the StandardAnalyzer will remove underscores - so "some_word" be indexed as 'some' and 'word'. I want to keep the underscores, so I was thinking of changing over to an Analyzer that uses the WhiteSpaceTokenizer, Low

Re: Excel Spreadsheet

2005-08-08 Thread Erik Hatcher
On Aug 8, 2005, at 10:30 AM, christopher may wrote: I have a spreadsheet with the first cell being the term and the next cells hold the description. Is there a way I can build this into the index easily or is this going to take a custom analyzer ? Any help or ideas would be greatly appreciat

Analyzer question

2005-08-08 Thread Dan Armbrust
It is my understanding that the StandardAnalyzer will remove underscores - so "some_word" be indexed as 'some' and 'word'. I want to keep the underscores, so I was thinking of changing over to an Analyzer that uses the WhiteSpaceTokenizer, LowerCaseFilter, and StopFilter. What other tokenizin

Excel Spreadsheet

2005-08-08 Thread christopher may
I have a spreadsheet with the first cell being the term and the next cells hold the description. Is there a way I can build this into the index easily or is this going to take a custom analyzer ? Any help or ideas would be greatly appreciated. Thanks -

Re: Reply Split Search Word

2005-08-08 Thread Erik Hatcher
On Aug 8, 2005, at 7:44 AM, Karthik N S wrote: I would like to reformat the Question slightly , Words without double Quotes may also be present in the String. Also I have to apply the STOP - Analyzer to filter out common English words appearing within. Do u mind giving me a bit of

RE: Reply Split Search Word

2005-08-08 Thread Karthik N S
Hi Erik I would like to reformat the Question slightly , Words without double Quotes may also be present in the String. Also I have to apply the STOP - Analyzer to filter out common English words appearing within. Do u mind giving me a bit of src hint for the same... [ I am googled ou

Re: performance: gcj, sun, ibm ?

2005-08-08 Thread Paul Libbrecht
I heard recently that the Wikipedia team actually switched off of using Java lucene because of Java licenses (which we know isn't that perfect) and went on to .NET mono (or something like this) where an implementation of Lucene has been fine and stable, more stable than using gcc, it seems. p

RE: New Site Live Using Lucene

2005-08-08 Thread Kevin L. Cobb
Open Source C/C++ only? When are you going to include Open Source Java? We demand fair treatmant ;) -Original Message- From: Robert Schultz [mailto:[EMAIL PROTECTED] Sent: Sunday, August 07, 2005 6:18 PM To: java-user@lucene.apache.org Subject: New Site Live Using Lucene Not sure if

Re: Reply Split Search Word

2005-08-08 Thread Erik Hatcher
To have an analyzer split that string into 1-5 as you have listed will require you write a custom Analyzer to tokenize with double quotes in mind like that. Erik On Aug 8, 2005, at 12:06 AM, Karthik N S wrote: Hi Luceners Apologies. As I have already replied,Using Analysis I hav

Re: binary of highlighting?

2005-08-08 Thread Riccardo Daviddi
thank you! On 8/7/05, Erik Hatcher <[EMAIL PROTECTED]> wrote: > > On Aug 7, 2005, at 12:17 PM, Riccardo Daviddi wrote: > > Where can I get the binary of all the classes for highlighting? > > There have never been any official releases of the Sandbox/contrib > pieces (though that will change with