Re: Seeking advice on index parameter settings for large index

2005-03-30 Thread Doug Cutting
Chuck Williams wrote: index.setMaxBufferedDocs(10); // Buffer 10 documents at a time in memory (they could be big) You might use a larger value here for the index with the small documents. I've sucessfully used values as high as a 1000 when indexing documents that average a few

Re: pre computing possible search results narrowing and hit counts on those

2005-03-30 Thread Doug Cutting
Antony Sequeira wrote: A user does a search for say condominium, and i show him the 50,000 properties that meet that description. I need two other pieces of information for display - 1. I want to show a select box on the UI, which contains all the cities that appear in those 50,000 documents 2.

searcher question

2005-03-30 Thread Omar Didi
I am having a large index (100GB) and when i run the following code : String indexLocation = servlet.getServletContext().getInitParameter( com.lucene.index ); logger.log( Level.INFO, got the index location from: + indexLocation ); searcher = new IndexSearcher(indexLocation); logger.log(

Re: searcher question

2005-03-30 Thread Doug Cutting
Omar Didi wrote: I am having a large index (100GB) and when i run the following code : String indexLocation = servlet.getServletContext().getInitParameter( com.lucene.index ); logger.log( Level.INFO, got the index location from: + indexLocation ); searcher = new IndexSearcher(indexLocation);

RE: searcher question

2005-03-30 Thread Omar Didi
my java heap is between 128 and 1024 MB, I have 2GB of RAM and about 10 million documents in the index which is broken down to 6 indexes. I am using a multi-searcher to query the index. I am using lucene1.4.3. -Original Message- From: Doug Cutting [mailto:[EMAIL PROTECTED] Sent:

RE: searcher question

2005-03-30 Thread Michael Celona
Curious... what kind of search performance are you getting for an index this size... -Original Message- From: Omar Didi [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 30, 2005 3:15 PM To: java-user@lucene.apache.org Subject: RE: searcher question my java heap is between 128 and 1024

Re: HTML pages highlighter

2005-03-30 Thread Erik Hatcher
How did you index contents? If you did not use a stored field type, then that is the issue. Erik On Mar 30, 2005, at 12:31 PM, Yagnesh Shah wrote: Hello Lucene-User, Is any one try to do highlighting with HTML pages? I am trying to do this using demo example by Keld H. Hansen

RE: HTML pages highlighter

2005-03-30 Thread Yagnesh Shah
Hi! Eric, One more thing, I am using the same HTMLDocument.java that comes with /trunk/src/demo/org/apache/lucene/demo -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 30, 2005 4:01 PM To: java-user@lucene.apache.org Subject: Re: HTML pages

error when query contains numbers

2005-03-30 Thread Omar Didi
hi guys, I am using a QueryParser to search the index. when the query has numbers, i don t get any results?? any suggestions?? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Newbie question

2005-03-30 Thread Luis Medina
Newbie question here, is upgrading Lucene as easy as replacing the old Jar file with a newer version's Jar file? or do I need to recompile the application's code? Thanks, Luis - To unsubscribe, e-mail: [EMAIL PROTECTED] For

RE: HTML pages highlighter

2005-03-30 Thread Yagnesh Shah
Hi! Eric, I try to modified that with this but I get compile error. Do you have any code snippet of highlighting code to pull the contents from the original source? or Do you know how I can do field store? doc.add(new Field(contents, parser.getReader(), Field.Store.YES,

Re: error when query contains numbers

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 8:05 PM, Omar Didi wrote: the .toString() looks excactly like the query I enter: if I search for yahoo AND 200 it returns 0 hits. I am sure there are documents that have 200 hundreds in them. The analyzer I am using is a custom analyzer that has a list of stop words. I don t

RE: error when query contains numbers

2005-03-30 Thread Omar Didi
Thanks Eric, I have looked at the way the documents were indexed and they are using 90% of the code using in chapter 2 and 4 of your book LIA. except for the stop words. I will try to use Luke to see if there are any numbers indexed first. From: Erik Hatcher

LUKE [ NEW VERSION ]

2005-03-30 Thread Karthik N S
Hi Guys. Apologies. :( Can Somebody Please Tell me How to add Custom Analyzer's to the new Version of LUKE , or is there an existing Process to do the same. Thx in advance WITH WARM REGARDS HAVE A NICE DAY [ N.S.KARTHIK]

Re: LUKE [ NEW VERSION ]

2005-03-30 Thread Andrzej Bialecki
Karthik N S wrote: * * * Can Somebody Please Tell me How to add Custom Analyzer's to the new Version of LUKE , * The same way as to the old version - you put them on your classpath when you run Luke, like this: java -cp lukeall.jar;myAnalyzers.jar org.getopt.luke.Luke -- Best

new bie ..

2005-03-30 Thread pashupathinath
hi, i'm a new lucene user. i've few questions regarding indexing and searching. 1)how do i search within tokens ..for example if i've a string my name is abc123. using whitespace analyser i can search for any of these strings but when i search for 123 the search returns zero results. how can i

Re: pre computing possible search results narrowing and hit counts on those

2005-03-30 Thread Antony Sequeira
On Wed, 30 Mar 2005 09:42:32 -0800, Doug Cutting [EMAIL PROTECTED] wrote: Antony Sequeira wrote: A user does a search for say condominium, and i show him the 50,000 properties that meet that description. I need two other pieces of information for display - 1. I want to show a select box

Re: error when query contains numbers

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 4:41 PM, Omar Didi wrote: I am using a QueryParser to search the index. when the query has numbers, i don t get any results?? any suggestions?? What is the .toString of the Query object instance returned from QueryParser? What Analyzer are you using? How did you index the

Re: HTML pages highlighter

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 4:46 PM, Yagnesh Shah wrote: Hi! Eric, Erik - with a 'k' - Sorry, I let it slide once though :) I try to modified that with this but I get compile error. Do you have any code snippet of highlighting code to pull the contents from the original source? I have a whole book full

RE: error when query contains numbers

2005-03-30 Thread Omar Didi
the .toString() looks excactly like the query I enter: if I search for yahoo AND 200 it returns 0 hits. I am sure there are documents that have 200 hundreds in them. The analyzer I am using is a custom analyzer that has a list of stop words. I don t know much about the way data was indexed, I