Re: Related searches

2006-02-01 Thread Dave Kor
On 1/30/06, Leon Chaddock [EMAIL PROTECTED] wrote: Hi, Does anyone know if it is possible to show related searches with lucene, for example if someone searched for car insurance you could bring back the results and related searches like these One possible way is to use the vector space

Memory problem

2006-02-01 Thread Leon Chaddock
Hi All, We have a lucene index of over 10 000 000 docs at this time. When we try and run a search we get java.lang.OutOfMemoryError: Java heap space We have tried setting the xmx settings to 1gb but to no avail (the box has 4gb of memory available) . IS there any guidance on handling memory or

Re: Related searches

2006-02-01 Thread xing jiang
Hi, I got a question of doing the related search. For instance, if I want to say Support Vector Machine == SVM. Then, How can i use this information when retrieve documents. I dont think it can be added in the Synonym Filter. On 2/1/06, Dave Kor [EMAIL PROTECTED] wrote: On 1/30/06, Leon

RE: Memory problem

2006-02-01 Thread Nick Vincent
Hi Leon, I had a similar problem when doing a test import which I believe was actually down to object churn in parsing the data to create the Documents. I achieved a quick fix by calling System.gc() every thousand documents. Cheers, Nick From: Leon

Re: Memory problem

2006-02-01 Thread Leon Chaddock
Hi Nick, we didnt get the error on importing it was actually when conducting a search. Would this still help? Thanks Leon - Original Message - From: Nick Vincent [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Wednesday, February 01, 2006 11:17 AM Subject: RE: Memory problem

Re: Memory problem

2006-02-01 Thread Volodymyr Bychkoviak
As long as you have many document in index there can many unique terms in index. Every 128th term(by default) is written to term info index for faster term lookup. This info is loaded entirely to memory when searching so this can increase memory usage. Note that this does not depends on number

Sorting by Score

2006-02-01 Thread Daniel . Clark
My primary sort is by relevance score and my secondary sort is by date. The Hits.getScore() method returns the score by 7 digits to the right of the decimal point. Therefore, If I round to only 2 decimal points in the display, the underlying 7 point score will be different in the sort. Example:

Re: Distributed vs Merged Searching

2006-02-01 Thread Grant Ingersoll
You might find http://hellonline.com/blog/?p=55 helpful. It discusses some issues with parallel distributed searches and may be helpful. How many documents are you expecting to index? And how many unique terms do you expect? Chun Wei Ho wrote: I am deploying a web application serving

Indexing and searching and item with attachments

2006-02-01 Thread Gwyn Carwardine
I want to be able to store items and attachments such that they are treated as a single document. On the other hand I want to be able to store them separately; there is not point in reindexing an attachment if I've simply changed the description. Here's an example: A system used to track job

update a document with unstored fields

2006-02-01 Thread Andrea Di Cesare
No virus found in this outgoing message. Checked by AVG Free Edition. Version: 7.1.375 / Virus Database: 267.14.25/247 - Release Date: 31/01/2006 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail:

Re: Memory problem

2006-02-01 Thread Chris Hostetter
it seems like there are a few common things that bite people over and over again that you should check first and foremost... 1) don't use more searchers/readers then you need. Every time you open an IndexSearcher/IndexReader resources are used which take up memory. for an application pointed

RE: Sorting by Score

2006-02-01 Thread Eric Isakson
Am I understanding your problem correctly? If you have two hits: Hit A score = .7876546 rounded score = .79date 2006Feb01 Hit B score = .7907536 rounded score = .79date 2006Jan01 You would like to have the hits sorted as A, B but you are getting the hits in the order B, A

RE: Sorting by Score

2006-02-01 Thread Chris Hostetter
: I've not used the sorting code yet, but it looks like you have to : provide some custom ScoreDocComparator by adding a SortField using the : SortField(String field, SortComparatorSource comparator) constructor. : I'm just not certain what you should specify for the field value since : you

RE: Sorting by Score

2006-02-01 Thread Eric Isakson
from your previous post about memory...Every time you sort on a field, a FieldCache array is constructed for that field. would that happen in this case for the field this comparator is not using? -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Chris

RE: Sorting by Score

2006-02-01 Thread Chris Hostetter
: from your previous post about memory...Every time you sort on a field, : a FieldCache array is constructed for that field. : : would that happen in this case for the field this comparator is not using? Ooohhh ... good question. As long as you write your own implimentation of

IndexWriter.java

2006-02-01 Thread Azlan Abdul Latiff
Does anyone have the file IndexWriter.java? Azlan This e-mail has been sent via JARING webmail at http://www.jaring.my - To unsubscribe, e-mail: [EMAIL

Re: IndexWriter.java

2006-02-01 Thread Chris Hostetter
: Does anyone have the file IndexWriter.java? your question is very perplexing. are you looking for the class that comes with lucene? ... http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/src/java/org/apache/lucene/index/IndexWriter.java?rev=372350view=markup ...or are you looking for some

Sending query to multiple servers and combine all Hits from them ?

2006-02-01 Thread Vikas Khengare
Hi Friends... I am doing search application which has following scenario. Architecture == 1] Common GUI 2] When user enter one query then It should go to 4 searcher server (All servers are on remote machines) 3] After searching all 4 server should return results i.e. Hits ( All

RE: Sending query to multiple servers and combine all Hits from them ?

2006-02-01 Thread Daniel Pfeifer
You search all four servers by doing this (the QueryParser in this example uses the Lucene 1.9 syntax): Searchable[] searchables = new Searchable[]{(Searchable) Naming.lookup(x1), (Searchable) Naming.lookup(x2), ...}; MultiSearcher multiSearcher = new MultiSearcher(searchables); Hits hits =

Re: Distributed vs Merged Searching

2006-02-01 Thread Chris Lamprecht
One issue is that if you are splitting the index in half (for example), getting some results from index A and some from index B, then you need to merge the results somewhere. But the scores coming from the two indexes are not related at all, for example, document 100 from index A has score 0.85,

Re: Distributed vs Merged Searching

2006-02-01 Thread Andrzej Bialecki
Chris Lamprecht wrote: One issue is that if you are splitting the index in half (for example), getting some results from index A and some from index B, then you need to merge the results somewhere. But the scores coming from the two indexes are not related at all, for example, document 100 from