Re: How to reflect index changes to search automatically

2007-07-12 Thread jafarim
With local indices, it is enough to reopen the IndexSearcher by calling close() and then renew the IndexSearcher object. How about RemoteSearchers? Is it necessary to re-initialize remote search server? --jaf On 7/12/07, Erick Erickson [EMAIL PROTECTED] wrote: In general, searchers cannot

Sorting direction

2007-05-26 Thread jafarim
Hi folks, - I am wondering whether there is any mechanism to specify the sorting direction (i.e. ascending/descending) without being obliged to make a custom Sorter. I add a timeOfIndexing field in order that I can retrive results based ob recentness later. But the sorter sorts in an ascending

Re: add in an existing document

2007-05-11 Thread jafarim
How about this idea: - a special Identifier field. - A DocumentHash class which calculates a hash value from a Document. - A query on Identifier before inserting new Documents to check if it already exists. --jaf On 5/10/07, STEFANOS STEFANOS [EMAIL PROTECTED] wrote: Hello, I would

Re: DBSight Turns Free! Instant Lucene Search on Database!

2007-04-24 Thread jafarim
Hi Chris, Can you explain how? I know the source is available but perhaps a short summary would be very useful for the list readers. --jaf On 4/24/07, Chris Lu [EMAIL PROTECTED] wrote: Hi, Saurabh, It's just one query and returns both hits and categorized counts. Chris --- Saurabh Dani

Re: Top Score Collector

2007-04-23 Thread jafarim
examined. I've just got to ask... Why do you (jafarim) want to fiddle with the threshold? How is this going to benefit the user over and above just getting the first N 100 docs from a Hits object? They're sorted already in relevancy order. Yonik's point that scores aren't comparable across

Top Score Collector

2007-04-22 Thread jafarim
Hi list. I am trying to implement some TopScoreHitCollector class; a kind of TopDocCollector which collects the documents the score of which is higher than a threshold. The threshold will be configurable in the constructor of the class. There is seemingly a document starvation about

Re: Top Score Collector

2007-04-22 Thread jafarim
Be aware that score thresholds don't work well in general since scores aren't really comparable from one query to another. What is I normalize the scores in such a manner that they become between 0 and 1? --jaf

updating a single searcher in MultiSearcher

2007-04-14 Thread jafarim
hi list, I am using a ParallelMultiSearcher in order to search a number of indices. As the rate of input to the index is very high I have to check indexSearcher.isCurrent quite often and have to reopen the index. There are however to issues: - In order to have a current version of a searcher, do

Field.lazy setter method?

2007-04-04 Thread jafarim
Hi I wonder why there is not setter method for the lazy member variable in Field class. Does that mean the propoerty is nominal and setting it does not have any effect, or am I missing some point? Any way, is there any way to tell lucene that a field is to be lazy-loaded, from the very beginning

Re: Field.lazy setter method?

2007-04-04 Thread jafarim
So, what's the usage of this propoerty in the Field class? On 4/4/07, Yonik Seeley [EMAIL PROTECTED] wrote: On 4/4/07, jafarim [EMAIL PROTECTED] wrote: Any way, is there any way to tell lucene that a field is to be lazy-loaded, from the very beginning of field construction? No, that data

Extracting a subset of an index

2007-04-03 Thread jafarim
Hi folks, I need to extract a subset of an index so that I can move some documents to another isolated machine to be searched locally. I'm not sure whether the following scenario is correct: - extracting the documents from the index by using one of the doc(i) methods - adding the same Document

Re: index word files ( doc )

2007-03-26 Thread jafarim
Good to know that your devised commercial feature is already offered by Enhydra Snapper as an open-source feature. Check here: http://www.enhydra.org/apps/snapper/index.html On 3/26/07, Ryan Ackley [EMAIL PROTECTED] wrote: Yes I do have plans for adding fast save support and support for more

Re: how to search over another search

2007-03-26 Thread jafarim
what do you mean by applying the result to the second one? On 3/26/07, Mohammad Norouzi [EMAIL PROTECTED] wrote: hi I have two separated index but there are some fields that are common between them. now I want to search from one index and then apply the result to the second one. what solution

Re: index word files ( doc )

2007-03-24 Thread jafarim
Can anyone make a comparison between the two, namely POI API and the one from textmining.org? On 3/24/07, Ryan Ackley [EMAIL PROTECTED] wrote: The site is down but you can download the word extractor library direct here: http://www.textmining.org/textmining.zip Going to fix the site this

Lazy Field Loading in IndexSearcher

2007-03-23 Thread jafarim
in IndexSearcher does not contain such facilities (seemingly). Hence, the Documents I get from the Hits.doc() would not benefit from the mentioned feature. Am I missing an important point or this is a desired feature to go on the todo list? --Jafarim

Re: index word files ( doc )

2007-03-23 Thread jafarim
Hi My experience is not much satisfactory. It breaks very easily on many files. On 3/23/07, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hello, I am planning to index Word 2003 files. I read I have to use Jakarta Apache POI, but I also read on the POI site that their work with doc's is in an

Re: Lazy Field Loading in IndexSearcher

2007-03-23 Thread jafarim
: does not contain such facilities (seemingly). Hence, the Documents I get : from the Hits.doc() would not benefit from the mentioned feature. : Am I missing an important point or this is a desired feature to go on the : todo list? : --Jafarim -Hoss

Lazy field loading in

2007-03-21 Thread jafarim
in IndexSearcher does not contain such facilities. Hence, the Documents I get from the Hits.doc() would not benefit from the mentioned feature. Am I missing an important point or this is a desired feature to go on the todo list? --Jafarim

Storing whole documents in the index

2007-03-18 Thread jafarim
Hello It's a whil that I am using lucene and as most of people seemingly do, I used to save only some important fields of a docuemnt in the index. But recently I thought why not store the whole document bytes as an untokenized field in the index in order to ease the retrieval process? For example