regarding FieldSelector

2007-09-12 Thread Mohammad Norouzi
Hi all, Can anyone explain what is the FieldSelector and the usage or benefits of this structure? I read the javadocs but I can't get for what goal it is provided in Lucene. Thanks in advance -- Regards, Mohammad -- see my blog: http://brainable.blogspot.com/ another in

Some questions on transactions

2007-09-12 Thread Simon Wistow
I'm looking at doing a system which is looks something like this - I have an IndexSearcher open with a on-disk index but all writes go to a RAM based IndexWriter. Periodically I do 1. Close IndexSearcher 2. Open new IndexWriter in same location 3. Use addIndexes with

Re: Some questions on transactions

2007-09-12 Thread Michael McCandless
Simon Wistow [EMAIL PROTECTED] wrote: I'm looking at doing a system which is looks something like this - I have an IndexSearcher open with a on-disk index but all writes go to a RAM based IndexWriter. Periodically I do 1. Close IndexSearcher 2. Open new IndexWriter in same

Re: regarding FieldSelector

2007-09-12 Thread Grant Ingersoll
Hi Mohammad, The typical use cases are: 1. You have several small fields used in a results display and one or two large fields (i.e. the original document) and you don't want to pay the cost of loading the large fields for results display because most of them won't be chosen. When a

Re: regarding FieldSelector

2007-09-12 Thread Mohammad Norouzi
Hi Grant, Really thanks for your nice document about advanced Lucene. it was very useful for me. as I understand, we can set some large fields to be lazily loading, now my question is when it will be loaded? it make sense when we call doc.get(field_name) it will load from the index, Am I right?

PowerPoint Extraction

2007-09-12 Thread Krista Leopold
Hello all, I realize that I am asking a just barely Lucene question, but I am certain someone on this list knows the answer to what I am on a quest for. I want to use the HSLF portion of apache's POI to do text extraction for my index, but I am having a really hard time obtaining either the

Re: regarding FieldSelector

2007-09-12 Thread Erick Erickson
Well, it depends on what improve the search process means in your context G.. But I had a case similar to yours that I wrote up in the Wiki where my search times improved about 10X by using lazy loading. You might want to read that entry here...

Re: PowerPoint Extraction

2007-09-12 Thread Nick Burch
On Wed, 12 Sep 2007, Krista Leopold wrote: I realize that I am asking a just barely Lucene question, but I am certain someone on this list knows the answer to what I am on a quest for. I want to use the HSLF portion of apache's POI to do text extraction for my index, but I am having a really

Re: How to get documents similar to other document ?

2007-09-12 Thread Chris Hostetter
: Excuse me, Could you give more details ? : Are you telling me that functionality exists ? : Which class should I use for this ? MoreListThis is a class name ... it is in a contrib (which means it is not inlcuded in the main lucene jar, but it can be found by looking in the contrib directory

Tokenization question

2007-09-12 Thread Dan Luria
If I have a tokenized unstored field in a document, and I want to transfer the document to another index, is it possible to carry of the tokenization with terms? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional