indexing going wrong

2007-08-11 Thread nachi
all, No sure if earlier mail went thru..so resending... Im new lucene and Im trying to develope a textual search module. I have written the following code ( this is research code) - File dir = new File(c:/test); IndexWriter writer = new IndexWriter(dir, new StandardAnalyzer(), true);

Re: Reading Existing index

2007-08-11 Thread Daniel Naber
On Saturday 11 August 2007 02:20, Aleesh wrote:  Need your help regarding reading existing index. Actually I am trying to read an existing index ans just wanted to know, is there a way to identify type of 'Analyzer' which was used at the index creation time? That information is not part of

Re: Lucene in large database contexts

2007-08-11 Thread Antonello Provenzano
Chris, Your solution is very interesting and I would like to consider it. At the moment I'm going to focus on (N)Hibernate Search (in the .NET version is released in the SVN), which also covers the requirement of persistence. If the result of the tests won't be successful, I will immediately

Re: indexing going wrong

2007-08-11 Thread Erick Erickson
A couple of things come to mind. But before I get to them, really, really, really get a copy of Luke. It'll allow you to examine your index and see if what's in there is really what you expect. It'll save you a world of hurt G Google luke lucene Also, use query.toString to see what the

Indexing correctly?

2007-08-11 Thread John Paul Sondag
Hi, I was hoping that maybe you guys could see if I'm somehow indexing inefficiently. I'm putting relevant parts of my code below. I've looked at the benchmarks page on Lucene and my indexing time is taking a substantial amount of time more than what I see posted. I'm not sure when I should

Re: 答复: 答复: Lucene in large database contexts

2007-08-11 Thread Mathieu Lecarme
With Compass, indexing is linked to your database transaction, when your object is persisted, it's indexed too. All your questions are managed cleanly and silently by Compass, just have a look to the source code if you don't wont to use this product. M. Le 10 août 07 à 12:24, Antonello

Re: Indexing correctly?

2007-08-11 Thread karl wettin
How much slower than anticipated is it? I would start by using a StringBuffer/Builder rather than appending (immutable) strings to each other. 11 aug 2007 kl. 19.05 skrev John Paul Sondag: Hi, I was hoping that maybe you guys could see if I'm somehow indexing inefficiently. I'm putting

Range queries in Lucene - numerical or lexicographical

2007-08-11 Thread Nilesh Bansal
Hi all, Lucene query parser synax page (http://lucene.apache.org/java/docs/queryparsersyntax.html) provides the following two examples of range query: mod_date:[20020101 TO 20030101] and title:{Aida TO Carmen} Now my question is, numerically 10 is greater than 2, but in string-only comparison 2

Re: indexing going wrong

2007-08-11 Thread nachi
oops...that was a control-c control-v error. Im indexing directory c:\test and using the index in c:\test for searching. I found the problem to be this - Im reusing the same document object in the for loop. I solved it by creating new document each time the loop runs... actually when if statement

Re: How to implement cut of score ?

2007-08-11 Thread Chris Hostetter
: I want to cut of records which has score below a threshold. : : Without DocCollector. this would be meaningless even if it were easier... http://wiki.apache.org/lucene-java/LuceneFAQ#head-912c1f237bb00259185353182948e5935f0c2f03 FAQ: Can I filter by score? -Hoss

Re: Indexing correctly?

2007-08-11 Thread John Paul Sondag
It takes roughly 6 hours for me to index a Gig of data. The benchmarks take quite a bit less if I'm reading it correctly. I'll try out the StringBuffer/Builder and let you know. Thanks for the quick response and if you have any more suggestions please let me know. --JP On 8/11/07, karl wettin

Re: configuration of lucene with jsp

2007-08-11 Thread Chris Hostetter
: Message-ID: : [EMAIL PROTECTED] : In-Reply-To: [EMAIL PROTECTED] http://people.apache.org/~hossman/#threadhijack When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you change the subject line of your email,

Re: term location in doc

2007-08-11 Thread Chris Hostetter
: In-Reply-To: [EMAIL PROTECTED] http://people.apache.org/~hossman/#threadhijack When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you change the subject line of your email, other mail headers still track which