Re: Boosting query - debuging

2009-05-13 Thread liat oren
Thanks a lot, Grant. Yes, this is the case, it is longer than TTD. Can you also explain me Why in finlin, we have the doc 35433 and in TTD, its 20? Are these the number of dcuments that contain any of the elements exist in eaxh word. So if word TTD contains only 6621468, then 20 is the number of do

IndexWriter stopped before commit

2009-05-13 Thread liat oren
Hi all, I ran a code that joins a list of indexes to one index. The commit and close of the writer is done when we finish looping on the list of the original paths. An error occured before it arrived to the commit part. The index size is big, so it looks like the data is there, but when I check th

How can i restrict search to only some documents.

2009-05-13 Thread Velaboy V
Hi, I am new to Lucene. I have a document repository laid out like this: \docsdir    \doc1dir\    \doc2dir\    \doc3dir\     ... Every docNdir contains several files (xml). While searching, i want to restrict the search Only to a set of documents (this is because the user who is searching has

Re: How can i restrict search to only some documents.

2009-05-13 Thread Anshum
Hi, There are several ways to do it, easier ones: 1. While indexing, you could store the directory and then filter results on directory while searching. 2. Run a prefix query while searching in case you can't index the directory, though I wouldn't have a reason to justify this approach. The above

Re: IndexWriter stopped before commit

2009-05-13 Thread Michael McCandless
Unfortunately, no. If the JRE crashes/exits without IndexWriter.commit (or close) being called, then the index will reflect none of the changes during that session. There will be partial files in there (that's why you see so much disk space being used) but there's no segments file describing what

Re: Alphanumeric Search Problem

2009-05-13 Thread Erick Erickson
I'd recommend you get a copy of Luke and examine what's actually in your index when anomalous things happen. In your first post you didn't specify what analyzer you used, I suspect you weren't getting the tokens broken up as you expected. Luke would have shown you. But if you're satisfied with

Re: Boosting query - debuging

2009-05-13 Thread Grant Ingersoll
On May 13, 2009, at 3:04 AM, liat oren wrote: Thanks a lot, Grant. Yes, this is the case, it is longer than TTD. Can you also explain me Why in finlin, we have the doc 35433 and in TTD, its 20? Are these the number of dcuments that contain any of the elements exist in eaxh word. My unde

Upper limit on document field value length ?

2009-05-13 Thread Joel Halbert
Is there a limit to the size of a field which Lucene will index? i.e. for very large field values are only the first n tokens or n characters indexed? If so is there a way of upping/removing this limit? Rgs, Joel - To unsubs

RE: Upper limit on document field value length ?

2009-05-13 Thread Uwe Schindler
See http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/index/IndexWriter. MaxFieldLength.html And the corresponding IndexWriter ctors. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Joel Halbert

RE: Upper limit on document field value length ?

2009-05-13 Thread Joel Halbert
Thanks Uwe. -Original Message- From: Uwe Schindler Reply-To: java-user@lucene.apache.org To: java-user@lucene.apache.org Subject: RE: Upper limit on document field value length ? Date: Wed, 13 May 2009 15:57:26 +0200 See http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/index/Ind

Lucene Per-User Relevancy

2009-05-13 Thread EJLeVin1
Hi, I am kind of new to Lucene so please bear with me if what I'm asking sounds ridiculous. I am trying to get lucene to have higher amounts of relevancy based on keywords that the user has marked in their profile as important, and also based on keywords that the user dislikes. Right now I a

Re: InstantiatedIndex Memory required

2009-05-13 Thread Karl Wettin
Hi Ravichandra, this is a question better fitted the java-users maillinglist. On this list we talk about the development of the Lucene API rather than how to use it. To answer your question, there is no simple formula that says how much RAM an InstantiatedIndex will consume given the FSDi

OS not allowing me to get native fs lock

2009-05-13 Thread Newman, Billy
I know that this is not really a lucene problem but looking around I have not been able to find much about it. I recently ran into a problem where I cannot obtain a lock due to a problem with native file system locks. The strange thing is that this was working a few days ago and I have just st

Re: german analyers xes me

2009-05-13 Thread Daniel Naber
On Tuesday 12 May 2009, Timon Roth wrote: > the queryparser is feeded with the germananalyzer and translates the > phrase to "offentlich finanx abgaberech". Have you checked the FAQ at http://wiki.apache.org/lucene-java/LuceneFAQ#head-3558e5121806fb4fce80fc022d889484a9248b71 ? If that doesn't he

Help with phrase indexing

2009-05-13 Thread Ridzwan Aminuddin
Hi all, Is Lucene able to index phrases instead if individual terms? If it is, can we also feed it a 'thesaurus or dictionary' of phrases that it should look out for when indexing. Thanks in advance, Ridzwan

Re: pagination search results

2009-05-13 Thread Daniel Susanto
Do you have any example for that?? I'm using JSP, what i need is paging just like in google search... :D thx Daniel Susanto http://susantodaniel.wordpress.com --- On Tue, 5/12/09, Ian Lea wrote: From: Ian Lea Subject: Re: pagination search results To: java-user@lucene.apache.org Date: Tuesd

Re: pagination search results

2009-05-13 Thread Nate
No. http://tinyurl.com/qwhhrp -Nate On Wed, May 13, 2009 at 8:32 PM, Daniel Susanto wrote: > Do you have any example for that?? I'm using JSP, what i need is paging just > like in google search... > > :D > > thx > > Daniel Susanto > http://susantodaniel.wordpress.com > > --- On Tue, 5/12/09, I

Re: Help with phrase indexing

2009-05-13 Thread Anshum
If I'm interpreting your need correctly, you want to index untokenized strings, is it? Even if you aren't looking for untokenized indexing, you could always use/design a suitable analyzer which indexes your docuement exactly as you want, Yes, what you also are looking at is some kind of a synonym a