Lucene 1.9.1 - Newbie - How to determine what Hit

2006-03-09 Thread David Ahlschläger
Hi I am new to Lucene only been using it for 2 Days now. I got the Lucene in Action Book whitch got me on the road. I am currently having troble trying to do the following: 1. I indexd my Files to the RAMDirectory, then close the Index Writer. 2. Documnets have only 2 Fields. "filename" which

RE: steps for building lucene 1.9

2006-03-09 Thread Haritha_Parvatham
Hi Patrick, Thanks for your reply. Still iam facing problem with lucene 1.9.I have downloaded binaries as well as source code. When iam trying to run the binaries.Iam getting an error. I have followed the your instructions . I have deployed the luceneweb.war in tomcat.Iam getting the front-end l

Re: Best practice for searching html

2006-03-09 Thread Raul Raja Martinez
Hi, Unfortunately I can't change the way things are indexed, so I guess I need some short of utility class that will turn Martínez into Martínez and then just search for that term. I have also this problem using the StandardAnalyzer: If I search for "cádiz" in luke the query gets parsed as

Adjusting WRITE_LOCK_TIMEOUT in 1.9.1

2006-03-09 Thread Bill Janssen
I don't see how to adjust the value of IndexWriter's WRITE_LOCK_TIMEOUT in 1.9. Since the property org.apache.lucene.writeLockTimeout is no longer consulted, the value of IndexWriter.WRITE_LOCK_TIMEOUT is final, and there's no setter, what's the deal? Bill ---

1.9.1 problem with MatchAllDocsQuery

2006-03-09 Thread javier muguruza
Hi all, I was using lucene-1.9-rc1-dev.jar (got it from luke website I think). I was running a query whose tostring() was: -body:angel -body:darpa -body:protocol +MatchAllDocsQuery1.0 When I upgraded to 1.9.1, I am getting an exception: ava.lang.ArrayIndexOutOfBoundsException: 0 at org.a

Re: Compressed BitSet

2006-03-09 Thread Paul Elschot
On Thursday 09 March 2006 14:25, eks dev wrote: ... > > PS1: If you are interested in compressed bit sets, try > to search for "utilities for compact sparse filters > lucene" Or look here: http://issues.apache.org/jira/browse/LUCENE-328 Regards, Paul Elschot --

Re: Lucene Ranking/scoring

2006-03-09 Thread Yonik Seeley
On 3/9/06, Yang Sun <[EMAIL PROTECTED]> wrote: > Hi Yonik, > Thanks very much for your suggestion. The query boost works great for > keyword matching. But in my case, I need to rank the results by date and > title. For example, title:foo^2 abstract:foo^1.5 date:2004^3 will only boost > the document

RE: [ANN] Searchable 0.6.2 - Annotations for Lucene

2006-03-09 Thread Ramana Jelda
Hi, Exactly, I have already implemented in my application using annotations. and some changes to query input in my application. I don't see where you provide Analyzer annotation. It seems u forgot it. Nice coincidence. Jelda > -Original Message- > From: Seth Fitzsimmons [mailto:[EMAI

Re: Store object into Lucene index

2006-03-09 Thread Øyvind Stegard
On Thursday 09 March 2006 15:54, Yonik Seeley wrote: > On 3/9/06, Øyvind Stegard <[EMAIL PROTECTED]> wrote: > > - How does many stored fields eventually affect indexing/query > > performance compared to if no fields were stored (only indexed) ? > > Additional stored fields should have no effect on

Re: Store object into Lucene index

2006-03-09 Thread Yonik Seeley
On 3/9/06, Øyvind Stegard <[EMAIL PROTECTED]> wrote: > - How does many stored fields eventually affect indexing/query performance > compared to if no fields were stored (only indexed) ? Additional stored fields should have no effect on querying (the internal information about a field is looked up

[ANN] Searchable 0.6.2 - Annotations for Lucene

2006-03-09 Thread Seth Fitzsimmons
Searchable is a toolkit for Lucene that harnesses the power of annotations to specify what properties to index and how to treat them. A sample annotated interface: public interface Sample extends Searchable { public Integer getId(); @Indexed(boost=2.0F) public String getName(); @Indexed

Re: Store object into Lucene index

2006-03-09 Thread Øyvind Stegard
On Thursday 09 March 2006 14:55, Samuru Jackson wrote: > Is there a way to save an object to the a lucene index? > > In my project I noticed that the performance bottleneck is my > database. Lucene gives back a result in no time but to retrieve the > corresponding data sets at the backend in the da

Re: Best practice for searching html

2006-03-09 Thread Yonik Seeley
On 3/9/06, Raul Raja Martinez <[EMAIL PROTECTED]> wrote: > Hi I have a lot of html indexed such as: > > Martínez > > Of course my users are gonna search for Martínez and they're not gonna > get a match. > > Is there a common approach to solve this kind of problem in lucene, > Maybe some utility cla

Re: Store object into Lucene index

2006-03-09 Thread Yonik Seeley
On 3/9/06, Samuru Jackson <[EMAIL PROTECTED]> wrote: > Is there a way to save an object to the a lucene index? Any field may be stored as well as indexed, or stored and not indexed. If a field is stored only (not indexed), you can opt to store it as binary or compressed binary. See the JavaDoc fo

Re: Lucene performance question

2006-03-09 Thread DanielFeinstein
I'm using the following java options: JAVA_OPTS='-Xmx1524m -Xms1524m -Djava.awt.headless=true' --- Grant Ingersoll <[EMAIL PROTECTED]> wrote: > What is your Java max heap size set to? This is the > -Xmx Java option. > > Daniel Feinstein wrote: > > Hi, > > > > My lucene index is not big (about

Store object into Lucene index

2006-03-09 Thread Samuru Jackson
Is there a way to save an object to the a lucene index? In my project I noticed that the performance bottleneck is my database. Lucene gives back a result in no time but to retrieve the corresponding data sets at the backend in the database can take long especially if you need to execeute many que

Re: Lucene performance question

2006-03-09 Thread Grant Ingersoll
What is your Java max heap size set to? This is the -Xmx Java option. Daniel Feinstein wrote: Hi, My lucene index is not big (about 150M). My computer has 2G RAM but for some reason when I'm trying to store my index using org.apache.lucene.store.RAMDirectory it fails with java out of memory

Re: Compressed BitSet

2006-03-09 Thread eks dev
Just a short one, it rocks in some cases (when actual BitSet/IntSet is compressable, long runs of set or clear bits...). Very good general BitSet representation I have tried it and found no bugs so far (+- 2 months of using it) Unfortunately, there is an issue with Licence (not ASF compatible :(

Re: Best practice for searching html

2006-03-09 Thread Jens Kraemer
Hi! On Thu, Mar 09, 2006 at 04:31:23AM -0800, Raul Raja Martinez wrote: > Hi I have a lot of html indexed such as: > > Martínez > > Of course my users are gonna search for Martínez and they're not gonna > get a match. > > Is there a common approach to solve this kind of problem in lucene, > M

Lucene performance question

2006-03-09 Thread Daniel Feinstein
Hi, My lucene index is not big (about 150M). My computer has 2G RAM but for some reason when I'm trying to store my index using org.apache.lucene.store.RAMDirectory it fails with java out of memory exception. Also sometimes for the same search query time spent on search could raise in 10-20 tim

Best practice for searching html

2006-03-09 Thread Raul Raja Martinez
Hi I have a lot of html indexed such as: Martínez Of course my users are gonna search for Martínez and they're not gonna get a match. Is there a common approach to solve this kind of problem in lucene, Maybe some utility class or something? Any help is appreciated. Best regards. Raul.

Compressed BitSet

2006-03-09 Thread Dupriez Christophe
I am about to evaluate Lucene for replacing current IR system of Belgian Poison Center. We are used to manage query refinements by combining different searches with boolean operators: we thus have to store the results (or otherwise to remake them). Our current software does this by paging compres

Aperture 2006.1 alpha 2 released

2006-03-09 Thread Christiaan Fluit
A little while ago I announced the existence of the Aperture project, founded by my company together with the DFKI institute. We just released Aperture 2006.1 alpha 2, which may be of interest to all Lucene users dealing with crawling and text extraction. The project page is located at:

Re: steps for building lucene 1.9

2006-03-09 Thread Patrick Kimber
Hi Haritha Hope the following helps: Build Lucene Core from SVN Download the lucene Subversion repository from: http://svn.apache.org/repos/asf/lucene/java/trunk Note: The CVS repository is still accessible but is out of date. I downloaded to: C:\src\lucene-svn\ To build (using ANT): cd C:\sr

steps for building lucene 1.9

2006-03-09 Thread Haritha_Parvatham
Hi, I have downloaded lucene 1.9 version .please tell me how to build it.Iam finding so many errors in lucene 1.9 source code. Thanks. Haritha DISCLAIMER: This email (including any attachments) is intended for the sole use of the intended recipient/s and may contain material that is CONFIDENTI

RE: 1.4.3 and 64bit support? out of memory??

2006-03-09 Thread zzzzz shalev
hey, i ran a small test and i have 12,055,022 terms in the index, i have a strong feeling that the OS is not allowing the new Term[12055022] allocation JVM - 64bit Linux - 16GB RAM any ideas? Andrew Schetinin <[EMAIL PROTECTED]> wrote: Hi, That'

Re: Atomic index/search for a phrase

2006-03-09 Thread Erik Hatcher
On Mar 8, 2006, at 11:13 PM, Urvashi Gadi wrote: I am trying index and search a phrase (multiple words seperated by spaces). How should i index it so that it remains atomic. I have observed that if i index the phrase are keyword, lucene doesn't let me retrive the phrase in search. Please

Re: RangeQuery, FilterdQuery and HitCollector

2006-03-09 Thread Youngho Cho
Hello Erik, Thanks for your info. It passed !. Thanks again, Youngho - Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: Sent: Thursday, March 09, 2006 5:12 PM Subject: Re: RangeQuery, FilterdQuery and HitCollector > Youngho, > > Try the lasted trunk codebase. I have

Re: RangeQuery, FilterdQuery and HitCollector

2006-03-09 Thread markharw00d
FilteredQuery has the side effect of passing zero scoring docs to the hitcollector. This does break the contract for HitCollector.collect method because the JavaDocs state: "Called once for every non-zero scoring document, with the document number and its score." The quick fix is to simply add a t

Re: RangeQuery, FilterdQuery and HitCollector

2006-03-09 Thread Erik Hatcher
Youngho, Try the lasted trunk codebase. I have recently applied a contributed patch to FilteredQuery. Using the latest codebase, your test passes for me. Erik On Mar 9, 2006, at 1:09 AM, Youngho Cho wrote: Hello, I would like to use a Filter for rangeQuery ( to avoid potentia