Re: Simple Web Search

2008-06-17 Thread Lukas Vlcek
Hi, If your content is stored in a database, you might also be interested in Compass (I have had a very positive experience with this product). Hibernate Search could be another interesting product for you (I have no experience with it, so I can't say more). Lukas On Tue, Jun

Re: Displaying and highlighting results from a Wild Card and Fuzzy search using Lucene in Java

2008-06-17 Thread Mark Miller
I am not seeing the problem. One small thing is that you don't need to make another Reader...you can get the one from the IndexSearcher. But I don't see why you are getting that error...that is the type of error you would get if you were trying to use IndexReader's protected constructor outside

Re: Simple Web Search

2008-06-17 Thread Ruslan Sivak
The thing is I don't necessarily want to search the db, as I don't know what page to send the user to to view the result. I want to have something crawl the site and then be able to know the url of the match. Russ Lukas Vlcek wrote: Hi, If your content is stored in database then you might

Re: How to handle plural?

2008-06-17 Thread Grant Ingersoll
Really could use a sample sentence or two, maybe even a paragraph, just to make sure I'm following you here. However, this sounds, to me anyway, like a job for some higher order processing. If I understand you right, you are asking for a system that can reason about the types of things

Re: Displaying and highlighting results from a Wild Card and Fuzzy search using Lucene in Java

2008-06-17 Thread Chris Hostetter
: why you are getting that error...that is the type of error you would get if : you were trying to use IndexReader's protected constructor outside the index : package...but you are correctly using the static open call... not quite... : IndexReader ir = new IndexReader().open(fsDir);
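The difference between the quoted line and a plain static call can be sketched without Lucene. This is only an illustration under stated assumptions: `SketchReader` is a hypothetical stand-in for a reader type with a protected constructor and a static `open` factory, as described in the message above. It is not Lucene's `IndexReader`, and the directory string is made up.

```java
// Hypothetical illustration only; SketchReader is not a Lucene class.
class StaticOpenSketch {
    public static void main(String[] args) {
        // Call the static factory on the type itself. No constructor call is
        // needed, so the protected constructor is never touched from here.
        SketchReader reader = SketchReader.open("/tmp/index");
        System.out.println("opened " + reader.directory());

        // By contrast, an expression of the form `new SketchReader(...).open(...)`
        // has to invoke the constructor first, which is where access to a
        // protected constructor from another package would be rejected.
    }
}

abstract class SketchReader {
    private final String directory;

    // Protected: intended for subclasses and same-package code only.
    protected SketchReader(String directory) {
        this.directory = directory;
    }

    String directory() {
        return directory;
    }

    // Static factory: builds a concrete (anonymous) subclass for the caller.
    static SketchReader open(String directory) {
        return new SketchReader(directory) {};
    }
}
```

With the real library the equivalent would presumably be the static form that Mark's message refers to, called on the class rather than on a freshly constructed instance.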

huge tii files

2008-06-17 Thread tsuraan
I have a collection of indices with a total of about 7,000,000 documents between them all. When I attempt to run a search over these indices, the searching process's memory usage increases to ~1.7GB if I allow java to use that much memory. If I don't (my normal memory cap is 512MB), I get the

RE: huge tii files

2008-06-17 Thread Alex
You can invoke IndexReader.setTermInfosIndexDivisor prior to any search to control the fraction of the .tii file read into memory.
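A rough sketch of the idea behind such a divisor, in plain Java with no Lucene dependency. It assumes the term index is a sorted list of sampled terms and that a divisor of N keeps only every N-th of them in memory, so memory drops by roughly a factor of N while each lookup has to scan further from the nearest kept entry. The class and method names (`TermIndexDivisorSketch`, `subsample`, `scanStart`) and the sample terms are hypothetical and are not part of Lucene.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; this is not Lucene code.
class TermIndexDivisorSketch {

    // Keep every divisor-th entry of a sorted term index, starting with entry 0.
    static String[] subsample(String[] indexTerms, int divisor) {
        if (divisor < 1) {
            throw new IllegalArgumentException("divisor must be >= 1");
        }
        List<String> kept = new ArrayList<>();
        for (int i = 0; i < indexTerms.length; i += divisor) {
            kept.add(indexTerms[i]);
        }
        return kept.toArray(new String[0]);
    }

    // Position in the full index where a sequential scan for target would start:
    // the last kept entry that sorts at or before target (entry 0 if none does).
    static int scanStart(String[] kept, int divisor, String target) {
        int pos = Arrays.binarySearch(kept, target);
        if (pos < 0) {
            pos = -pos - 2; // insertion point minus one
        }
        return Math.max(pos, 0) * divisor;
    }

    public static void main(String[] args) {
        String[] indexTerms = {"a", "b", "c", "d", "e", "f", "g", "h"};
        String[] kept = subsample(indexTerms, 4);
        System.out.println("entries held in memory: " + kept.length);
        System.out.println("scan for \"f\" starts at entry " + scanStart(kept, 4, "f"));
    }
}
```

In the real library the setting would be applied on the reader before the first search, as the message says; the sketch only shows why a larger divisor trades memory for longer scans.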

Re: huge tii files

2008-06-17 Thread tsuraan
That's really nice. Thanks! I'm guessing the answer is no, but is there an equivalent to that for lucene-2.2.0? Upgrading shouldn't be much of a problem anyhow (we've been doing it since 1.9), but out of curiosity... On 17/06/2008, Alex [EMAIL PROTECTED] wrote: you can invoke

RE: huge tii files

2008-06-17 Thread Steven A Rowe
Hi tsuraan, On 06/17/2008 at 2:31 PM, tsuraan wrote: I'm guessing the answer is no, but is there an equivalent to that for lucene-2.2.0? Not exactly equivalent, but: from the apidoc for the 2.3.2 version of setTermInfosIndexDivisor(int)

Re: Lucene Transactional semantics

2008-06-17 Thread Adrian Tarau
I had the same problem, so one year ago I implemented transactions on top of Lucene (I had an idea how to do it, but I also peeked a little bit at the Compass sources). Basically I create a new index every time a new transaction is started and this new index is made visible if commit is

Re: Displaying and highlighting results from a Wild Card and Fuzzy search using Lucene in Java

2008-06-17 Thread Mark Miller
Chris Hostetter wrote: : why you are getting that error...that is the type of error you would get if : you were trying to use IndexReader's protected constructor outside the index : package...but you are correctly using the static open call... not quite... : IndexReader ir = new

Snowball Analyzer and apostrophes

2008-06-17 Thread Max Metral
So I'm using Snowball Analyzer on a field for business titles. The value "Charlie's Sandwich Shoppe" becomes "charli sandwich shopp". This happens partly because the StandardAnalyzer strips off the apostrophe-s entirely, and then the Snowballer takes off the e. The problem is when someone comes in

Re: number of hits per document

2008-06-17 Thread Chris Hostetter
: I could do it that way, but counting the spans per document is specific to : SpanQuerys. I would still have to count hits for TermQuerys separately. I : was looking for a generic way to count hits for any instance of Query within : a document. the original Query, Weight, and Scorer APIs

Lucene Developer

2008-06-17 Thread Jdanko
I am not here to waste anyone's time and don't believe my post violates the terms of use. If the admins wish to remove my posts, I understand. I am recruiting for a Search Engineer (Lucene); this position is a direct hire opportunity for an e-commerce site in the Monterey, CA area. Below is a