Re: How do I get a count of all search results inside of my content?

2006-01-16 Thread Chris Hostetter
1) There's no need to send the same message twice just because you didn't get a rapid response to hte first one ... in most parts of hte US this has been a three day weekend, so it's not that suprising that no one wrote a reply yet since the first time you asked this question friday night. 2) you

How do I get a count of all search results inside of my content?

2006-01-16 Thread Gary Mangum
I am trying to find out a quick way to get a complete count of all search results found in all of my Documents. Let me back up... I have split the content that I am searching into many Documents and then indexed this content. Each Document represents about one "paragraph" of data. Now I search

Re: One problem of using the lucene

2006-01-16 Thread jason
Hi, the following code is the SynonymFilter i wrote. import org.apache.lucene.analysis.*; import java.io.*; import java.util.*; /** * @author JIANG XING * * Jan 15, 2006 */ public class SynonymFilter extends TokenFilter { public static final String TOKEN_TYPE_SYNONYM = "SYNONYM";

RE: Memory

2006-01-16 Thread Aigner, Thomas
Thanks Paul, I did a man on top and sure enough there was a PPID command on Linux (f then B) for parent process. And yes, they always have the same parent command. Thanks for your help as I'm obviously still a noob on Unix. Tom -Original Message- From: Paul Smith [mailto:[EMAIL

Re: Memory

2006-01-16 Thread Paul Smith
If you look at the man page for 'ps' you'll see a switch that shows all the threads too (it's different on different unix flavours, so best to do look in the man page). Once you've shown the threads in 'ps' you'll see that the process that is appearing in top, and I'll bet it's parent is yo

Memory

2006-01-16 Thread Aigner, Thomas
Hi all, Is anyone experiencing possible memory problems on LINUX with Lucene search? Here is our scenario, we have a service that lives on LINUX that takes all incoming request through a port and does the search. Only 1 IndexSearcher is instantiated to do this from our service. When I r

Re: Finding similar documents

2006-01-16 Thread Stefan Gusenbauer
Grant Ingersoll wrote: I believe there is a MoreLikeThis class floating around somewhere (I think it is in the contrib/similarity package). The Lucene book also has a good example, and I have some examples at http://www.cnlp.org/apachecon2005 that demonstrate using term vectors to do this

AW: Part-Of Match

2006-01-16 Thread sven
Hi Hoss, thanks for the answer, and yes you have described the problem perfectly. I think you are right lucene is in fact not the best way of solving it. I decided to simply build a letter trie consisting of all concepts and then simply do a search with that document on the trie. This brings exact

Re: One problem of using the lucene

2006-01-16 Thread Erik Hatcher
Could you share the details of your SynonymFilter? Is it adding tokens into the same position as the original tokens (position increment of 0)? Are you using QueryParser for searching? If so, try TermQuery to eliminate the parser's analysis from the picture for the time being while trou

locked files after updating lucene to 1.4.3

2006-01-16 Thread Jens Ansorg
hi, I run into an issue after updating lucene libs from 1.3-final to 1.4.3. We have a batch job on our web server that recreates the lucene search index every night. This job deletes all index and creates a new one. This search index gets used by the lucene-powered search feature of the web

One problem of using the lucene

2006-01-16 Thread jason
Hi, I got a problem of using the lucene. I write a SynonymFilter which can add synonyms from the WordNet. Meanwhile, i used the SnowballFilter for term stemming. However, i got a problem when combining the two fiters. For instance, i got 17 documents containing the Term "support" and the follo