Re: SF.net search system

2005-07-05 Thread Andrzej Bialecki
Chris Conrad wrote: We're using the StandardAnalyzer. We are having very strange problems and I haven't been able to track them down. I know that the SourceForge Enterprise Edition team at VA has also seen the same kind of problem (i.e. search terms returning no results when there seem to

Re: SF.net search system

2005-07-05 Thread Chris Conrad
On Jun 29, 2005, at 4:37 PM, Chris Lu wrote: How is your crawler done? I saw SF.net searches several types of documents, like "People", "Freshmeat.net", "Site Doc". Are they all from the database? We don't crawl per se, we use triggers in the database to spool changes to a table which is the

Re: SF.net search system

2005-07-05 Thread Chris Conrad
Hi David, On Jun 29, 2005, at 4:25 PM, David Spencer wrote: Chris Conrad wrote: I know I've been asked before for a description of how SourceForge.net is using Lucene. I wrote a blog entry about it and thought people might be interested in seeing at a high level how it was designed.

PhraseQuery Incompatible format version

2005-07-05 Thread Yousef Ourabi
Hey All: Quick Question about the following error I am getting when I use a string tokenizer to parse the query string. The query I pass to is Great Expections / Great Expectations, both are two tokens however? phraseQuery.add(new Term("content", tok.nextToken())); IOException caught SearchByTerm:

Re: highlighter to use with 1.4.3?

2005-07-05 Thread Erik Hatcher
On Jul 5, 2005, at 4:48 PM, Leos Literak wrote: Hi, I'd like to play around with highlighter. I downloaded one from sandbox, but it doesn't compile against lucene stable 1.4.3. highlight/TokenSources.java:19: cannot find symbol [javac] symbol : class TermVectorOffsetInfo Is there some p

Re: Question on the Sandbox Highlighter

2005-07-05 Thread Erik Hatcher
On Jul 5, 2005, at 4:58 PM, Terence Lai wrote: I am currently using Lucene 1.4.2 with the highlighter downloaded from Lucene In Action. The Highlighter class provides the following method to highlight the terms specified in the Query: /** * Highlights chosen terms in a text, extracting th

Question on the Sandbox Highlighter

2005-07-05 Thread Terence Lai
Hi, I am currently using Lucene 1.4.2 with the highlighter downloaded from Lucene In Action. The Highlighter class provides the following method to highlight the terms specified in the Query: /** * Highlights chosen terms in a text, extracting the most relevant section. * The document text is

highlighter to use with 1.4.3?

2005-07-05 Thread Leos Literak
Hi, I'd like to play around with highlighter. I downloaded one from sandbox, but it doesn't compile against lucene stable 1.4.3. highlight/TokenSources.java:19: cannot find symbol [javac] symbol : class TermVectorOffsetInfo Is there some package that can be used with lucene 1.4.3 distribut

Re: question on filters

2005-07-05 Thread Erik Hatcher
On Jul 5, 2005, at 9:48 AM, Aigner, Thomas wrote: Hi all, I am creating a punctuation filter to filter certain punctuation out of the token stream. I am getting a "The field t.termText is not visible" error. I'm not sure what I would need to include to make this property visible (I a

question on filters

2005-07-05 Thread Aigner, Thomas
Hi all, I am creating a punctuation filter to filter certain punctuation out of the token stream. I am getting a "The field t.termText is not visible" error. I'm not sure what I would need to include to make this property visible (I am still new to Lucene and Java for that matter). I copi

if delete all docs in segment - when is segment deleted

2005-07-05 Thread Tony Schwartz
If every doc in a segment is deleted, when does the segment go away? Without me having to dig too deep, I was hoping someone could help me prepare for this eventuality. I have an index that grows infinitely. Old docs are deleted each day just before new docs for that day are added. If I set Ma

RE: free text search with numbers

2005-07-05 Thread BOUDOT Christian
Thanks for the hint. Cheers Chris -----Original Message----- From: sergiu gordea [mailto:[EMAIL PROTECTED] Sent: 05 July 2005 10:02 To: java-user@lucene.apache.org Subject: Re: free text search with numbers Hi Christian, That syntax is not entirely correct. Search in the mailing list for "*

RE: free text search with numbers

2005-07-05 Thread BOUDOT Christian
Your questions are not dumb but I am :-S I am not familiar with JavaCC and the .jj files. I didn't know I had to generate the java files with JavaCC. Of course it works now. There is still a lot to discover for me :-) Many thanks for your help and patience. Cheers Chris -Original Message--

Re: free text search with numbers

2005-07-05 Thread Erik Hatcher
On Jul 5, 2005, at 2:26 AM, BOUDOT Christian wrote: :-) I changed the main lines and compiled the QueryParser.java; after that I compiled the entire package, built a new jar file and installed it on my server. (I followed the same procedure I used when I added a new analyzer) Sorry if the

Re: Most Useful Lucene Taglib?

2005-07-05 Thread Erik Hatcher
On Jul 4, 2005, at 9:45 PM, Chris Fellows wrote: Perhaps it may be an overkill of a taglib, but my familiarity with Lucene dates to only this weekend. On the other hand there has been a sustained trend to move away from scriptlets. By no means was I suggesting scriptlets, though pragmatical

Re: free text search with numbers

2005-07-05 Thread sergiu gordea
Hi Christian, That syntax is not entirely correct. Search in the mailing list for "*term" or suffix queries; a few months ago I submitted the correct grammar that enables suffix queries. Best, Sergiu BOUDOT Christian wrote: I have found in the QueryParser.jj those lines of comments: // OG:

Re: Most Useful Lucene Taglib?

2005-07-05 Thread Paul Libbrecht
On 5 Jul 2005, at 03:45, Chris Fellows wrote: Is there a strong web client user base of Lucene? I would estimate this to be at least 50% of, say, the java-user@lucene.apache.org mailing list, though that is really a personal guess. paul