RE: Dates and others

2003-11-26 Thread Dion Almaer
Hi guys - So I am getting happier with search, and just pushed the lucene version live at: http://www.theserverside.com (on the leftbar) and: http://www.theserverside.com/home/search/index.jsp The only real item that I still want to tweak more is getting recent results higher in the list. I wa

Re: log4j.properties

2003-11-26 Thread Victor Hadianto
> java -Dlog4j.configuration=log4j.xml org.pdfbox.searchengine.lucene.IndexFiles > -create -index c:\\index .. Hmm try to create log4j.xml instead of log4j.properties as specified in the command line parameter. - Original Message - From: "Tun Lin" <[EMAIL PROTECTED]> To: "'Lucene User

Re: Collaborative Filtering API

2003-11-26 Thread Steven J. Owens
On Tue, Nov 25, 2003 at 01:18:19PM -0500, Michael Giles wrote: > Yes, he was the lead Ph.D. student on the GroupLens project at Minnesota. I've actually worked on a system that bundled GroupLens. I think it was Vignette StoryServer. The Vignette docs were incredibly dense with MarketingNewS

Re: Search Question - not returning desired results

2003-11-26 Thread Erik Hatcher
On Wednesday, November 26, 2003, at 11:08 AM, Pleasant, Tracy wrote: But now i have another question. Let's say I have 'return_results.pl' in the document in one of the fields. Actually there is a little bit more to it than understanding the analysis phase, and you were right in saying you need

Re: Search Question - not returning desired results

2003-11-26 Thread Erik Hatcher
On Wednesday, November 26, 2003, at 11:08 AM, Pleasant, Tracy wrote: But now i have another question. Let's say I have 'return_results.pl' in the document in one of the fields. When I search for return_res* or return_res~ it won't return the document. But searching for any of these does return th

Re: Search Question - not returning desired results

2003-11-26 Thread Erik Hatcher
On Wednesday, November 26, 2003, at 11:33 AM, Pleasant, Tracy wrote: Your website says: org.apache.lucene.analysis.standard.StandardAnalyzer: [xy&z] [corporation] [EMAIL PROTECTED] [com] When I run it it keeps the entire email '[EMAIL PROTECTED] but according to your website it separa

RE: Eliminating duplicate result

2003-11-26 Thread Pleasant, Tracy
You are searching for the same term and you are searching the same index twice, it will return the same results... I don't get what you are asking. -Original Message- From: Dragan Jotanovic [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 26, 2003 3:19 AM To: Lucene Users List Subje

RE: Search Question - not returning desired results

2003-11-26 Thread Pleasant, Tracy
Erik, I think there may be a typo in the website. When I run the AnalyzerDemo : Analzying "xy&z corporation - [EMAIL PROTECTED]" org.apache.lucene.analysis.standard.StandardAnalyzer: [xy&z] [corporation] [EMAIL PROTECTED] Your website says: org.apache.lucene.analys

RE: Search Question - not returning desired results

2003-11-26 Thread Pleasant, Tracy
It seems like what I should we using is something more like a SimpleAnalyzer or StopAnalyzer. I've changed my code and the query to use SimpleAnalyzer. But now i have another question. Let's say I have 'return_results.pl' in the document in one of the fields. When I search for return_res* or r

RE: Search Question - not returning desired results

2003-11-26 Thread Pleasant, Tracy
Thanks this helps a lot :) -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 26, 2003 4:58 AM To: Lucene Users List Subject: Re: Search Question - not returning desired results On Tuesday, November 25, 2003, at 12:11 PM, Pleasant, Tracy wrote

RE: log4j.properties

2003-11-26 Thread Tun Lin
I have integrated Lucene and PDFBox and tried the following command to index files java -Dlog4j.configuration=log4j.xml org.pdfbox.searchengine.lucene.IndexFiles -create -index c:\\index .. But I have the following error message: log4j:WARN No appenders could be found for logger (org.pdfbox.pdf

Re: log4j.properties

2003-11-26 Thread Stephane Vaucher
As I've said previously, it's a log4j problem and not a lucene probleme, you should post there. sv On Wed, 26 Nov 2003, Tun Lin wrote: > I have created the following "log4j.properties" and put it in your classpath but > it still has that error. Anyone can help? > > log4j.rootCategory=stdout >

RE: Tokenizing text custom way

2003-11-26 Thread MOYSE Gilles (Cetelem)
Do you want to define expressions, i.e. a set of terms that must be intpreted as a whole ? For instance, when the Analyzer catchs "time" followed by "out" it returns "time_out" ? -Message d'origine- De : Dragan Jotanovic [mailto:[EMAIL PROTECTED] Envoyé : mercredi 26 novembre 2003 12:12 À

Re: Tokenizing text custom way

2003-11-26 Thread Erik Hatcher
On Wednesday, November 26, 2003, at 06:12 AM, Dragan Jotanovic wrote: You will need to write a custom analyzer. Don't worry, though it's quite straightforward. You will also need to write a Tokenizer, but Lucene helps you a lot here. Wouldn't I achieve the same result if I index "time out"

Re: Chinese input.

2003-11-26 Thread Otis Gospodnetic
Maybe this will help? http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23545 Otis --- Tun Lin <[EMAIL PROTECTED]> wrote: > Hi, > > May I know how do I analyse Chinese input from Chinese text in > Lucene? > > Do I use Analyser function in Lucene? If yes, how to go about using > it? >

Re: Tokenizing text custom way

2003-11-26 Thread Dragan Jotanovic
> You will need to write a custom analyzer. Don't worry, though it's > quite straightforward. You will also need to write a Tokenizer, but > Lucene helps you a lot here. Wouldn't I achieve the same result if I index "time out" like "time_out", using StandardAnalyzer and later if I search for

Re: Search Question - not returning desired results

2003-11-26 Thread Erik Hatcher
On Tuesday, November 25, 2003, at 12:11 PM, Pleasant, Tracy wrote: The documents I have index contain information regarding file names also. For instance 'return_results.pl' or something like that may be in the document fields. I am not understanding Lucene's way of searching: 1. If I search

Re: Tokenizing text custom way

2003-11-26 Thread Erik Hatcher
On Tuesday, November 25, 2003, at 06:41 AM, Dragan Jotanovic wrote: Hi. I need to tokenize text while indexing but I don't want space to be delimiter. Delimiter should be my custom character (for example comma). I understand that I would probably need to implement my own analyzer, but could som

Re: Tokenizing text custom way

2003-11-26 Thread Erik Hatcher
woah that seems like an awfully complex answer to the question of how to tokenize at a comma rather than a space! %-) On Tuesday, November 25, 2003, at 11:48 AM, MOYSE Gilles (Cetelem) wrote: Hi. You should define expressions. To define expressions, you first have to define an expression

Re: log4j.properties

2003-11-26 Thread Erik Hatcher
What does this have to do with Lucene? On Wednesday, November 26, 2003, at 01:04 AM, Tun Lin wrote: I have created the following "log4j.properties" and put it in your classpath but it still has that error. Anyone can help? log4j.rootCategory=stdout log4j.appender.stdout=org.apache.log4j.Conso

Re: unexpected results from query

2003-11-26 Thread Erik Hatcher
On Tuesday, November 25, 2003, at 10:45 PM, marc wrote: Hi, assume a field has the following text "Adenylate kinase (mitochondrial GTP:AMP phosphotransferase) " the following searches all return this document AMP & & can someone explain this to me..i figured that only the first query would be

Re: Eliminating duplicate result

2003-11-26 Thread Dragan Jotanovic
> When you are doing two searches are you searching for two different terms? > No, I am searching for the same term. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]