Hi guys -
So I am getting happier with search, and just pushed the lucene version live at:
http://www.theserverside.com (on the leftbar) and:
http://www.theserverside.com/home/search/index.jsp
The only real item that I still want to tweak more is getting recent results higher in
the list.
I wa
> java -Dlog4j.configuration=log4j.xml
org.pdfbox.searchengine.lucene.IndexFiles
> -create -index c:\\index ..
Hmm try to create log4j.xml instead of log4j.properties as specified in the
command line parameter.
- Original Message -
From: "Tun Lin" <[EMAIL PROTECTED]>
To: "'Lucene User
On Tue, Nov 25, 2003 at 01:18:19PM -0500, Michael Giles wrote:
> Yes, he was the lead Ph.D. student on the GroupLens project at Minnesota.
I've actually worked on a system that bundled GroupLens. I think
it was Vignette StoryServer. The Vignette docs were incredibly dense
with MarketingNewS
On Wednesday, November 26, 2003, at 11:08 AM, Pleasant, Tracy wrote:
But now i have another question.
Let's say I have 'return_results.pl' in the document in one of the
fields.
Actually there is a little bit more to it than understanding the
analysis phase, and you were right in saying you need
On Wednesday, November 26, 2003, at 11:08 AM, Pleasant, Tracy wrote:
But now i have another question.
Let's say I have 'return_results.pl' in the document in one of the
fields.
When I search for return_res* or return_res~ it won't return the
document.
But searching for any of these does return th
On Wednesday, November 26, 2003, at 11:33 AM, Pleasant, Tracy wrote:
Your website says:
org.apache.lucene.analysis.standard.StandardAnalyzer:
[xy&z] [corporation] [EMAIL PROTECTED] [com]
When I run it it keeps the entire email '[EMAIL PROTECTED]
but according to your website it separa
You are searching for the same term and you are searching the same index twice, it
will return the same results...
I don't get what you are asking.
-Original Message-
From: Dragan Jotanovic [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 26, 2003 3:19 AM
To: Lucene Users List
Subje
Erik,
I think there may be a typo in the website.
When I run the AnalyzerDemo :
Analzying "xy&z corporation - [EMAIL PROTECTED]"
org.apache.lucene.analysis.standard.StandardAnalyzer:
[xy&z] [corporation] [EMAIL PROTECTED]
Your website says:
org.apache.lucene.analys
It seems like what I should we using is something more like a
SimpleAnalyzer or StopAnalyzer.
I've changed my code and the query to use SimpleAnalyzer.
But now i have another question.
Let's say I have 'return_results.pl' in the document in one of the
fields.
When I search for return_res* or r
Thanks this helps a lot :)
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 26, 2003 4:58 AM
To: Lucene Users List
Subject: Re: Search Question - not returning desired results
On Tuesday, November 25, 2003, at 12:11 PM, Pleasant, Tracy wrote
I have integrated Lucene and PDFBox and tried the following command to index
files
java -Dlog4j.configuration=log4j.xml org.pdfbox.searchengine.lucene.IndexFiles
-create -index c:\\index ..
But I have the following error message:
log4j:WARN No appenders could be found for logger (org.pdfbox.pdf
As I've said previously, it's a log4j problem and not a lucene probleme,
you should post there.
sv
On Wed, 26 Nov 2003, Tun Lin wrote:
> I have created the following "log4j.properties" and put it in your classpath but
> it still has that error. Anyone can help?
>
> log4j.rootCategory=stdout
>
Do you want to define expressions, i.e. a set of terms that must be
intpreted as a whole ?
For instance, when the Analyzer catchs "time" followed by "out" it returns
"time_out" ?
-Message d'origine-
De : Dragan Jotanovic [mailto:[EMAIL PROTECTED]
Envoyé : mercredi 26 novembre 2003 12:12
À
On Wednesday, November 26, 2003, at 06:12 AM, Dragan Jotanovic wrote:
You will need to write a custom analyzer. Don't worry, though
it's
quite straightforward. You will also need to write a Tokenizer, but
Lucene helps you a lot here.
Wouldn't I achieve the same result if I index "time out"
Maybe this will help?
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23545
Otis
--- Tun Lin <[EMAIL PROTECTED]> wrote:
> Hi,
>
> May I know how do I analyse Chinese input from Chinese text in
> Lucene?
>
> Do I use Analyser function in Lucene? If yes, how to go about using
> it?
>
> You will need to write a custom analyzer. Don't worry, though it's
> quite straightforward. You will also need to write a Tokenizer, but
> Lucene helps you a lot here.
Wouldn't I achieve the same result if I index "time out" like "time_out",
using StandardAnalyzer and later if I search for
On Tuesday, November 25, 2003, at 12:11 PM, Pleasant, Tracy wrote:
The documents I have index contain information regarding file names
also.
For instance 'return_results.pl' or something like that may be in the
document fields.
I am not understanding Lucene's way of searching:
1. If I search
On Tuesday, November 25, 2003, at 06:41 AM, Dragan Jotanovic wrote:
Hi. I need to tokenize text while indexing but I don't want space to
be delimiter. Delimiter should be my custom character (for example
comma). I understand that I would probably need to implement my own
analyzer, but could som
woah that seems like an awfully complex answer to the question of
how to tokenize at a comma rather than a space! %-)
On Tuesday, November 25, 2003, at 11:48 AM, MOYSE Gilles (Cetelem)
wrote:
Hi.
You should define expressions.
To define expressions, you first have to define an expression
What does this have to do with Lucene?
On Wednesday, November 26, 2003, at 01:04 AM, Tun Lin wrote:
I have created the following "log4j.properties" and put it in your
classpath but
it still has that error. Anyone can help?
log4j.rootCategory=stdout
log4j.appender.stdout=org.apache.log4j.Conso
On Tuesday, November 25, 2003, at 10:45 PM, marc wrote:
Hi,
assume a field has the following text
"Adenylate kinase (mitochondrial GTP:AMP phosphotransferase) "
the following searches all return this document
AMP
&
&
can someone explain this to me..i figured that only the first query
would be
> When you are doing two searches are you searching for two different terms?
>
No, I am searching for the same term.
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
22 matches
Mail list logo