I thank you both. I have it already partly implemented here. It seems easy.
At least this should carry through my product until I can really get to use
Lucene. I am not sure how far I can take MySQL with stemmed, indexed key
words, but it should give me maybe 6 months at least of something use
This is so cool, Otis. I was just about to write this based on something in
the FAQ, but this is better than what I was doing.
This rocks!!! Thank you.
JohnE
P.S.: I am assuming you use org.apache.lucene.analysis.Token? There are
three Token classes under Lucene.
- Original Message -
From: Ot
Is there a way to use Lucene stemming and stop word removal without using the
rest of the tool? I am downloading the code now, but I imagine the answer
might be deeply buried. I would like to be able to send in a phrase and get
back a collection of keywords if possible.
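
To make the "phrase in, keywords out" idea concrete, here is a minimal sketch in plain Java. It does not call Lucene: the stop list and the suffix-stripping stem() are toy stand-ins (assumptions for illustration only) for Lucene's stop word handling and a real stemmer such as Porter, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

final class KeywordExtractor {
    // Tiny illustrative stop list (an assumption, not Lucene's list).
    private static final Set<String> STOP_WORDS = new HashSet<>(Arrays.asList(
            "a", "an", "and", "are", "in", "is", "of", "the", "to"));

    // Toy suffix stripper standing in for a real stemmer; not the Porter algorithm.
    static String stem(String word) {
        if (word.length() > 5 && word.endsWith("ing")) {
            return word.substring(0, word.length() - 3);
        }
        if (word.length() > 4 && word.endsWith("ed")) {
            return word.substring(0, word.length() - 2);
        }
        if (word.length() > 3 && word.endsWith("s") && !word.endsWith("ss")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Lowercases the phrase, splits on non-alphanumerics, drops stop words, stems the rest.
    static List<String> keywords(String phrase) {
        List<String> out = new ArrayList<>();
        for (String raw : phrase.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (raw.isEmpty() || STOP_WORDS.contains(raw)) {
                continue;
            }
            out.add(stem(raw));
        }
        return out;
    }
}
```

With these toy rules, KeywordExtractor.keywords("The indexed records are growing in the database") returns [index, record, grow, database]. The resulting list is what would be stored in a database search field.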
I am thinking of us
I was thinking that perhaps I can pre-stem words before sticking them in a
search field in the database perhaps using Lucene stemming code, then try to
use the Natural Language Search found in MySQL 4.1.1. I am confident the
MySQL product can't keep up with Lucene yet, but at least they have
Very cool, Luke. I am not quite there yet. I am halfway through implementing
the queue approach, but I have hit walls that are making me sit back and figure
out my strategy. I have a struts/tomcat/ojb/mysql project that can
potentially have a million records and growing over time and updates
I am interested in hearing experienced people's understanding, as I have half
the queue approach developed already.
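
For readers following the thread, the queue approach could be sketched roughly as below: changes are queued as they happen and applied to the index in batches, so the expensive writer open happens once per batch instead of once per update. This is only a sketch under assumptions. IndexUpdateQueue and BatchSink are hypothetical names, and the sink is where real code would open the Lucene writer, apply the batch, and close it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

final class IndexUpdateQueue {
    // Hypothetical sink: a real one would open the index writer once per batch.
    interface BatchSink {
        void apply(List<String> documentIds);
    }

    private final BlockingQueue<String> pending = new LinkedBlockingQueue<>();
    private final BatchSink sink;
    private final int maxBatch;

    IndexUpdateQueue(BatchSink sink, int maxBatch) {
        if (maxBatch < 1) {
            throw new IllegalArgumentException("maxBatch must be at least 1");
        }
        this.sink = sink;
        this.maxBatch = maxBatch;
    }

    // Called by the application whenever a record is created or updated.
    void enqueue(String documentId) {
        pending.add(documentId);
    }

    // Drains up to maxBatch queued ids and hands them to the sink in one call.
    // Returns the number of ids processed; the sink is not called for an empty batch.
    int flush() {
        List<String> batch = new ArrayList<>();
        pending.drainTo(batch, maxBatch);
        if (!batch.isEmpty()) {
            sink.apply(batch);
        }
        return batch.size();
    }
}
```

A background thread or scheduled task would call flush() periodically. Batch size and flush interval are tuning knobs that trade index freshness against the cost of opening the writer.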
I am not following why you don't like the queue approach, Sergiu. From what I
gathered from this board, if you do lots of updates, the opening of the
IndexWriter is very intensive
It really seems like I am not the only person having this issue.
So far I am seeing 2 solutions and honestly I don't love either totally. I am
thinking that without changes to Lucene itself, the best "general" way to
implement this might be to have a queue of changes and have Lucene work off
t
I am new to Lucene, but have a large project in production on the web using
other Apache software including Tomcat, Struts, OJB, and others.
The database I need to support will hopefully grow to millions of records.
Right now it only has thousands but it is growing. These documents get
upda