On Jan 29, 2008, at 9:29 AM, christophe blin wrote:
Hi,
thanks for the pointer to the ellision filter, but I am currently
stuck with
lucene-core-2.2.0 found in maven2 central repository (do not contain
this
class). I'll watch for an upgrade to 2.3 in the future.
2.3 should be available in the repository:
http://repo1.maven.org/maven2/org/apache/lucene/lucene-core/2.3.0/
BTW, I think there is an error in the current javadoc because the
sentence
"Note that StandardTokenizer sees " ' " as a space, and cuts it
out." is
wrong (the standard tokenizer keeps the apostrophe as part of the
word, and
so it does not cut it out, as far as I can tell).
I believe this is correct, but then the StandardFilter then removes it.
Concerning the fact to do an EuropeanStandardFilter, I am afraid I
can only
do this for french and english (that is the implementation of
http://tinyurl.com/ynskw3 in fact).
Please don't use Tiny URL. We like to know where a link is going
before we click it. :-)
I have think about doing it but as 90% of my documents are french I
mainly
concentrate on it at the moment.
Thaks for the quick and accurate answer
Steven A Rowe wrote:
Hi Chris,
Looks like the ElisionFilter handles the French problems you
mentioned
This sounds really ambitious [to have a EuropeanStandardFilter, but
interesting - have you considered doing this?
Steve
--
View this message in context:
http://www.nabble.com/Apostrophe-filtering-in-StandardFilter-tp15156768p15160200.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com
http://www.lucenebootcamp.com
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]