*This message was transferred with a trial version of CommuniGate(tm) Pro*

I'm actually using a WhitespaceAnalyzer and WhitespaceTokenizer that I
wrote. It parses terms on, of course, whitespace. Is there a way to get a
list of the tokens that are generated after indexing?

-----Original Message-----
From: Brian Goetz [mailto:[EMAIL PROTECTED]]
Sent: Sunday, March 10, 2002 7:57 AM
To: Lucene Users List
Subject: Re: Using the term "C/C++" in queries


*This message was transferred with a trial version of CommuniGate(tm) Pro*

>Has anyone had any experience/problems with using "C/C++" in Lucene
queries?
>When I do a query using that as a term I get back a bunch of records that
do
>not contain the term. The resulting query ends up being like: +(+c/c++)
when
>parsed by the QueryParser. I've also tried just adding Term objects to a
>query myself and it has the same affect. Thanks for any help.

If you're using any of the basic analyzers, the / and + terms will be
dropped in tokenization.  If you want to search for terms like C++, you'll
need a tokenizer that treats + as a letter.


--
Brian Goetz
Quiotix Corporation
[EMAIL PROTECTED]           Tel: 650-843-1300            Fax: 650-324-8032

http://www.quiotix.com


--
To unsubscribe, e-mail:
<mailto:[EMAIL PROTECTED]>
For additional commands, e-mail:
<mailto:[EMAIL PROTECTED]>



--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to