Wang
Sent: 7/16/2009 12:09:05 PM
Subject: Re: searching for c++, c#, etc...
If you escape the character + or #, the sentence:
"I know java + c++" would not skip +, furthermore, it breaks query parsing,
where + is reserved.
-John
On Thu, Jul 16, 2009 at 9:04 AM, John Wang wrote:
> Thi
ally, the recipient should check this email and any attachments
>> for the presence of viruses. The company accepts no liability for any damage
>> caused by any virus transmitted by this email. Main Sequence Technologies,
>> Inc. 4420 Sherwin Rd. Willoughby OH 44094 www.pcrecruite
---
> To: java-user@lucene.apache.org, Chris Salem
> From: Danil TORIN
> Sent: 7/16/2009 10:28:37 AM
> Subject: Re: searching for c++, c#, etc...
>
>
> Try WhitespaceAnalyzer for both indexing and searching.
> On search-time you may also need to escape "+", "(&quo
10:28:37 AM
Subject: Re: searching for c++, c#, etc...
Try WhitespaceAnalyzer for both indexing and searching.
On search-time you may also need to escape "+", "(", ")" with "\".
"#" shouldn't need escaping.
On Thu, Jul 16, 2009 at 17:23,
#x27;t say anything about # having to be escaped.
> Do I have to escape during indexing too?
> Sincerely,
> Chris Salem
>
>
>
> - Original Message -
> To: java-user@lucene.apache.org, Chris Salem
> From: Ian Lea
> Sent: 7/16/2009 5:12:53 AM
> Subject: Re: searching f
to be escaped.
Do I have to escape during indexing too?
Sincerely,
Chris Salem
- Original Message -
To: java-user@lucene.apache.org, Chris Salem
From: Ian Lea
Sent: 7/16/2009 5:12:53 AM
Subject: Re: searching for c++, c#, etc...
Hi
Escaping should work. See
http://luc
Hi
Escaping should work. See
http://lucene.apache.org/java/2_4_1/queryparsersyntax.html and
QueryParser.escape(). And you need to be sure that your analyzer
isn't removing the plus signs and that you use the same analyzer for
indexing and searching.
Googling for something like "lucene escape"
Hello,
I'm trying to search for the terms like c++ but the parser is stripping off the
++. I tried escaping the ++ with slashes but it's still stripping it off. I
could replace + with "plus", is that the best way to do it? How come escaping
isn't working?
thanks
Sincerely,
Chris Salem
Thanks everyone. I appreciate the help.
I think I will write my own tokenizer, because I do not have a
predefined list of words with symbols.
I will modify the grammar by defining a SYMBOL token as John suggested
and redefine ALPHANUM to include it.
Regards,
Alex Soto
On Tue, Jun 24, 2008 at 1
This isn't ideal, but if you have a defined list of such terms, you
may find it easier to filter these terms out into a separate field
for indexing.
-h
--
Hira, N.R.
Solutions Architect
Cognocys, Inc.
(773) 251-7453
On 24-Ju
I don't think there is a simpler way. I think you will have to modify
the tokenizer. Once you go beyond basic human-readable text, you always
end up having to do that. I have modified the JavaCC version of
StandardTokenizer for allowing symbols to pass through, but I've never
used the JFlex ve
Hello:
I have a problem where I need to search for the term "C++".
If I use StandardAnalyzer, the "+" characters are removed and the
search is done on just the "c" character which is not what is
intended.
Yet, I need to use standard analyzer for the other benefits it provides.
I think I need to w
12 matches
Mail list logo