Re: searching for c++, c#, etc...

2009-07-16 Thread Chris Salem
Wang Sent: 7/16/2009 12:09:05 PM Subject: Re: searching for c++, c#, etc... If you escape the character + or #, the sentence: "I know java + c++" would not skip +, furthermore, it breaks query parsing, where + is reserved. -John On Thu, Jul 16, 2009 at 9:04 AM, John Wang wrote: > Thi

Re: searching for c++, c#, etc...

2009-07-16 Thread John Wang
ally, the recipient should check this email and any attachments >> for the presence of viruses. The company accepts no liability for any damage >> caused by any virus transmitted by this email. Main Sequence Technologies, >> Inc. 4420 Sherwin Rd. Willoughby OH 44094 www.pcrecruite

Re: searching for c++, c#, etc...

2009-07-16 Thread John Wang
--- > To: java-user@lucene.apache.org, Chris Salem > From: Danil TORIN > Sent: 7/16/2009 10:28:37 AM > Subject: Re: searching for c++, c#, etc... > > > Try WhitespaceAnalyzer for both indexing and searching. > On search-time you may also need to escape "+", "(&quo

Re: searching for c++, c#, etc...

2009-07-16 Thread Chris Salem
10:28:37 AM Subject: Re: searching for c++, c#, etc... Try WhitespaceAnalyzer for both indexing and searching. On search-time you may also need to escape "+", "(", ")" with "\". "#" shouldn't need escaping. On Thu, Jul 16, 2009 at 17:23,

Re: searching for c++, c#, etc...

2009-07-16 Thread Danil Ε’ORIN
#x27;t say anything about # having to be escaped. > Do I have to escape during indexing too? > Sincerely, > Chris Salem > > > > - Original Message - > To: java-user@lucene.apache.org, Chris Salem > From: Ian Lea > Sent: 7/16/2009 5:12:53 AM > Subject: Re: searching f

Re: searching for c++, c#, etc...

2009-07-16 Thread Chris Salem
to be escaped. Do I have to escape during indexing too? Sincerely, Chris Salem - Original Message - To: java-user@lucene.apache.org, Chris Salem From: Ian Lea Sent: 7/16/2009 5:12:53 AM Subject: Re: searching for c++, c#, etc... Hi Escaping should work. See http://luc

Re: searching for c++, c#, etc...

2009-07-16 Thread Ian Lea
Hi Escaping should work. See http://lucene.apache.org/java/2_4_1/queryparsersyntax.html and QueryParser.escape(). And you need to be sure that your analyzer isn't removing the plus signs and that you use the same analyzer for indexing and searching. Googling for something like "lucene escape"

searching for c++, c#, etc...

2009-07-15 Thread Chris Salem
Hello, I'm trying to search for the terms like c++ but the parser is stripping off the ++. I tried escaping the ++ with slashes but it's still stripping it off. I could replace + with "plus", is that the best way to do it? How come escaping isn't working? thanks Sincerely, Chris Salem

Re: searching for C++

2008-06-24 Thread Alex Soto
Thanks everyone. I appreciate the help. I think I will write my own tokenizer, because I do not have a predefined list of words with symbols. I will modify the grammar by defining a SYMBOL token as John suggested and redefine ALPHANUM to include it. Regards, Alex Soto On Tue, Jun 24, 2008 at 1

Re: searching for C++

2008-06-24 Thread N. Hira
This isn't ideal, but if you have a defined list of such terms, you may find it easier to filter these terms out into a separate field for indexing. -h -- Hira, N.R. Solutions Architect Cognocys, Inc. (773) 251-7453 On 24-Ju

Re: searching for C++

2008-06-24 Thread John Byrne
I don't think there is a simpler way. I think you will have to modify the tokenizer. Once you go beyond basic human-readable text, you always end up having to do that. I have modified the JavaCC version of StandardTokenizer for allowing symbols to pass through, but I've never used the JFlex ve

searching for C++

2008-06-24 Thread Alex Soto
Hello: I have a problem where I need to search for the term "C++". If I use StandardAnalyzer, the "+" characters are removed and the search is done on just the "c" character which is not what is intended. Yet, I need to use standard analyzer for the other benefits it provides. I think I need to w