Re: Standard Analyzer

2008-08-25 Thread tom
AUTOMATIC REPLY Tom Roberts is out of the office till 2nd September 2008. LUX reopens on 1st September 2008 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Standard Analyzer

2008-08-25 Thread Karl Wettin
25 aug 2008 kl. 09.19 skrev Kalani Ruwanpathirana: Hi, I am using StandardAnalyzer when creating the Lucene index. It indexes the word "wo&rk" as it is but does not index the word "wo*rk" in that manner. Can I index such words (including * and ?) as it is? Otherwise I have no way to ind

Re: Standard Analyzer

2008-08-25 Thread Kalani Ruwanpathirana
Hi, Thanks, I tried WhitespaceAnalyzer too, but it seems case sensitive. If I need to search for words like "correct?", "" (it escapes <, > and another few characters too) I need to index those kind of words. On Mon, Aug 25, 2008 at 1:15 PM, Karl Wettin <[EMAIL PROTECTED]> wrote: > > 25 aug 200

Re: Standard Analyzer

2008-08-25 Thread Karl Wettin
25 aug 2008 kl. 11.14 skrev Kalani Ruwanpathirana: Hi, Thanks, I tried WhitespaceAnalyzer too, but it seems case sensitive. Then you simply add a LowercaseFilter to the chain in the Analyzer: public final class WhitespaceAnalyzer extends Analyzer { public TokenStream tokenStream(String fi

Re: Re: Standard Analyzer

2008-08-25 Thread tom
AUTOMATIC REPLY Tom Roberts is out of the office till 2nd September 2008. LUX reopens on 1st September 2008 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Standard Analyzer Escapes

2007-07-13 Thread Yonik Seeley
I just tried some things fast via the Solr admin interface, and everything seems fine. I think you are probably confusing what the parser does vs what the analyzer does. Try your tests with an un-tokenized field to remove that effect. -Yonik On 7/13/07, Walt Stoneburner <[EMAIL PROTECTED]> wrote

Re: Standard Analyzer Escapes

2007-07-13 Thread Mark Miller
This is certainly the case. StandardAnalyzer has a regex matcher that looks for a possible company name involving an & or an @. The QueryParser is escaping the '&' -- all of the affects described are standard results of using the StandardAnalzyer. Any double '&&' will break text, but 'sdfdf&dfs