Grant, Thanks for your reply and the pointer to the custom code sample. I will be checking into that today. I did delve into the src for the OOTB analyzers and was aware of what they did. Still, the StandardAnalyzer does not do what I want. The real difference between my needs and the results of the StandardAnalyzer is that what I want is the union of the StandardAnalyzer and the StopAnalyzer. If you refer back to my original example...
An example of the data might be as follows: Hello XY&Z Corporation - [EMAIL PROTECTED] I would like the following terms to come out of the analyzer: [hello] [xy&z] [corporation] [EMAIL PROTECTED] [com] //this is the StandardAnalyzer output as well as [xy] [z] [abc] [example] I figured that creating a custom analyzer is the only way to do that, but unfortunately I am not that familiar with how the analyzers "really" work internally( I am more of a mathematician than a lexicon). If you have any other thoughts or ideas I would love to hear. Thanks, Tim Grant Ingersoll-6 wrote: > > So, I think the answer is that StandardAnalyzer already has what you > state you want. Is it, perhaps, that certain stopwords that you are > interested in are not currently being stopped? > -- View this message in context: http://www.nabble.com/Custom-Analyzer-Help-please-tf3469904.html#a9716016 Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
