Hi, I don't fully understand the question, but I'll try... One option is to simply not remove stop words. The second option is to use a trick like the one Nutch uses: http://lucene.apache.org/nutch/apidocs/org/apache/nutch/analysis/CommonGrams.html
Are there more options? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ________________________________ From: fei dong <dongfei...@gmail.com> To: solr-user <solr-user@lucene.apache.org> Sent: Friday, February 6, 2009 3:09:27 AM Subject: how to exactly query in the multitype I am using the "text" field type in the schema.xml provides basic text search for English text. But, it has a surprise: the actual text given to this field is not indexed as-is, and therefore searching for the raw text may not work. If you search "To Be Or Not To Be" or "s.h.e." in a "text" field, none of these words will found this document. If I query "K & B" ( an artist name) or, the result that only appears "K" is not what I suppose to. It's better to sometimes retrieval that can omit the stop word and sometimes keep the stop word. So that will not only index the text after removing stop word, but also index the raw text. How to support that requirement?