Hi all,

I've been working with Lucene since the beginning of the year and would like to add Nutch's bi-gram and automated proximity search functionality to my project. While doing my research I came across a PDF written by Doug Cutting with the title:
"Nutch & Lucene", dated December 2003. The document says that the Nutch analysis uses neither stop-word lists nor stemmers. I understand not using stop words, since Nutch uses bi-grams with common words, but could someone explain the decision not to use stemmers?

Thanks,
Andrew

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
