Hi, Stemmer works based on some predefined rules. Examples for rules are "word that ends with 'e'". So, if you want to get a meaning word after preprocessing, then better use lemmatization.
Regards, Rakesh P > On 03-Jul-2017, at 10:24 PM, Ling <[email protected]> wrote: > > Hi, I noticed that some words are stemmed like the following: > > iphone -> iphon > tmobile -> T-mobil > > Is there some parameter to control this behavior? In such cases, those > stems are actually harmful, making them become unknown words in text. Since > these are quite common, I am just curious whether there is a way to change > the default behavior. > > Thanks. > Ling
