Hi,
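A stemmer works by applying predefined suffix-stripping rules. An example of 
such a rule is "if the word ends with 'e', remove that ending", which is how 
iphone becomes iphon. The output is a stem, not necessarily a real word. So, 
if you want a meaningful word after preprocessing, it is better to use 
lemmatization, which maps a word to its dictionary form.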
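
To make the difference concrete, here is a minimal Python sketch. It is a toy 
and not the rule set of any real stemmer: SUFFIX_RULES, LEMMA_DICT, PROTECTED 
and all three function names are made up for illustration. The protected-word 
filter at the end is one possible workaround applied before stemming; it is 
an assumption here, not a documented parameter of the stemmer in question.

```python
# Toy illustration only: these rules are hypothetical and far simpler
# than what a real stemmer (e.g. Porter/Snowball) applies.
SUFFIX_RULES = [("ing", ""), ("es", ""), ("s", ""), ("e", "")]

# Hypothetical lookup table standing in for a lemmatizer's dictionary.
LEMMA_DICT = {"phones": "phone", "running": "run"}

# Hypothetical list of words that should pass through unchanged.
PROTECTED = {"iphone", "tmobile"}


def toy_stem(word):
    """Apply the first matching suffix rule, keeping at least 3 characters."""
    for suffix, replacement in SUFFIX_RULES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: len(word) - len(suffix)] + replacement
    return word


def toy_lemmatize(word):
    """Return the dictionary form if known, otherwise the word unchanged."""
    return LEMMA_DICT.get(word, word)


def stem_with_protection(word):
    """Skip stemming entirely for words on the protected list."""
    return word if word.lower() in PROTECTED else toy_stem(word)


if __name__ == "__main__":
    for w in ["iphone", "tmobile", "phones"]:
        print(w, toy_stem(w), toy_lemmatize(w), stem_with_protection(w))
```

The rule-based function blindly cuts the trailing 'e', while the dictionary 
lookup leaves unknown words such as brand names untouched.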

Regards,
Rakesh P

> On 03-Jul-2017, at 10:24 PM, Ling <[email protected]> wrote:
> 
> Hi, I noticed that some words are stemmed like the following:
> 
> iphone ->  iphon
> tmobile -> T-mobil
> 
> Is there some parameter to control this behavior? In such cases, those
> stems are actually harmful, making them become unknown words in text. Since
> these are quite common, I am just curious whether there is a way to change
> the default behavior.
> 
> Thanks.
> Ling
