Hi, the problem with lemma is that, for "tmoble", the lemma returned by openNLP is "null", not "tmoble".
Why is it? On Mon, Jul 3, 2017 at 6:54 PM, Rakesh P <[email protected]> wrote: > Hi, > Stemmer works based on some predefined rules. Examples for rules are "word > that ends with 'e'". So, if you want to get a meaning word after > preprocessing, then better use lemmatization. > > Regards, > Rakesh P > > > On 03-Jul-2017, at 10:24 PM, Ling <[email protected]> wrote: > > > > Hi, I noticed that some words are stemmed like the following: > > > > iphone -> iphon > > tmobile -> T-mobil > > > > Is there some parameter to control this behavior? In such cases, those > > stems are actually harmful, making them become unknown words in text. > Since > > these are quite common, I am just curious whether there is a way to > change > > the default behavior. > > > > Thanks. > > Ling >
