What model or dictionary are you using with the lemmatizer?

jds

On Thu, Jul 6, 2017 at 6:05 PM, Ling <[email protected]> wrote:

> Hi, the problem with lemma is that, for "tmoble", the lemma returned by
> openNLP is "null", not "tmoble".
>
> Why is it?
>
> On Mon, Jul 3, 2017 at 6:54 PM, Rakesh P <[email protected]> wrote:
>
> > Hi,
> > Stemmer works based on some predefined rules. Examples for rules are
> "word
> > that ends with 'e'". So, if you want to get a meaning word after
> > preprocessing, then better use lemmatization.
> >
> > Regards,
> > Rakesh P
> >
> > > On 03-Jul-2017, at 10:24 PM, Ling <[email protected]> wrote:
> > >
> > > Hi, I noticed that some words are stemmed like the following:
> > >
> > > iphone ->  iphon
> > > tmobile -> T-mobil
> > >
> > > Is there some parameter to control this behavior? In such cases, those
> > > stems are actually harmful, making them become unknown words in text.
> > Since
> > > these are quite common, I am just curious whether there is a way to
> > change
> > > the default behavior.
> > >
> > > Thanks.
> > > Ling
> >
>

Reply via email to