Jörn, I can help removing deprecated code. I started with
PlainTextByLineStream. It is used everywhere so there is a lot to change.


2016-11-08 9:08 GMT-02:00 Joern Kottmann <kottm...@gmail.com>:

> I suggest we remove more deprecated code, there is still a lot which could
> be removed and is really old.
> It is a bit of a boring task, if anyone has some spare cycles help would be
> welcome.
>
> Jörn
>
> On Tue, Nov 8, 2016 at 9:59 AM, Aliaksandr Autayeu <aliaksa...@autayeu.com
> >
> wrote:
>
> > +1 for 1.7 (also due to lemmatized changes and removal of deprecated
> code).
> >
> > On 8 November 2016 at 09:48, Rodrigo Agerri <rage...@apache.org> wrote:
> >
> > > Hello,
> > >
> > > +1 1.7.0 in next release and +1 for a yearly release
> > >
> > > Just to provide some info, the main changes in the lemmatizer have
> been:
> > >
> > > 1. Added a supervised statistical lemmatizer, usable from the CLI and
> > > API. The supervised lemmaitzer now provides a much better coverage for
> > > unknown words with respect to the previously existing dictionary-based
> > > one.
> > > 2. The lemmatizer component has been rewritten and the API therefore
> > > has substantially changed. Thus, the changes in the Dictionary-based
> > > lemmatizer are not backward compatible. In any case, I do not think
> > > that so many people was using it and the change at using the API is
> > > minor.
> > >
> > > The new statistical lemmatizer can support the Dictionary-based
> > > lemmatizers often used to provide features for components such as Word
> > > Sense Disambiguation, Opinion Mining/Sentiment Analysis, etc. In this
> > > regard, it will be nice to aim at working on the development of those
> > > two components for their release. Maybe the next release is too close,
> > > but definitely for the next one.
> > >
> > > Cheers,
> > >
> > > Rodrigo
> > >
> > > On Mon, Nov 7, 2016 at 7:01 PM, Russ, Daniel (NIH/CIT) [E]
> > > <dr...@mail.nih.gov> wrote:
> > > > Also the lemmatizer has significantly changed.  I vote 1.7
> > > >
> > > > On 11/7/16, 12:59 PM, "Joern Kottmann" <kottm...@gmail.com> wrote:
> > > >
> > > >     Hello all,
> > > >
> > > >     since our last release it has been a while and we received quite
> a
> > > few
> > > >     changes which would be nice to get released.
> > > >
> > > >     There are still some open Jira issues, but mostly smaller things
> > that
> > > >     can be wrapped up rather quickly.
> > > >
> > > >     Is there anything important missing which should go into the next
> > > >     release? Otherwise I think we should also aim for more frequent
> > > >     released and just make one again early next year, with all the
> > stuff
> > > we
> > > >     might miss out now.
> > > >
> > > >     We took in a patch - as part of OPENNLP-830 - to replace our
> > > self-made
> > > >     hash table with the java.util.HashMap. This change is not
> backward
> > > >     compatible for folks who extend AbstractModel.
> > > >
> > > >     Should we go with 1.6.1 as a next version or should we make 1.7.0
> > to
> > > >     reflect that?
> > > >
> > > >     Previously we only had backward incompatible changes in versions
> > > which
> > > >     bumped by the second number. Maybe that is better choice. It will
> > > >     probably break some peoples code when they update.
> > > >
> > > >     We also have lots of deprecated API still in OpenNLP, should we
> try
> > > to
> > > >     remove as much as possible of it now?
> > > >
> > > >     Jörn
> > > >
> > > >
> > >
> >
>

Reply via email to