Embedding-based classifiers are good for some use cases (especially if you
are modeling for similarity), but the OpenNLP use cases (the initial ones)
are more about understanding text, as in tokenization, part-of-speech
tagging, chunking, dependency parsing and so on. I would not say one can
replace the other.

We use both in our pipeline, especially when we want to understand the text.
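
To illustrate the similarity use case, here is a minimal sketch of a
nearest-centroid classifier over precomputed embedding vectors. The class
name CentroidClassifier and its methods are hypothetical; this is not the
Katie implementation and not an OpenNLP API. It assumes the embeddings are
produced by some external model and that all vectors share one dimension.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical nearest-centroid classifier over precomputed embeddings. */
final class CentroidClassifier {
    // One centroid (mean embedding) per label.
    private final Map<String, float[]> centroids = new HashMap<>();

    /** Stores the element-wise mean of the example vectors as the label's centroid. */
    void train(String label, List<float[]> examples) {
        if (examples.isEmpty()) {
            throw new IllegalArgumentException("No examples for label " + label);
        }
        int dim = examples.get(0).length;
        float[] mean = new float[dim];
        for (float[] v : examples) {
            if (v.length != dim) {
                throw new IllegalArgumentException("Dimension mismatch");
            }
            for (int i = 0; i < dim; i++) {
                mean[i] += v[i];
            }
        }
        for (int i = 0; i < dim; i++) {
            mean[i] /= examples.size();
        }
        centroids.put(label, mean);
    }

    /** Returns the label whose centroid has the highest cosine similarity, or null if untrained. */
    String classify(float[] query) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (Map.Entry<String, float[]> e : centroids.entrySet()) {
            double score = cosine(query, e.getValue());
            if (score > bestScore) {
                bestScore = score;
                best = e.getKey();
            }
        }
        return best;
    }

    /** Cosine similarity; returns 0 when either vector has zero length. */
    static double cosine(float[] a, float[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("Dimension mismatch");
        }
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        double denom = Math.sqrt(normA) * Math.sqrt(normB);
        return denom == 0 ? 0 : dot / denom;
    }
}
```

A classifier like this only answers "which label is this text closest to";
it does not produce tokens, part-of-speech tags or chunks, which is why I
see the two approaches as complementary.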

--Thanks and Regards
Vaijanath

On Fri, Mar 15, 2024 at 8:09 AM Michael Wechner <[email protected]>
wrote:

> I have done an implementation within Katie
>
>
> https://github.com/wyona/katie-backend/blob/main/src/main/java/com/wyona/katie/services/ClassificationServiceEmbeddingsCentroidsImpl.java
>
> and I am currently testing it and will try to compare it with the Maximum
> Entropy implementation of OpenNLP.
>
> Will keep you posted
>
> Thanks
>
> Michael
>
> On Fri, 15 Mar 2024 at 14:33, Markus Jelsma <
> [email protected]> wrote:
>
> > I would certainly want to see if this algorithm performs any better than
> > the ones that are shipped with OpenNLP.
> >
> > Is there already a Jira ticket associated with this task?
> >
> > On Thu, 29 Feb 2024 at 20:28, Michael Wechner <
> > [email protected]> wrote:
> >
> > > Hi Vaijanath
> > >
> > > Thanks for your feedback!
> > >
> > > I just received your message, but I did not receive my own previous reply.
> > >
> > > I am also subscribed to other ASF mailing lists and don't experience
> > > this issue there.
> > >
> > > Thanks
> > >
> > > Michael
> > >
> > > On Thu, 29 Feb 2024 at 21:10, Vaijanath Rao <
> > > [email protected]> wrote:
> > >
> > > > Hi Michael,
> > > >
> > > > I am not sure about the delay, but a delay is possible if you have
> > > > subscribed with options such as all emails or the daily digest.
> > > >
> > > > --Thanks and regards
> > > > Vaijanath
> > > >
> > > > On Thu, Feb 29, 2024 at 6:10 AM Michael Wechner <[email protected]>
> > > > wrote:
> > > >
> > > > > Thanks for your feedback! I will have a closer look at
> > > > > https://github.com/jelmerk/hnswlib
> > > > >
> > > > > Btw, do others also experience strange issues with this mailing list?
> > > > > For example, I am subscribed with another email address, but I do not
> > > > > receive messages, or it takes a very long time to receive my own posts.
> > > > >
> > > > > Thanks
> > > > >
> > > > > Michael
> > > > >
> > > > > On Thu, 29 Feb 2024 at 14:45, Vaijanath Rao <
> > > > > [email protected]> wrote:
> > > > >
> > > > > > That might be good. We use https://github.com/jelmerk/hnswlib for
> > > > > > local testing and other vendors for real-time usage. With Lucene,
> > > > > > speed, scalability and updates were an issue for us.
> > > > > >
> > > > > > --Thanks and Regards
> > > > > > Vaijanath
> > > > > >
> > > > > > On Thu, Feb 29, 2024 at 12:50 AM Michael Wechner
> > > > > > <[email protected]> wrote:
> > > > > >
> > > > > > > Hi
> > > > > > >
> > > > > > > I recently found the following article about using text embeddings
> > > > > > > for text classification:
> > > > > > >
> > > > > > > https://medium.com/@juanc.olamendy/unlocking-the-power-of-text-classification-with-embeddings-7bcbb5912790
> > > > > > >
> > > > > > > Has anyone tried this?
> > > > > > >
> > > > > > > I am considering implementing this using Lucene vector search.
> > > > > > > Could this be a contribution to OpenNLP?
> > > > > > >
> > > > > > > Thanks
> > > > > > >
> > > > > > > Michael
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > I am feeling fine, healthier and Happier, what about you
> > > > > >
> > > > >
> > > >
> > > >
> > >
> >
>


-- 
I am feeling fine, healthier and Happier, what about you
