Fwd: Word sense disambiguation

2018-02-24 Thread Anthony Beylerian
ny links to papers? > How does the module perform when being evaluated against Senseval? > > How much work do you think it's necessary in order to have a functioning > WSD module in the context of OpenNLP? > > Thanks, > Cristian > > > > On Tue, Feb 20, 2018 at 8:09 AM

Re: Word sense disambiguation

2018-02-19 Thread Anthony Beylerian
Hi Cristian, Thank you for your interest. The WSD module is currently experimental, so as far as I am aware there is no timeline for it. You can find the sandboxed version here: https://github.com/apache/opennlp-sandbox/tree/master/opennlp-wsd I personally didn't have the time to revisit this

Re: [VOTE] Migrate our main repositories to GitHub

2017-06-27 Thread Anthony Beylerian
+1 On Tue, Jun 27, 2017 at 10:45 PM, Dan Russ wrote: > +1 > > > On Jun 27, 2017, at 9:28 AM, William Colen > wrote: > > > > +1 > > > > > > 2017-06-27 9:35 GMT-03:00 Suneel Marthi : > > > >> +1 > >> > >> मेरे iPhone से

Re: Fw: ApacheCon Europe 2016: Talk accepted!

2016-10-03 Thread Anthony Beylerian
Great! Thank you for sharing. For those who cannot attend, will slides/video be available? Anthony On Thu, Sep 29, 2016 at 12:14 AM, Tommaso Teofili wrote: > Very interesting ! > Thanks for letting us know. > > Tommaso > Il giorno mer 28 set 2016 alle 17:05 Boris

Re: Access to Git

2016-09-14 Thread Anthony Beylerian
Hello, Concerning the workflow, how about using Gitflow? [1] Advantages are: - keeps a clean master branch, work is on the develop branch - good for multiple (historical) versions - good integration with sourcetree Please consider. Thanks, Anthony [1] :

Re: Migrate to Git?

2016-08-19 Thread Anthony Beylerian
@Jörn @Richard I believe less bloat is always better for code housekeeping. For example, although it is small, I think having the site code along with the toolkit code just seems a bit untidy. How about we at least separate those two? It could also be useful to make a more feature rich site in

Re: Migrate to Git?

2016-08-19 Thread Anthony Beylerian
+1 for separate repositories. Since they will be under the Apache Github Organization, it will also be neater to browse them like this: https://github.com/apache?query=opennlp I recommend we keep the repository names starting with opennlp- For example :

Re: DeepLearning4J as a ML for OpenNLP

2016-07-02 Thread Anthony Beylerian
@William I think what you meant previously by feature2vec would be to deep-learn with any discrete state, not just with words, am I right? Extra side-information could possibly help improve some results, but this would make things overly complicated in my opinion. @Boris, Thank you very much, I

Re: DeepLearning4J as a ML for OpenNLP

2016-06-29 Thread Anthony Beylerian
rse-trees< > https://github.com/bgalitsky/relevance-based-on-parse-trees> > github.com > Automatically exported from > code.google.com/p/relevance-based-on-parse-trees > > > > > > From: Anthony Beylerian <anthony.beyler...@gmail.com> > Sent: Wednesday, June 29, 2016 2:13:38 AM > To: dev@opennlp.apache.org > Subject: Re: DeepLearning4J as a ML for OpenNLP > > +1 would be willing to help out when possible >

Re: DeepLearning4J as a ML for OpenNLP

2016-06-29 Thread Anthony Beylerian
; > Boris > > > > [https://avatars3.githubusercontent.com/u/1051120?v=3=400]< > > https://github.com/bgalitsky/relevance-based-on-parse-trees> > > > > bgalitsky/relevance-based-on-parse-trees< > > https://github.com/bgalitsky/relevance-based-on-parse-tr

Re: Performances of OpenNLP tools

2016-06-29 Thread Anthony Beylerian
> > > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > >> Office: 168-519, Mailstop: 168-527 > > > >> Email: chris.a.mattm...@nasa.gov > > > >> WWW: http://sunset.usc.edu/~mattmann/ > > > >> ++

Re: DeepLearning4J as a ML for OpenNLP

2016-06-29 Thread Anthony Beylerian
+1 would be willing to help out when possible

RE: Performances of OpenNLP tools

2016-06-21 Thread Anthony Beylerian
+1 Maybe we could put the results of the evaluator tests for each component somewhere on a webpage and on every release update them. This is of course provided there are reasonable data sets for testing each component. What do you think? Anthony > From: mondher.bouaz...@gmail.com > Date:

Re: [jira] [Updated] (TIKA-2000) Author profile parser

2016-06-16 Thread Anthony Beylerian
d a new field >> "*Estimated-Author-Age-Binary-Group" >> *to this. We can run multiple REST API call in parallel and >> enable/disable through property file. Basically let user define what all >> API it wants to run and we can club all the results together throu

Re: Profiler for OpenNLP

2016-06-09 Thread Anthony Beylerian
fice: 168-519, Mailstop: 168-527 > > > > Email: chris.a.mattm...@nasa.gov > > > > WWW: http://sunset.usc.edu/~mattmann/ > > > > ++++++++++ > > > > Director, Information Retrieval and Data Sci

Profiler for OpenNLP

2016-06-07 Thread Anthony Beylerian
Hello, We are currently working on an experimental author profiler that we think could be added to the toolkit. The profiler aims to detect the gender and age range of an author. Later we hope to add personality aspects such as: [extroverted, stable, agreeable, conscientious] We would like the

Re: Updates on SentimentAnalysisParser

2016-06-04 Thread Anthony Beylerian
Hi Anastasija, Good work sounds great, I will try to review it when it's available. Just curious, which approach has been implemented so far ? Best, Anthony On Sat, Jun 4, 2016 at 9:36 PM, Anastasija Mensikova < mensikova.anastas...@gmail.com> wrote: > Hello everyone, > > I hope you are

Re: GSoC 2016: OpenNLP Sentiment Analysis

2016-05-08 Thread Anthony Beylerian
gt;>On 25 April 2016 at 15:23, Madhawa Kasun Gunasekara >> >><madhaw...@gmail.com> wrote: >> >> >> >>Hi all, >> >> >> >> >> >>Shall we have the hangout session tomorrow (Tuesday) about 18:30 IST ? >> >> >

Re: GSoC 2016: OpenNLP Sentiment Analysis

2016-04-27 Thread Anthony Beylerian
/semeval2016/task5/ > > The task is well circumscribed plus data is publicly available, which > is good to try and make manageable objectives for a GSOC. > > Best, > > Rodrigo > > > > On Tue, Apr 26, 2016 at 6:10 PM, Anthony Beylerian > <anthony.beyler...@gma

Re: GSoC 2016: OpenNLP Sentiment Analysis

2016-04-24 Thread Anthony Beylerian
-527 >>> Email: chris.a.mattm...@nasa.gov >>> WWW: http://sunset.usc.edu/~mattmann/ >>> ++++++ >>> Director, Information Retrieval and Data Science Group (IRDS) >>> Adjunct Associate Profes

RE: GSOC2016 Sentiment Analysis

2016-03-29 Thread Anthony Beylerian
+ > > > > > > -Original Message- > From: Mondher Bouazizi <mondher.bouaz...@gmail.com> > Date: Monday, March 28, 2016 at 11:46 PM > To: Madhawa Kasun Gunasekara <madhaw...@gmail.com>, jpluser > <chris.a.mattm...@jpl.nasa.gov> >

RE: GSOC2016 Sentiment Analysis

2016-03-28 Thread Anthony Beylerian
Dear Chris, Thank you for starting the discussion. We are glad there is an interest in a sentiment analysis component. My colleague Mondher posted the two JIRA issues related to Sentiment Analysis [1][2] as references for our proposals [3][4] for GSoC. In fact, we have been researching this

RE: Word Sense Disambiguator

2015-11-02 Thread Anthony Beylerian
> From: cristian.petro...@gmail.com > To: dev@opennlp.apache.org > > Hi, > > Thanks Anthony for the info. > Does anybody else know when the WSD component will be merged into trunk and > possibly cut a release with it? > > Thanks > > On Sat, Sep 19, 2015 at 9:21 AM, Anthony

Re: Word Sense Disambiguator

2015-09-19 Thread Anthony Beylerian
t; >> 1 > >> to get the YAGO ids. > >> > >> Aliaksandr > >> > >> > >> On 9 September 2015 at 09:51, Cristian Petroaca < > >> cristian.petro...@gmail.com > >> > wrote: > >> > >> >

GSoC - WSD component

2015-09-01 Thread Anthony Beylerian
Hello, We have received the results concerning this year's GSoC. I am glad we have passed the final evaluation ! I would really like to thank Jörn and Rodrigo's support during the program. We have enjoyed the challenges and hope to contribute in the future. Concerning the next steps, we are

RE: Word Sense Disambiguator

2015-07-24 Thread Anthony Beylerian
+0200 Subject: Re: Word Sense Disambiguator From: kottm...@gmail.com To: dev@opennlp.apache.org It would be nice if you could share instructions on how to run it. I also would like to give it a try. Jörn On Fri, Jul 24, 2015 at 4:54 AM, Anthony Beylerian anthonybeyler...@hotmail.com wrote

RE: Word Sense Disambiguator

2015-07-23 Thread Anthony Beylerian
Hello, Yes for the moment we are only using WordNet for sense definitions.The plan is to complete the package by mid to late August, but if you like you can follow up on the progress from the sandbox. Best regards, Anthony Date: Thu, 23 Jul 2015 15:36:57 +0300 Subject: Word Sense Disambiguator

RE: WSD - Supervised techniques

2015-07-13 Thread Anthony Beylerian
Dear Rodrigo, Thank you for the feedback. I have added [1][2][3] issues regarding the below. Concerning the testers (IMSTester etc) they should be in src/test/java/ We can add docs in those to explain how to use each implementation. Actually, I am using the parser for Senseval3 that

RE: GSoC 2015 - WSD Module

2015-06-14 Thread Anthony Beylerian
Hi, Concerning this point, I would like to ask about BabelNet [1].The advantages of [1] is that it integrates WordNet, Wikipedia, Wiktionary, OmegaWiki, Wikidata, and Open Multi-WordNet. Also, the newest SemEval task (which results are just out [2]) relies on it. Howeover, the 2.5.1 version,

RE: GSoC 2015 - WSD Module

2015-06-10 Thread Anthony Beylerian
Hi, I attached an initial patch to OPENNLP-758. However, we are currently modifying things a bit since many approaches need to be supported, but would like your recommendations. Here are some notes : 1 - We used extJWNL 2- [WSDisambiguator] is the main interface 3- [Loader] loads the resources

RE: GSoC 2015 - WSD Module

2015-06-03 Thread Anthony Beylerian
Dear Jörn, Thank you for the reply.=== Yes in the draft WSDisambiguator is the main interface. === Yes for the disambiguate method the input is expected to be tokenized, it should be an input array. The second argument is for the

RE: GSoC 2015 - WSD Module

2015-05-18 Thread Anthony Beylerian
really appreciate discussing them and would like your guidance to iterate on this tool-set. Best regards, Anthony Beylerian, Mondher Bouazizi

GSoC - Self introduction

2015-05-03 Thread Anthony Beylerian
time. Anthony Beylerian Dear all, I am Mondher Bouazizi, from Tunisia. I am a Master's student at Keio University in Japan. My academic research is currently focusing on Data Mining. I am glad to inform you that my project proposal has been accepted for the Google Summer of Code 2015

RE: Word Sense Disambiguation

2015-02-18 Thread Anthony Beylerian
Thank you for the feedback, I believe that having separate interfaces as mentioned for sense provision and disambiguation would be a good idea. We will try to survey the techniques and study the library further to propose a first structure when possible. Best, Anthony Subject: Re: Word

RE: Word Sense Disambiguation

2015-02-13 Thread Anthony Beylerian
ahead and implement the proposed approach. One goal of this implementation will be to figure out the interface we want to have in OpenNLP for WSD. We can later extend OpenNLP with more implementations which are taking different approaches. Jörn On Thu, 2015-01-15 at 16:50 +0900, Anthony

Word Sense Disambiguation

2015-01-14 Thread Anthony Beylerian
Hello, I'm new here, I previously mentioned to Jörn about my colleagues and myself being interested in helping to implement this component, we were thinking of starting with simple knowledge based approaches, although they do not yield high accuracy, but as a first step they are relatively