Re: WSD-Supervised

2015-08-11 Thread Rodrigo Agerri
Hi Mondher, Yes, the preprocessing changes are all the same. With respect to the CLI and the Evaluator, they are common regardless of the type of WSD (unsupervised or supervised) so I was assuming that you were coordinating with Anthony as you have done before. Please note that it is very importa

Re: WSD-Supervised

2015-08-10 Thread Rodrigo Agerri
Hello Mondher, How is the all words IMS disambiguation progressing? We really need to focus on this to have a good candidate for integration in opennlp tools. The evaluator, the CLI and the all words supervised disambiguation should be the focus. Cheers, Rodrigo On Sat, Jul 18, 2015 at 5:40 AM,

WSD-Supervised

2015-07-17 Thread Mondher Bouazizi
Dear Rodrigo, I have made some major modifications on my part of the WSD component. I attached the patch to [1]. The main modifications are as follows: - Fixed the IMS approach (Supports now with Semsor3.0 data). - Implemented the IMS Evaluator. - Added and clarified some parts of the do

Re: WSD - Supervised techniques

2015-07-14 Thread Mondher Bouazizi
nsevalReader. > > That should be clearer, what do you think ? > > Anthony > > [1]: https://issues.apache.org/jira/browse/OPENNLP-794 > [2]: https://issues.apache.org/jira/browse/OPENNLP-795[3]: > https://issues.apache.org/jira/browse/OPENNLP-796 > > > From: rage...@apach

RE: WSD - Supervised techniques

2015-07-13 Thread Anthony Beylerian
? Anthony [1]: https://issues.apache.org/jira/browse/OPENNLP-794 [2]: https://issues.apache.org/jira/browse/OPENNLP-795[3]: https://issues.apache.org/jira/browse/OPENNLP-796 > From: rage...@apache.org > Date: Mon, 13 Jul 2015 15:50:00 +0200 > Subject: Re: WSD - Supervised techniques

Re: WSD - Supervised techniques

2015-07-13 Thread Rodrigo Agerri
Hello, It has been few public activity these last days. We believe that it is very important to step up in two directions wrt what is already commited in svn: 1. Finishing the WSDEvaluator 2. Provide the classes required to run the WSD tools from the CLI as any other component. 3. Formats: it wil

Re: WSD-Supervised Techniques

2015-07-02 Thread Rodrigo Agerri
Hello, Good, I will look at your path over the weekend and get back to you with any more specific comments/suggestions. With respect to the plans, be aware that the SST approach I mentioned is addressed as a sequence labelling problem, not as a classification problem. Instead of learning a classif

Re: WSD - Supervised techniques

2015-06-28 Thread Mondher Bouazizi
Hi everyone, I finished the first iteration of IMS approach for lexical sample disambiguation. Please find the patch uploaded on the jira issue [1]. I also created a tester (IMSTester) to run it. As I mentioned before, the approach is as follows: each time, the module is called to disambiguate a

Re: WSD - Supervised techniques

2015-06-24 Thread Joern Kottmann
On Fri, 2015-06-19 at 21:42 +0900, Mondher Bouazizi wrote: > Hi, > > Actually I have finished the implementation of most of the parts of the IMS > approach. I also made a parser for the Senseval-3 data. > > However I am currently working on two main points: > > - I am trying to figure out how to

Re: WSD - Supervised techniques

2015-06-19 Thread Mondher Bouazizi
Hi, Actually I have finished the implementation of most of the parts of the IMS approach. I also made a parser for the Senseval-3 data. However I am currently working on two main points: - I am trying to figure out how to use the MaxEnt classifier. Unfortunately there is no enough documentation,

Re: WSD - Supervised techniques

2015-06-19 Thread Rodrigo Agerri
Hi Mondher, On Fri, Jun 12, 2015 at 1:01 PM, Mondher Bouazizi wrote: > Dear Rodrigo, > > Here is what I am planning to do in the next step: > > 1- I am currently implementing the IMS method, and using Senseval 3 data. Hi, I guess you are training on semcor? http://web.eecs.umich.edu/~mihalcea/d

WSD - Supervised techniques

2015-06-12 Thread Mondher Bouazizi
Dear Rodrigo, Here is what I am planning to do in the next step: 1- I am currently implementing the IMS method, and using Senseval 3 data. Since the disambiguation training set, has to be very big (few hundreds of MBs if we want it to contain all the words),I thought, may be it would be better to