[Apertium-stuff] GSoC19 - Unsupervised weighting of automata progress update

Amr Mohamed Hosny Anwar Sun, 21 Jul 2019 19:27:29 -0700

Dear Francis, Nick, Tommi,

Hope this mail finds you well.
I would like to share with the blog posts that I have used to document 
the project's progress.
Firstly, The scores for the implemented methods that are computed using 
a custom script 
(https://github.com/apertium/lttoolbox/pull/55/files#diff-4791d142daa5e6d636af9488c64ef69a)
 
can be found here https://ak-blog.herokuapp.com/posts/7/


Secondly, I have done my best searching for relevant publications 
related to keywords such as: Morphological Disambiguation.
All the methods are supervised in one way or another.
I have documented my notes for the list of relevant publications here: 
https://ak-blog.herokuapp.com/posts/9/

Finally, I have made some tweaks to the supervised model and implemented 
a model based on the analyses length.
The model seems to be equivalent to the one that assigns the same weight 
to all the analyses and I believe this is a result of the way the 
lt-proc command works.
You can check my explanation/findings here: 
https://ak-blog.herokuapp.com/posts/10/

Looking forward to reading your advice on how to proceed with the project.
Additionally, Do you think we can make use of a parallel corpus for two 
languages in some way or another?
I know a parallel corpus is also somehow supervised but my intuition is 
that finding/developing parallel corpora is easier than 
finding/developing a tagged corpus.

Note: The blog is hosted using heroku as a free host so the first time 
you access a page might take some time to actually load :)

Regards,
Amr


_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

[Apertium-stuff] GSoC19 - Unsupervised weighting of automata progress update

Reply via email to