Hi Utku,

Your proposal seems interesting.

1. Did you take a look at apertium-ell? How much could it help?
2. In your proposal, you mention a corpus and say you intend to reach 80% coverage. What kind of corpus do you mean? How much written Romeyka is there?
3. Could you explain what you mean by "modelling allomorphy"? Is that Apertium's morphological disambiguation?
4. Could you also explain how you intend to tag "content phenomenon"?
5. I couldn't find anything about your coding challenge. The coding challenge is a must. It shows that you know how to install Apertium and have a basic understanding of it.

Hèctor

Message from Utku Turk <utkuturkb...@gmail.com> on Wed, 14 Apr 2021 at 15:58:

> Hi,
>
> My name is Utku Türk. I am a linguistics student at Boğaziçi University,
> Turkey. I want to attend GSoC with a Romeyka morphological analyzer
> project.
>
> Romeyka is one of the many Modern Greek dialects spoken in Asia Minor. It
> has no NLP footprint, and I believe it is an important first step for
> Quantitative Language Contact and Dialectology studies. Its morphology and
> lexicon are heavily influenced by Ancient Greek, Turkish, and Laz.
>
> The following link[1] is my draft for the GSoC proposal. Any feedback is
> very much appreciated!
>
> [1]:
> https://docs.google.com/document/d/1CJrD7TRJvFKKD5qsW_fnLdNbk1iQ2t4_dim3MA3g4_E/edit?usp=sharing
> _______________________________________________
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>