Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-25 Thread Hèctor Alòs i Font
Hi Lahari, Did you take a look on this page: https://wiki.apertium.org/wiki/Starting_a_new_language_with_HFST ? Telugu has already a minimal skeleton in Apertium: https://github.com/apertium/apertium-tel We would have to do something similar to what I proposed for Hindi, with the difference

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-25 Thread Lahari Sreeja Tallapaka
Sir, please explain how I should proceed with the Telugu Morphological Analyser. I've read the documentation available on Apertium Wiki Pages and looked at the apertium-tel GitHub page, which has very little information. There is information on how to make a new language pair; I'm unsure if that

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-23 Thread Tanmai Khanna
Telugu is a fairly decently resources language now, and has some well performing neural translators, like Hindi. We can still consider working with it in Apertium but first must figure out what our goal is, with it. If there's no particular goal then a low resource language would be preferable.

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-23 Thread Hèctor Alòs i Font
I agree with Daniel. Moreover, experience shows that it is also unrealistic to do such a complex project as a translator for distant languages in a GSoC. I would look at the Telugu morphological analyser currenly available in Apertium and concentrate on it. 90% of coverage could be a minimal goal.

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-23 Thread Daniel Swanson
Hi Lahari, For translation pairs, Hindi-English has been tried several times without success. I would suggest considering Hindi-Telugu. For other project ideas or places to get started, you can check the wiki page for each idea and do the coding challenge. If an idea is missing a coding

[Apertium-stuff] Applying for GSOC 2023 projects

2023-02-22 Thread Lahari Sreeja Tallapaka
Greetings to the community, I am Lahari Sreeja from the Indian Institute of Technology(IIT), Bhilai. I have taken Machine Learning, Natural Language processing, and Information retrieval courses and have experience in frontend web development. I know Telugu, Hindi, and English languages. And Im