Re: [Apertium-stuff] Apply for GSOC [English-Bodo language pair]

2023-03-24 Thread Hèctor Alòs i Font
Hi, Maharaj,

Bodo seems to me to be an excellent language for Apertium. Unfortunately,
we have no morphological parser for it, not even an embryonic one. That is
why developing one seems to me much more realistic than planning an automatic
translator between Bodo and another language (even more so if that language
belongs to another language family).

It would therefore be a matter of doing this type of project:
https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code#Develop_a_morphological_analyser


I've been looking a bit at the morphology of the language, and lexd [1]
seems the best choice for developing a parser (e.g. because of the
possessive prefixes).

The next step would be to do the project's coding challenge.
There are about ten days left before the deadline. That is not a lot of
time.

I would suggest joining IRC to get faster help, if needed:
https://wiki.apertium.org/wiki/IRC

Best regards,
Hèctor

[1] https://github.com/apertium/lexd

Message from Maharaj Brahma via Apertium-stuff <
apertium-stuff@lists.sourceforge.net> on Fri, 24 March 2023 at 15:54:

> Dear Apertium folks,
>
> I'm writing to express my interest in participating in the GSoC with
> Apertium. I'm a first-year Ph.D. student in the NLP domain. I'm excited to
> work with the Apertium community on developing translation technology for
> low-resource languages, particularly Bodo. Bodo is a low-resource language
> primarily spoken in the Northeastern region of India. As an NLP researcher
> and native speaker of Bodo, I'm committed to building technology to
> preserve and promote indigenous languages like Bodo.
>
> I'm interested in adding a new language pair, English-Bodo, to the Apertium
> platform through this GSoC program or otherwise. I believe this could
> benefit translation technology for low-resource languages, for the
> following reasons:
> (i) There is no existing publicly available translation technology for
> English-Bodo.
> (ii) Bodo is a low-resource language (potentially ample contribution space
> remains).
> (iii) Bodo belongs to the Sino-Tibetan language family, unlike other Indian
> languages like Hindi and Assamese.
>
> I would like to know if this is a potential project. If so, I would like
> to interact with potential mentors.
>
> Any comments on the English-Bodo language pair are most welcome.
>
> With Regards,
> Maharaj Brahma
> Research Scholar
> Dept. of Computer Science & Engineering
> CS23RESCH01004
>
> Disclaimer:- This footer text is to convey that this email is sent by one
> of the users of IITH. So, do not mark it as SPAM.
> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>


Re: [Apertium-stuff] Apply for GSOC [English-Bodo language pair]

2023-03-24 Thread Maharaj Brahma via Apertium-stuff
Thanks, Hèctor, for the quick reply.

I agree with your comments and acknowledge the difficulties. However, I
believe that language technologies such as Apertium can have a significant
impact. As you suggest, developing a morphological analyser for Bodo is a
prerequisite for making an English-Bodo language pair possible in Apertium
in the near future. I would like to take this up if selected for the GSoC
2023 program.

I hope to contribute to Apertium beyond the GSoC.

I have already installed Apertium. I agree that lexd is the best choice of
parser for Bodo, since Bodo is an agglutinative, morphologically rich
language. Also, I'm already on IRC (under the handle maharaj), and would
like to know more about the coding challenge.

With Regards,
Maharaj Brahma
Research Scholar
Dept. of Computer Science & Engineering
CS23RESCH01004


On Fri, Mar 24, 2023 at 11:29 PM Hèctor Alòs i Font wrote:

> Hi, Maharaj,
>
> Bodo seems to me to be an excellent language for Apertium. Unfortunately,
> we have no morphological parser for it, not even an embryonic one. That is
> why developing one seems to me much more realistic than planning an automatic
> translator between Bodo and another language (even more so if that language
> belongs to another language family).
>
> It would therefore be a matter of doing this type of project:
> https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code#Develop_a_morphological_analyser
>
>
> I've been looking a bit at the morphology of the language, and lexd [1]
> seems the best choice for developing a parser (e.g. because of the
> possessive prefixes).
>
> The next step would be to do the project's coding challenge.
> There are about ten days left before the deadline. That is not a lot of
> time.
>
> I would suggest joining IRC to get faster help, if needed:
> https://wiki.apertium.org/wiki/IRC
>
> Best regards,
> Hèctor
>
> [1] https://github.com/apertium/lexd



[Apertium-stuff] GSoC 2023 proposal draft

2023-03-24 Thread Ahmed Siam
I have written a proposal draft (Internationalization of Apertium Tools).
I am looking for feedback.

Link: https://wiki.apertium.org/wiki/User:Ahmed_Siam/GSoC2023Proposal

-- Ahmed Siam


[Apertium-stuff] Apply for GSOC [English-Bodo language pair]

2023-03-24 Thread Maharaj Brahma via Apertium-stuff
Dear Apertium folks,

I'm writing to express my interest in participating in the GSoC with
Apertium. I'm a first-year Ph.D. student in the NLP domain. I'm excited to
work with the Apertium community on developing translation technology for
low-resource languages, particularly Bodo. Bodo is a low-resource language
primarily spoken in the Northeastern region of India. As an NLP researcher
and native speaker of Bodo, I'm committed to building technology to
preserve and promote indigenous languages like Bodo.

I'm interested in adding a new language pair, English-Bodo, to the Apertium
platform through this GSoC program or otherwise. I believe this could
benefit translation technology for low-resource languages, for the
following reasons:
(i) There is no existing publicly available translation technology for
English-Bodo.
(ii) Bodo is a low-resource language (potentially ample contribution space
remains).
(iii) Bodo belongs to the Sino-Tibetan language family, unlike other Indian
languages like Hindi and Assamese.

I would like to know if this is a potential project. If so, I would like to
interact with potential mentors.

Any comments on the English-Bodo language pair are most welcome.

With Regards,
Maharaj Brahma
Research Scholar
Dept. of Computer Science & Engineering
CS23RESCH01004



Re: [Apertium-stuff] Regarding GSOC Project

2023-03-24 Thread Gunbir Baveja
Hello,

I was playing with the API and found a potential issue. Also, as per the
description on the wiki, I am highly interested in implementing
transliteration for the Python API, as it is similar to a past project I
worked on that transliterates English to Hindi!

Sincerely,
Gunbir Singh Baveja


On Thu, Mar 23, 2023 at 2:15 AM Kevin Brubeck Unhammer wrote:

> >  Since there was no coding challenge mentioned on the wiki,
> > I'm assuming that there is none
>
> The task just links to the issue tracker
> https://github.com/apertium/apertium-python/issues
> so I'm guessing a good place to start would be to
>
> 1. try out the api, see what features are there
>
> 2. try out the rest of apertium, both through beta.apertium.org and the
>    command line, try both language pairs and monolingual packages
>    Install apertium using the nightly packages:
>    https://wiki.apertium.org/wiki/Prerequisites_for_Debian
>
> 3. look at open issues, see if there's one where you (after having tried
>    the existing packages) feel you understand the problem and try
>    solving it