Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Khushi -
Thanks a lot for your feedback!
It would be great if you could tell me how I should get started with this
and what milestones I should aim for in order to improve it.

Regards,
Khushi Harsure


Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Hèctor Alòs i Font
Hi Khushi,

First: Hindi-Marathi is already available on Google. I think you should
make the case for why developing it in Apertium would be useful. A priori,
it does not seem like an especially promising project.

As for your current question, why should the pair be created again from
scratch? Have you seen something wrong with it? In principle, I don't see
why the work that has already been done should be wasted. I would only start
over if, after analysing it, the existing pair turned out to be appalling
(which would be surprising).

I don't know Hindi, but from what I saw two years ago, the morphological
analyser seems to have a lot of room for improvement. It might make sense
to concentrate on it and its morphological disambiguator. This would help
to subsequently develop translators between low-resource Indo-Aryan
languages and Hindi.
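
One rough way to see how much room an analyser has to improve is to measure naive coverage: the share of tokens that receive at least one analysis. The following is a minimal Python sketch, assuming the analyser output is in the Apertium stream format, where an unknown token is conventionally emitted with an analysis starting with `*`. The function name, the tags, and the sample string are made up for illustration and are not taken from the Hindi analyser.

```python
import re

# One lexical unit in the Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(analysed_text: str) -> float:
    """Return the fraction of lexical units that have at least one real analysis.

    Assumption: an unknown token is marked by an analysis starting with '*'.
    Escaped characters in the stream are not handled.
    """
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    known = sum(1 for _surface, analyses in units if not analyses.startswith("*"))
    return known / len(units)


# Hypothetical analyser output: two known tokens and one unknown token.
sample = "^a/a<det>$ ^b/b<n><sg>$ ^zzz/*zzz$"
print(naive_coverage(sample))
```

Running something like this over a corpus before and after a round of dictionary work gives a simple number to track, although it says nothing about whether the analyses are correct.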

Hèctor

___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] GSOC2023

2023-02-24 Thread Eiji Miyamoto
Thank you for your reply. The project seems like a cool one to work on for
GSoC 2023, and I would like to take part. I see there are two tasks on the
page; could you tell me where to start?



Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Daniel Swanson
Hi Khushi,

Unless the existing one is both very old and very small, starting
from scratch is a complete waste of time.

The most important thing you can do to improve a translation pair is to add
entries to the dictionaries (monolingual and bilingual). Then, add rules to
improve the output as necessary.
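
As an illustration of what adding a bilingual dictionary entry involves, here is a minimal Python sketch that builds one entry in the XML layout used by `.dix` bilingual dictionaries (an `e` element holding a `p` pair with `l` and `r` sides). The helper name `bidix_entry`, the tag `n`, and the example lemma are assumptions chosen for illustration; check the conventions actually used in apertium-mar-hin before adding entries.

```python
import xml.etree.ElementTree as ET


def bidix_entry(left: str, right: str, tags: list[str]) -> str:
    """Build one bilingual-dictionary entry as an XML string.

    The left and right sides share the same tags here, which is the
    simplest case; real entries often differ between the two sides.
    """
    entry = ET.Element("e")
    pair = ET.SubElement(entry, "p")
    for side_name, lemma in (("l", left), ("r", right)):
        side = ET.SubElement(pair, side_name)
        side.text = lemma
        for tag in tags:
            ET.SubElement(side, "s", {"n": tag})
    return ET.tostring(entry, encoding="unicode")


# 'घर' ("house") is used on both sides purely as an example lemma.
print(bidix_entry("घर", "घर", ["n"]))
```

The same lemma also needs an entry in each monolingual dictionary, otherwise the bilingual entry is never reached.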

Daniel



Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Khushi -
Respected sir,
Thanks a lot for your response. I am glad that you appreciate it. I wanted
to clear up some doubts before I start working on it.
I would like to know whether you want me to work on the existing
Marathi-Hindi translator or whether I should create a new one from scratch.
In the former case, what kind of improvements or contributions would be
expected? Looking forward to hearing from you soon!

Regards,
Khushi Harsure






[Apertium-stuff] Adding Bhojpuri, Maithili and Magahi

2023-02-24 Thread Anil Singh
Hello,

If any native speaker of one of these languages who also has a linguistic
background is interested, we can try to add or enhance support for these
languages in Apertium. We already have the paradigms, but they lack full
documentation, so the feature-value sets for each paradigm entry are not
fully identified. A native speaker with a linguistic background (I don't
know any of these languages myself) could add that information so that we
can fill in these paradigms in Apertium. There is also some other data for
these languages.

Best regards,

Anil Kumar Singh


Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Daniel Swanson
Hi Khushi,

Yeah, that sounds like a good project to me.

Next steps would be opening a pull request on
https://github.com/apertium/apertium-mar-hin and requesting a wiki account
to write your workplan.

Daniel



Re: [Apertium-stuff] Extra output from lt-proc

2023-02-24 Thread Zanga Chimombo
Much appreciated, Daniel!





[Apertium-stuff] R: Re: Error in translation English-Italian

2023-02-24 Thread s_lombardini--- via Apertium-stuff
I think that is the one: mediawiki.org sends messages about various pieces
of content that need to be translated from English.
SL





Re: [Apertium-stuff] Extra output from lt-proc

2023-02-24 Thread Daniel Swanson
The issue is that %[%+cl1%] and %[%+cl9%] are not mentioned in
apertium-yao.yao.twoc, and thus you get non-matching pairs.

Daniel





Re: [Apertium-stuff] Error in translation English-Italian

2023-02-24 Thread Kevin Brubeck Unhammer
> I use Metawiki as a translator. Often I find that the English word
> 'for' is translated by Apertium with 'partorisca'. This Italian word
> is a verb meaning 'to give birth'. The correct translation is 'per'.
> Is it possible to fix it?


According to Kartik Mistry, apertium-eng-ita shouldn't be enabled in
MediaWiki Content Translation at all (it is in the incubator, not released).
They do, however, have Google Translate and other MT systems doing eng-ita
there.

By Metawiki, do you mean the Wikipedia content translation system
https://www.mediawiki.org/wiki/Content_translation
or something else?






[Apertium-stuff] Extra output from lt-proc

2023-02-24 Thread Zanga Chimombo
Gurus,

I am getting extra output from lt-proc. The lexc file is at
https://gitlab.com/zangaphee/CiBantu/-/blob/master/twoc/apertium-yao/apertium-yao.yao.lexc

From lines 227 and 229, I expect two outputs (cl7 and cl9).

However I get three:
echo yekulungwa | lt-proc yao.automorf.bin
^yekulungwa/kulungwa/kulungwa/kulungwa$
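
To check exactly which analyses come back, the lt-proc output can be split programmatically. The following is a minimal Python sketch, assuming the `^surface/analysis/...$` layout shown above. The function name is made up, and the sample string is copied as it appears in this message, where any tags on the analyses are not visible.

```python
from collections import Counter


def split_lexical_unit(unit: str) -> tuple[str, list[str]]:
    """Split '^surface/a1/a2$' into the surface form and its analyses.

    Escaped slashes are not handled; this is only meant for quick checks.
    """
    if not (unit.startswith("^") and unit.endswith("$")):
        raise ValueError(f"not a lexical unit: {unit!r}")
    surface, *analyses = unit[1:-1].split("/")
    return surface, analyses


surface, analyses = split_lexical_unit("^yekulungwa/kulungwa/kulungwa/kulungwa$")
print(surface, len(analyses))
# Count repeated analyses to spot duplicates.
print(Counter(analyses))
```

Comparing the analyses with their tags intact shows whether the extra reading is a true duplicate or a distinct analysis.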

I cannot figure out where the extra output is coming from!

Can anyone see what I am doing wrong?

Zanga.




Re: [Apertium-stuff] Error in translation English-Italian

2023-02-24 Thread Hèctor Alòs i Font
Thank you, Silvia, for reporting the problem, and forgive my outburst.
In Apertium we focus mainly on developing machine translators for
low-resource languages (typically minority or minoritised languages). In the
case of Italian, we have released Sardinian-Italian, and I know that work is
also under way on Italian-Sicilian.
For translation between Italian, English, French, Russian, etc., there are
other systems that do it much better. If there is any English-Italian
development in Apertium, it is most likely only because someone was learning
how the translation engine works.
Best regards,
Hèctor



[Apertium-stuff] R: Re: Error in translation English-Italian

2023-02-24 Thread s_lombardini--- via Apertium-stuff
Thank you for answering and explaining some of the problems. I am just an
ordinary user trying to help!

Silvia Lombardini



[Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Khushi -
-- Forwarded message -
From: Khushi - <12khushi...@gmail.com>
Date: Fri, 24 Feb 2023 at 14:24
Subject: Re : [Apertium-stuff] GSOC 2023
To: 


Hello!

This is Khushi Harsure, an undergraduate student from India pursuing
Computer Science. I'd like to participate in Google Summer of Code 2023 at
Apertium. The project on adding a new language pair has caught my interest
and, being a native speaker, I am planning to work on adding the
Hindi-Marathi pair. Hindi-English and English-Marathi pairs have been added
by past GSoC participants, but the Hindi-Marathi pair has not been worked on.
Before starting, I wanted to confirm whether this would be a potential GSoC
project.

I would also like to know what steps to follow after installing Apertium,
other than completing the coding challenge. Looking forward to hearing from
you.

Regards,
Khushi Harsure





Re: [Apertium-stuff] Error in translation English-Italian

2023-02-24 Thread Hèctor Alòs i Font
I'm a bit surprised that Apertium's eng-ita is used anywhere, since it is
not released. But I am even more surprised that Wikimedia uses an
unreleased pair while it does not use even some of the released ones. In
fact, I have been asking Wikimedia for several years, without any success,
to incorporate the new language pairs and the new versions we have released.
This is very disappointing, for example, for the Arpitans, who could
generate articles for their Wikipedia much more easily, but cannot because
Wikimedia has not made available the translator that we have had for three
years. The same goes for Sardinian and, I think, for Silesian. If someone
could fix this once and for all, several linguistic minorities would be very
grateful.

As for eng-ita, I'm afraid that correcting the bug in Apertium might not
solve the problem, as Wikimedia does not seem to be updating its Apertium
translators. In fact, it is even possible that the error was corrected in
Apertium some time ago and the fix has simply not reached Wikimedia.

Hèctor

On Fri, 24 Feb 2023 at 10:23, s_lombardini--- via Apertium-stuff
<apertium-stuff@lists.sourceforge.net> wrote:

> I use Metawiki as a translator. Often I find that the English word 'for'
> is translated by Apertium with 'partorisca'. This Italian word is a verb
> meaning 'to give birth'. The correct translation is 'per'. Is it possible
> to fix it?
>
> Silvia Lombardini


Re: [Apertium-stuff] GSOC2023

2023-02-24 Thread Kevin Brubeck Unhammer
> I'd like to participate in Google Summer of Code 2023 at Apertium.
> In particular, I'm interested in adding a new language pair, and I am
> thinking of adding Japanese-English as I speak Japanese. I previously
> took an online summer school on natural language processing at Tokyo
> University.
> Could you tell me more about the project?

Hi,

Getting some support for Japanese would be great! I'm not sure if you
saw the whole IRC discussion, but what we really need in that regard is
support for the *tokenisation* step, where our regular methods[1] fail
us, since the text might have no spaces and lots of
tokenisation ambiguity. There has been some prior work[2] and it's
already listed as a potential GSoC project[3].

Support for anything-Japanese depends on tokenisation. It's also a big
enough job that it would qualify as a full GSoC project, so if you were
hoping for jpn-eng in a summer you will be disappointed (but having a
toy language pair to test with would help!). On the other hand, if we
get good spaceless tokenisation we open up the possibility for not just
Japanese, but Thai, Lao, Chinese etc. – and of course all those writing
systems used before the invention of the space character :)

regards,
Kevin

[1] https://wiki.apertium.org/wiki/LRLM
[2] http://hdl.handle.net/10066/20002
[3] https://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Tokenisation_for_spaceless_orthographies
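
To make the tokenisation problem concrete, here is a minimal Python sketch of greedy left-to-right longest-match segmentation over a toy lexicon. The lexicon, the function name, and the Latin-letter input are invented for illustration. This is not Apertium's implementation, only the general idea, and it shows how a greedy choice can go wrong when the text has no spaces.

```python
def longest_match_segment(text: str, lexicon: set[str]) -> list[str]:
    """Greedily segment text by taking the longest lexicon entry at each position.

    Characters that start no known entry are emitted as single-character tokens.
    """
    max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until something matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in lexicon or size == 1:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy lexicon in which two segmentations of the same string compete.
toy_lexicon = {"the", "there", "rein"}
print(longest_match_segment("therein", toy_lexicon))
```

With this lexicon the greedy pass commits to "there" and is left with characters it cannot match, even though "the" followed by "rein" would cover the whole string. Handling that kind of ambiguity is the hard part for spaceless scripts.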

