Re: [Apertium-stuff] Question sus la desambigüizacion dins Apertium

2024-06-19 Thread Aure Séguier

Thanks a lot !

How can I define src_lengadocian as the variable that means the source 
language is lengadocian ?





   AureSÉGUIER

Responsabla del pòle informatic

Congrès permanent de la lenga occitana







mobilePhone

+33 (0)5 32 00 00 64 
website

www.locongres.org 
address

La Ciutat - Creem! , 5-7 rue de la Fontaine, 64000 Pau






facebook 

twitter    

	linkedin 
 	


instagram   



Le 19/06/2024 à 10:53, Kevin Brubeck Unhammer a écrit :

Occitan can manage variety in its metadix file. My question is, is
there a way to manage variety in the .rlx file ?

There is :)


For instance, we have the word "bad", "evil" which is "mal" in
lengadocian and "mau" en gascon. But "mau" can also be a conjugated
verb (a pretty rare one). I did this rule in the RLX file : REMOVE V
IF (0 (""i));
But I would want this rule not to apply to lengadocian, where "mau"
can only be a conjugated verb.
Is that possible ? If not, is this something easy to implement ?

Yes. You could for example say that "src_lengadocian" is the variable
that signifies that the source language is lengadocian, and then have
one rule that picks the verb if source language is lengadocian:

 SELECT V IF (0 (""i))(0 (VAR:src_lengadocian)) ;

and one that removes it if not:

 REMOVE V IF (0 (""i)) (NEGATE 0 (VAR:src_lengadocian)) ;


I can't say for certain if this system makes things simpler or not for
you compared to metadix, but it allows for a lot more flexibility, with
much shorter compile times (since we have just one compiled FST which
contains all the variety).



___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Question sus la desambigüizacion dins Apertium

2024-06-18 Thread Aure Séguier

Hi

I read the documentation you mentioned but I didn't understood very well.

Occitan can manage variety in its metadix file. My question is, is there 
a way to manage variety in the .rlx file ?


For instance, we have the word "bad", "evil" which is "mal" in 
lengadocian and "mau" en gascon. But "mau" can also be a conjugated verb 
(a pretty rare one). I did this rule in the RLX file : REMOVE V IF (0 
(""i));
But I would want this rule not to apply to lengadocian, where "mau" can 
only be a conjugated verb.

Is that possible ? If not, is this something easy to implement ?

I need to have an answer to this question in order to know if it would 
be interesting to change the way we manage variety or not.


Thanks




   AureSÉGUIER

Responsabla del pòle informatic

Congrès permanent de la lenga occitana







mobilePhone

+33 (0)5 32 00 00 64 
website

www.locongres.org 
address

La Ciutat - Creem! , 5-7 rue de la Fontaine, 64000 Pau






facebook <https://www.facebook.com/lo.congres>

twitter <https://twitter.com/locongres>   

	linkedin 
<https://www.linkedin.com/company/congres-permanent-de-la-lenga-occitane/> 	


instagram <https://www.instagram.com/locongres/>  



Le 14/06/2024 à 13:09, Tino Didriksen a écrit :

G'day,

Questions like these should really go to the whole mailing list, so 
I've added it.


The pipe can handle language variations in a few ways.

There is the FST variant, to handle different scripts (e.g. Latin vs. 
Cyrillic) and false friends, which apertium-oci-fra uses for the 
_gascon mode. More recently, there is the preferences system, to 
handle semantic or preferential differences.


Both are documented at 
https://wiki.apertium.org/wiki/Dialectal_or_standard_variation - and 
the mailing list and IRC can answer further questions.


-- Tino Didriksen


On Tue, 4 Jun 2024 at 17:44, Aure Séguier  wrote:

Adiu

Soi Aure Séguier. Contribuissi a l'Apertium occitan dins
l'encastre de mon trabalh al Congrès permanent de la lenga occitana.

Coma sèm a soscar a i ajustar d'autras varietats (primièr
enriquesir l'occitan aranés, mas mai tard ajustar tanben lo
lemosin e lo provençal), sèm a soscar a la gestion de la varietat
de faiçon mai larga. Dins aquel encastre, ai una question rapòrt a
l'analisi morfosintaxica (Hectòr Alòs me diguèt qu'èras la persona
a la quala demandar).

Es possible de far de règlas de desambiguïzacion especificas a una
varietat ? Per exemple, en gascon, avèm los enonciatius ("que",
"ne", etc.) qu'existisson pas dins las autras varietats. Se
cambiam lo sistèma de gestion de las varietats, serà benlèu pas
pus possible d'indicar dins lo monodix que "que" (enonciatiu)
existís sonque en gascon. Riscarà d'èstre reconegut en lengadocian
e de faussar la traduccion. I a tanben d'autres cases especifics
("de" partitiu que se ditz quasi pas jamai en gascon, mas totjorn
en lengadocian...).

Se es pas possible de far de règlas especificas a una varietat, es
quicòm que se pòt pensar per l'avenidor ? Se òc, amb quala carga
de trabalh e qualas competéncias ?

Mercés

-- 
	



AureSÉGUIER

Responsabla del pòle informatic

Congrès permanent de la lenga occitana





mobilePhone

+33 (0)5 32 00 00 64 
website

www.locongres.org 
address

La Ciutat - Creem! , 5-7 rue de la Fontaine, 64000 Pau






facebook <https://www.facebook.com/lo.congres>
twitter <https://twitter.com/locongres>   
linkedin
<https://www.linkedin.com/company/congres-permanent-de-la-lenga-occitane/>

instagram <https://www.instagram.com/locongres/>  


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Can't connect to Apertium IRC

2023-07-03 Thread Aure Séguier

I had already done this. It did'nt solve the problem.

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org <mailto:a.segu...@locongres.org>
www.locongres.org <http://www.locongres.org>
Le 03/07/2023 à 15:07, Ahmed Siam a écrit :

Probably you didn't specify apertium channel when you logged in.
use this link when you log in: https://webchat.oftc.net/?channels=apertium

On Mon, Jul 3, 2023 at 3:27 PM Aure Séguier  
wrote:


Hi

I don't have the #apertium button, neither on Firefox nor on
Chrome. I'm on Ubuntu.

Thanks

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org
www.locongres.org <http://www.locongres.org>
Le 01/07/2023 à 02:49, Daniel Swanson a écrit :

Hi Aure,

image.png

I just checked, and if I have "Status" selected, as in this
picture, I get the behavior you described, but if I then click on
the "#apertium" button, everything works fine.

    Daniel

On Fri, Jun 30, 2023 at 8:16 PM Aure Séguier
 wrote:

Hi

Since Apertium IRC moved to ORTC, I have been unable to
connect. With a browser, at the url
http://webchat.oftc.net/?channels=apertium, when I try to
type something and send it, I have an error message saying
"Can't use this command in this window".

And I don't understand how to connect with a client. I tried
each one listed on the https://wiki.apertium.org/wiki/IRC but
I could'nt figure out how to make them work.
For instance, with HexChat, I don't know which network I must
choose (I tried some, but they didn't work).
With xchat and smuxi, I could'nt install it, the "make" file
seems to miss in the tar file I downloaded.
With weechat, when I type "/join #apertium", I have an error
message saying "irc : la commande "join" doit être exécutée
sur un tampon irc (serveur, canal ou privé)"

Is there a simple way to join Apertium IRC ? If it is, could
you explain me how to do so ?

Thanks a lot

-- 
Aura Séguier, responsabla de projèctes e desvolopaira

Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org
www.locongres.org <http://www.locongres.org>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Can't connect to Apertium IRC

2023-07-03 Thread Aure Séguier

Hi

I don't have the #apertium button, neither on Firefox nor on Chrome. I'm 
on Ubuntu.


Thanks

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org <mailto:a.segu...@locongres.org>
www.locongres.org <http://www.locongres.org>
Le 01/07/2023 à 02:49, Daniel Swanson a écrit :

Hi Aure,

image.png

I just checked, and if I have "Status" selected, as in this picture, I 
get the behavior you described, but if I then click on the "#apertium" 
button, everything works fine.


Daniel

On Fri, Jun 30, 2023 at 8:16 PM Aure Séguier  
wrote:


Hi

Since Apertium IRC moved to ORTC, I have been unable to connect.
With a browser, at the url
http://webchat.oftc.net/?channels=apertium, when I try to type
something and send it, I have an error message saying "Can't use
this command in this window".

And I don't understand how to connect with a client. I tried each
one listed on the https://wiki.apertium.org/wiki/IRC but I
could'nt figure out how to make them work.
For instance, with HexChat, I don't know which network I must
choose (I tried some, but they didn't work).
With xchat and smuxi, I could'nt install it, the "make" file seems
to miss in the tar file I downloaded.
With weechat, when I type "/join #apertium", I have an error
message saying "irc : la commande "join" doit être exécutée sur un
tampon irc (serveur, canal ou privé)"

Is there a simple way to join Apertium IRC ? If it is, could you
explain me how to do so ?

Thanks a lot

-- 
Aura Séguier, responsabla de projèctes e desvolopaira

Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org
www.locongres.org <http://www.locongres.org>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


[Apertium-stuff] Can't connect to Apertium IRC

2023-06-30 Thread Aure Séguier

Hi

Since Apertium IRC moved to ORTC, I have been unable to connect. With a 
browser, at the url http://webchat.oftc.net/?channels=apertium, when I 
try to type something and send it, I have an error message saying "Can't 
use this command in this window".


And I don't understand how to connect with a client. I tried each one 
listed on the https://wiki.apertium.org/wiki/IRC but I could'nt figure 
out how to make them work.
For instance, with HexChat, I don't know which network I must choose (I 
tried some, but they didn't work).
With xchat and smuxi, I could'nt install it, the "make" file seems to 
miss in the tar file I downloaded.
With weechat, when I type "/join #apertium", I have an error message 
saying "irc : la commande "join" doit être exécutée sur un tampon irc 
(serveur, canal ou privé)"


Is there a simple way to join Apertium IRC ? If it is, could you explain 
me how to do so ?


Thanks a lot

--
Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org 
www.locongres.org ___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Changes to apertium-preprocess-transfer

2023-06-27 Thread Aure Séguier

Oh yes, we will ! Thanks a lot, it was a real problem !

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org 
www.locongres.org 
Le 27/06/2023 à 01:19, Xavi Ivars a écrit :

That's awesome!

Thanks Daniel!

I'm sure Hèctor (and everyone else working on oci-fra) will specially 
appreciate it 😊


Missatge de Daniel Swanson  del dia dl., 
26 de juny 2023 a les 23:01:


Greetings Apertiumers!

I recently identified a way that apertium-preprocess-transfer was
being rather inefficient and today I fixed it, so tomorrow you all
should be able to update to apertium 3.9.4 and see some improved
compile times for any pairs not using apertium-recursive, with
speedups between 10x and 7000x faster on the files I tested.

I'm writing this email to let you know that in the process
apertium-preprocess-transfer lost the ability to report partial
overlaps like the following:

Warning at line 6867, column 4: Paths to rule 27 blocked by rule 24.

And I just wanted to let you all know, in case someone was depending
on those. To compensate, I added a check to apertium-lint which can
report roughly the same information:

Warning (overlapping-paths) on line 6852: The sequence [preadv
vblex.pp n.*] matches both this rule and the rule on line 6628.

Daniel, who is trying to get better at not doing things that
potentially break people's workflows without telling them


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



--
< Xavi Ivars >
< http://xavi.ivars.me >


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] GSoC 2023 Mentors & Ideas?

2023-01-31 Thread Aure Séguier

Hi,

Finally, I won't be able to be a mentor this year, because of health 
problems. I will try to think of another way to add these two occitan 
varieties to Apertium.


Regards,

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org <mailto:a.segu...@locongres.org>
www.locongres.org <http://www.locongres.org>
Le 27/01/2023 à 12:45, Aure Séguier a écrit :


Hi

I'm desperately trying to join Apertium IRC to talk about this but I 
can't, so I will use this mailing list instead.


We would like to propose subjects for GSoC related to occitan 
language. We have 2 ideas :
- Convert the occitan-french pair to recursive transfert. I think 
Hector Alos would agree to be the mentor on this one.
- Add two occitan varieties (provençal and limousin) to the 
occitan-french pair. I could mentor this one, and Lo Congrès would 
provide lexicons (for conjugations and translations) to pour in 
Apertium monodix and bidix.


I don't know how GSoC works. Can a language submit two subjects ? If 
not, could we apply this year for the recursive transfert and next 
year for the new varieties ?


How does it work to search for students with the required abilities ?

Thanks

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org <mailto:a.segu...@locongres.org>
www.locongres.org <http://www.locongres.org>
Le 25/01/2023 à 17:12, Xavi Ivars a écrit :

Me too, I'll mentor.

I think we need to do some cleanup on the ideas. TBH, I don't know 
where we are in some areas like website development, or Python API, 
and if the work that's remaining is worth a GSoC project


Missatge de Kevin Brubeck Unhammer  del dia dt., 
24 de gen. 2023 a les 14:37:


I'll mentor :)

--
Kevin Brubeck Unhammer

> GSoC 2023 org application is open, but do we have mentors for
this year?
> Please report in if you want to mentor.
>
> And as every year, please review
> https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code -
> add/remove/amend ideas.
>
> -- Tino Didriksen
>
> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



--
< Xavi Ivars >
< http://xavi.ivars.me >


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] GSoC 2023 Mentors & Ideas?

2023-01-27 Thread Aure Séguier

Hi

I'm desperately trying to join Apertium IRC to talk about this but I 
can't, so I will use this mailing list instead.


We would like to propose subjects for GSoC related to occitan language. 
We have 2 ideas :
- Convert the occitan-french pair to recursive transfert. I think Hector 
Alos would agree to be the mentor on this one.
- Add two occitan varieties (provençal and limousin) to the 
occitan-french pair. I could mentor this one, and Lo Congrès would 
provide lexicons (for conjugations and translations) to pour in Apertium 
monodix and bidix.


I don't know how GSoC works. Can a language submit two subjects ? If 
not, could we apply this year for the recursive transfert and next year 
for the new varieties ?


How does it work to search for students with the required abilities ?

Thanks

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org 
www.locongres.org 
Le 25/01/2023 à 17:12, Xavi Ivars a écrit :

Me too, I'll mentor.

I think we need to do some cleanup on the ideas. TBH, I don't know 
where we are in some areas like website development, or Python API, 
and if the work that's remaining is worth a GSoC project


Missatge de Kevin Brubeck Unhammer  del dia dt., 24 
de gen. 2023 a les 14:37:


I'll mentor :)

--
Kevin Brubeck Unhammer

> GSoC 2023 org application is open, but do we have mentors for
this year?
> Please report in if you want to mentor.
>
> And as every year, please review
> https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code -
> add/remove/amend ideas.
>
> -- Tino Didriksen
>
> ___
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



--
< Xavi Ivars >
< http://xavi.ivars.me >


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] New Occitan-French release

2022-11-04 Thread Aure Séguier

Hi,

I can help to make rules to know if there are enunciatives in a text or no.

About recognising which variety of occitan we are translating, we are 
currently developping a tool that can differentiate every dialect of 
occitan, but it isn't very efficient. Between Gascon and other dialects, 
it's OK, because Gascon is so different (except for Aranese, which is a 
subdialect of Gascon). But between Languedocien and other varieties 
(Provençal, Limousin...) there are many confusions.


At first, I was thinking about adding a "all dialects" dialect for the 
oc->fr direction. It would be useful when people don't know the dialect 
of a text, or for texts with many dialects (e.g. a website with articles 
in many dialects, like newspapers). Is that something that was already 
done for another language ? Is it something that could be easily done ?


Thanks

Aura Séguier, responsabla de projèctes e desvolopaira
Lo Congrès permanent de la lenga occitana
Ciutat - Creem !, 5-7 rue de la Fontaine, 64000 Pau
T. +33 (0)5 32 00 00 64
a.segu...@locongres.org 
www.locongres.org 
Le 04/11/2022 à 09:30, Kevin Brubeck Unhammer a écrit :

What if you do

lt-proc oci.automorf.bin | cg-proc enondetect.rlx.bin | cg-proc oci.rlx.bin | …

The first CG step would output a stream variable, so that what the next
step sees is

[]
^que/que/que$
[more text here]

If the next step is CG, it's just

  REMOVE:var-is-set (enon) IF (0 (VAR:non-enon)) ;

ie. remove enunciatives whenever the var is set.

One can also unset it in the middle of the stream (if doing corpus
runs), so output of the enon-detector is

[]
^que/que/que$
[more text here]
[]
^que/que/que$
[more text here]

and the REMOVE:var-is-set rule will remove enunciatives in the first
part, not after seeing the REMVARIABLE.


Then the problem of looking several windows ahead is restricted to that
first enon-detector step.




Alternatively, if we assume all the input is of the same language, we
just don't know what language it is ahead of time, then you could
do several passes, where one is a detector pipeline like

lt-proc oci.automorf.bin | cg-proc enondetect.rlx.bin

that outputs the STREAMCMD and then Apy would grep for that, and insert
the STREAMCMD at the start of the call to the regular pipeline

lt-proc oci.automorf.bin | cg-proc oci.rlx.bin | …

That won't automatically work in modes files, and won't work for corpus
tests if the corpus has a mix, but OTOH you could use 'export
AP_SETVAR=non-enon' to force the regular pipeline to insert the
STREAMCMD at the start.



___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Need help to decide language pairs for examples of markup handling

2020-09-09 Thread Aure Séguier

Hello

I don't know about its maturity in Apertium, but I know that basque 
language is a lot syntactically different from spanish and french (I 
think spa-eus pair is more mature). Maybe it could be a good example.


Regards,

Aure Séguier, desvolopaira
Lo Congrès permanent de la lenga occitana
Castèth d'Este/Château d'Este Av. de la Pléiade 64140 Billère/Vilhèra
T. +33 (0)5 59 13 06 40 - Fax : +33 (0)5 59 13 06 44
a.segu...@locongres.org <mailto:a.segu...@locongres.org>
www.locongres.org <http://www.locongres.org>


Le 09/09/2020 à 11:33, Tanmai Khanna a écrit :

Hey guys,
I'm writing a system demonstration to be submitted at LowResMT 2020 
about the recent project that was done as part of GSoC, titled 
"Translating the internet into low resource languages with Apertium" 
(Accepting snazzier title suggestions).


As part of this demonstration, I want to show some real world examples 
of how the new system of markup handling will help the translation of 
webpages and formatted documents - odt, pptx, rtx, etc. To show this 
effectively, I need to choose 3-4 released language pairs that are 
sufficiently syntactically divergent that they show the effect of 
markup reordering in the translation output. As far as I know, spa-cat 
is one of our most mature pairs, however I'm not sure how 
syntactically divergent it is. If it is, then I'm happy to be 
corrected. If your language pair has had issues with webpage 
translation and those issues are now solved (ish), then some examples 
would be really helpful.


TLDR: I need suggestions of language pairs which are mature, low 
resource (at least the target language), and which are syntactically 
divergent enough to see the benefits of markup handling in the 
translation. If you can provide examples, that'll be great as well. 
Any help will be sincerely appreciated :))


Thanks and Regards,
*तन्मय खन्ना *
*Tanmai Khanna*


___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] We need more evaluators

2020-08-31 Thread Aure Séguier

Hi,

I can do for occitan-french.

Regards,

Aure Séguier, desvolopaira
Lo Congrès permanent de la lenga occitana
Castèth d'Este/Château d'Este Av. de la Pléiade 64140 Billère/Vilhèra
T. +33 (0)5 59 13 06 40 - Fax : +33 (0)5 59 13 06 44
a.segu...@locongres.org <mailto:a.segu...@locongres.org>
www.locongres.org <http://www.locongres.org>


Le 22/08/2020 à 07:03, Hèctor Alòs i Font a écrit :
According to my experience, no, it is impossible. The quality of the 
test would be awful.


Missatge de Samuel Sloniker <mailto:scoopgra...@gmail.com>> del dia ds., 22 d’ag. 2020 a les 7:29:


Could this be done using a dictionary without knowing one of the
languages?

On Fri, Aug 21, 2020, 02:24 Mikel L. Forcada mailto:m...@dlsi.ua.es>> wrote:

Dear Daniel:

Thanks a million for your message.

please contact Shashwat Goel

<mailto:shashwat.g...@research.iiit.ac.in> so that he can send
you the evaluation files.

As to contributing to the eng-fra language pair, you would have to

(a) learn how to install Apertium as needed for language pair
development

(https://wiki.apertium.org/wiki/Installation#For_those_who_want_to_install_Apertium_locally.2C_and_developers).

(b) learn how dictionary entries look like (a lot can be learn
by imitating existing entries), make them, and try them locally.

(c) get access to GitHub (and learn to use git, etc.)

(d) submit pull requests with your changes

You can get a lot of guidance from fellow developers at our
IRC channel #apertium at irc.freenode.net
<http://irc.freenode.net>: https://wiki.apertium.org/wiki/IRC

Hope this helped!

Mikel

El 20/8/20 a les 19:09, Daniel Lamontagne ha escrit:

Hi,
I know very well French and English-but I am a French
native-so French-to-English would be great.
I'd like to help get the French-English going a little better
by getting it out of the 'incubator' area.
Something not too complicated and good instructions on how to
help.
Thanks.
P.S. I already am doing one for myself-database in csv file
and Python

Keep on the good work.
Daniel Lamontagne-KesKiDit


On Thu, Aug 20, 2020 at 7:54 AM Xavi Ivars
mailto:xavi.iv...@gmail.com>> wrote:

Hey Mikel,

You can add me to en-ca. I don't know too much french,
but I guess I could also help there

Missatge de Mikel L. Forcada mailto:m...@dlsi.ua.es>> del dia dj., 20 d’ag. 2020 a les
9:39:

Dear Apertiumers:

About 10 days ago I wrote a message asking for
apertiumers to volunteer in evaluating sets of 150
bilingual dictionary entries. We still need more
evaluators.

We need two more evaluators for Esperanto–English,
two more for French–Catalan and one for Occitan–French.

Please contact our GSoC student Shashwat Goel

<mailto:shashwat.g...@research.iiit.ac.in> if you can
help.

Thanks a million!

All the best

Mikel Forcada


[New] En - Es - Assigned to Hector, Mikel, Jorge.
More people not required
[New] En - Ca - Assigned to Hector, Mikel. One more
person would be helpful.
Eo - En - Done by Hector. Two more people would be
helpful.
Fr - Ca - Done by Hector. Two more people would be
helpful.
Oc - Fr - Done by Serge. Assigned to Gisele. One more
person would be helpful.

-- 
Mikel L. Forcadahttp://www.dlsi.ua.es/~mlf/

Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03690 Sant Vicent del Raspeig
Spain
Office: +34 96 590 9776

___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
<mailto:Apertium-stuff@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



-- 
< Xavi Ivars >

< http://xavi.ivars.me >
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
<mailto:Apertium-stuff@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/apertium-stuff



___
Ape