Re: [Apertium-stuff] Apertium-eng generation errors

2018-01-09 Thread Marc Riera Irigoyen
Thanks Fran, that makes a lot of sense. I will update some entries that do
not follow that style and should be using it.

However, what should I do with proper nouns such as "MacDonald/Macdonald"?
Both forms are used and depend on personal preferences. Adding both as
separate entries in apertium-eng and the bidix seems to work, but they pop
up during testvoc because there is no way to tell one from the other when
used in caps "MACDONALD", and Apertium does the Arctic/Arctic thing.
Keeping only "Macdonald" in apertium-eng and both entries in the bidix
fixes the all-caps translations, but "MacDonald" becomes "Macdonald" right
at the end of the pipeline: the capital D is lost during generation in
apertium-eng.
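
To illustrate the all-caps ambiguity, here is a minimal Python sketch. It is
not Apertium code and does not reproduce how lttoolbox handles case; it only
assumes that two lemmas differing in case collapse onto the same string once
case is ignored. The function name find_case_collisions and the lemma list
are hypothetical.

```python
from collections import defaultdict


def find_case_collisions(lemmas):
    """Group lemmas that become identical once case is ignored."""
    groups = defaultdict(list)
    for lemma in lemmas:
        # casefold() gives a case-insensitive key for comparison.
        groups[lemma.casefold()].append(lemma)
    # Keep only keys shared by more than one distinct spelling.
    return {key: forms for key, forms in groups.items() if len(set(forms)) > 1}


# Hypothetical sample of proper-noun lemmas, not taken from apertium-eng.
lemmas = ["MacDonald", "Macdonald", "Arctic", "London"]
collisions = find_case_collisions(lemmas)
print(collisions)
```

A check along these lines could list the pairs that cannot be told apart from
an all-caps surface form such as "MACDONALD" before running testvoc.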

Marc

2018-01-09 9:14 GMT+01:00 Francis Tyers:

> On 2018-01-08 11:31, Marc Riera Irigoyen wrote:
>
>> Hello all,
>>
>> I've noticed some weird behaviour when generating English output in
>> the English-Catalan pair. When translating "àrtic" to English, for
>> example, the adjective "Arctic" is found, but right at the end of the
>> pipeline, in the English generator, "Arctic" becomes
>> Arctic/Arctic and is sent as is in the output text.
>>
>> The entry in the English monodix is "arctic" without caps, but this was
>> the case in the old en-ca pair and it worked. Moreover, I've altered
>> the bidix to simulate the same situation when generating to Catalan
>> and it works as expected, so it looks like it's specific to English.
>>
>> Anyone who knows what's going on here?
>>
>>
>   rctic n="expensive__adj"/>
>   arctic
>
> two entries. My proposal is to keep the first one and delete the second
> one.
>
> For adjectives/nouns that must always be written in uppercase, the first
> style of entry should be used. This avoids generation issues when case
> changes in transfer cause an open-class word to come out in the wrong
> orthographic case.
>
> Fran
>



-- 

*Marc Riera Irigoyen*
Freelance Translator EN/JA>CA/ES

(+34) 652 492 008
--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
___
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


Re: [Apertium-stuff] Apertium-eng generation errors

2018-01-09 Thread Francis Tyers

On 2018-01-08 11:31, Marc Riera Irigoyen wrote:

Hello all,

I've noticed some weird behaviour when generating English output in
the English-Catalan pair. When translating "àrtic" to English, for
example, the adjective "Arctic" is found, but right at the end of the
pipeline, in the English generator, "Arctic" becomes
Arctic/Arctic and is sent as is in the output text.

The entry in the English monodix is "arctic" without caps, but this was
the case in the old en-ca pair and it worked. Moreover, I've altered
the bidix to simulate the same situation when generating to Catalan
and it works as expected, so it looks like it's specific to English.

Anyone who knows what's going on here?



  rcticn="expensive__adj"/>

  arctic

two entries. My proposal is to keep the first one and delete the second 
one.


For adjectives/nouns that must always be written in uppercase, the first
style of entry should be used. This avoids generation issues when case
changes in transfer cause an open-class word to come out in the wrong
orthographic case.


Fran
