Thanks for the answers.

On Tue, 17 Sep 2019 at 22:11, Jonathan Washington <jonathan.n.washing...@gmail.com> wrote:
> Jaume, are you planning on using this for translation or something else?
> If for translation, how do you anticipate it improving translation quality?

These prefixes will be used for translating spa-cat, and they could also be used for other Romance language pairs. Hèctor Alòs is interested in it.

I have tried the first option proposed by Kevin with just adjectives and some prefixes in Spanish:

    <pardef n="adj_prefixes">
      <e><i>anti</i></e>
      <e><i>pro</i></e>
      <e><i>post</i></e>
      <e r="LR"><p><l>pos</l><r>post</r></p></e>
      <e><i>pre</i></e>
      <e><i/></e>
    </pardef>

    <pardef n="adj_prefixes_r">
      <e><p><l>antir</l><r>anti</r></p></e>
      <e><p><l>pror</l><r>pro</r></p></e>
      <e><p><l>post</l><r>post</r></p></e>
      <e><p><l>prer</l><r>pre</r></p></e>
      <e r="LR"><p><l>anti</l><r>anti</r></p></e>
      <e r="LR"><p><l>pro</l><r>pro</r></p></e>
      <e r="LR"><p><l>pos</l><r>post</r></p></e>
      <e r="LR"><p><l>pre</l><r>pre</r></p></e>
      <e><i/></e>
    </pardef>

In the Europarl corpus it finds around one new word (untranslated so far) every 5000 sentences. A few more prefixes can be added, and the same would be done with nouns and verbs.

We'll need to create metadix files so that the dictionaries don't become cluttered with the new tags. The metadix will also be useful for other things.

Some new words formed with prefixes can match existing words. All of these should be discarded beforehand:

    prefiero (verb) = pre + fiero (adj)
    presumo (verb) = pre + sumo (adj)
    prerrogativa (noun) = pre + (r)rogativa (adj)

I have tried adding a mark to the newly formed words and removing it with CG if necessary. It works fine.

    <e><p><l>pre</l><r>-prefix-pre</r></p></e>

    REMOVE:prefixes ("-prefix-.*"r) IF (0 ("-prefix-.*"r));

I think adding this feature is productive and worthwhile. What do you think (Hèctor, Marc, Xavi...)? Any suggestions to improve it?

Jaume
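
P.S. The check for prefixed forms that collide with existing words could be sketched roughly as below. This is only an illustration in Python, not part of Apertium: the function names are hypothetical, the tiny word lists stand in for the real dictionaries, and the r-doubling rule (double the r only after a prefix ending in a vowel) is my reading of the adj_prefixes_r paradigm above.

```python
# Hypothetical sketch: list prefix + adjective combinations whose surface
# form is already a word in the lexicon, so they can be discarded beforehand.

VOWELS = "aeiou"


def surface_form(prefix, stem):
    """Spell prefix + stem, doubling an initial r after a vowel-final prefix
    (assumption: pre + rogativa -> prerrogativa, post + rogativa -> postrogativa)."""
    if stem.startswith("r") and prefix[-1] in VOWELS:
        return prefix + "r" + stem
    return prefix + stem


def find_collisions(prefixes, stems, lexicon):
    """Return (surface form, prefix, stem) for every generated form
    that already exists in the lexicon."""
    collisions = []
    for prefix in prefixes:
        for stem in stems:
            form = surface_form(prefix, stem)
            if form in lexicon:
                collisions.append((form, prefix, stem))
    return collisions


# Toy data standing in for the real dictionaries.
prefixes = ["anti", "pro", "post", "pre"]
adjectives = ["fiero", "sumo", "rogativa", "europeo"]
lexicon = {"prefiero", "presumo", "prerrogativa", "casa"}

for form, prefix, stem in find_collisions(prefixes, adjectives, lexicon):
    print(f"{form} = {prefix} + {stem}")
```

With the real word lists, the output of such a check would be the set of combinations to exclude (or to leave to the CG rule) before the prefix paradigms are enabled.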
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff