On Fri, Jul 30, 2021 at 08:03:41AM -0500, Daniel Swanson wrote:
> > Another question is that in lot of expected files there seems to be
> > all-capsed words for fin-* pairs, I am not sure how this has happened?
> > I am guessing my apertium is older and some ICU changes have affected
> > the output, perhaps in some code I've copypasted ununderstandingly to
> > all fin-* pairs.
>
> For that example in particular (and likely others as well), the t1x
> output code says
>
> <chunk name="NP" case="caseFirstWord">
>
> which gives the chunk an all-caps lemma (caseFirstWord is a
> non-existent variable, so it has no effect) and then the default
> behavior of postchunk is to copy the chunk case to the words, so
>
> ^NP<N><FOOFOO>{^koti<n><sg><nom>$}$
>
> becomes
>
> ^KOTI<n><sg><nom>$^.<punct>$
>
> so I think this might be a case of inadvertently relying on a bug in
> the old transfer case-handling functions and I'm not quite sure what
> the appropriate solution is.

Hmm, so the @case here is copy-pasted as-is from the original template:
<https://github.com/apertium/apertium-init/blob/master/bilingual-module/apertium-%7B%7BlanguageCode1%7D%7D-%7B%7BlanguageCode2%7D%7D.%7B%7BlanguageCode1%7D%7D-%7B%7BlanguageCode2%7D%7D.t1x#L38>

I have never touched it or looked it up in the manual, and maybe others
have not either. Until now I didn't even know that chunk names had an
effect on the translation.

-- 
Regards, Flammie <https://flammie.github.io>
(Please note, that I will often include my replies inline instead of
top or bottom of the mail)
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff