mansur <6688...@gmail.com> čálii:

> Turned out disappears the last token in the meaning of Apertium, no matter
> it is a word or punctuation, just last part like ^./.<sent>$ or
> ^word/lemma<pos><tag1><tag2>$

Hm, yeah it seems the NUL needs to go after the `]' on each linebreak
(that's how apy does it). Something like a sed 's/^]/]\x00/' after
deformatting might work better.

I'm not sure how to avoid the final three NUL's at end-of-file, though
they're easy enough to postprocess out.

I'd still like to see a minimal test case where the regular pipeline
merges lines though, lt-proc and cg-proc really shouldn't do that
(unless you do things like REMCOHORT in CG).

Attachment: signature.asc
Description: PGP signature

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to