mansur <6688...@gmail.com> čálii:

> Can we untag the file using apertium
> to get the text close to the original?

Just strip the analysis?

The quick and hacky way would be something like this (untested):

    sed 's%/[^$]*[$]%%g' | tr -d '^'

You'll remove a bit too much if there were slashes and such in input,
but maybe it doesn't matter too much if you're just checking things?
If you need it to be correct, you can do it in python with
apertium-streamparser – it'll be slower, but you should be able to get
back to the exact input you had.

Attachment: signature.asc
Description: PGP signature

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to