Tanmai Khanna <khanna.tan...@gmail.com>
čálii:

> the analyser sees wordbound blanks as normal blanks,

So currently if I have the multiword "i dag", it'll recognize
"i<i>dag</i>" but it won't recognize "i <i>dag</i>"? (And I suppose if
I have the non-multiword "today" it won't recognize "to<i>day</i>".)

One possibility might be to have wordbound blanks match "space or
epsilon" in lt-proc – then it would recognize all of the above.

It seems OK that the wblank then applies to the whole LU, so what comes
out from translating "i<i>dag</i>" into English is "<i>today</i>".

Though maybe it would be hard to implement – does this happen a lot?

Attachment: signature.asc
Description: PGP signature

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to