Tanmai Khanna <khanna.tan...@gmail.com> čálii: > the analyser sees wordbound blanks as normal blanks,
So currently if I have the multiword "i dag", it'll recognize "i<i>dag</i>" but it won't recognize "i <i>dag</i>"? (And I suppose if I have the non-multiword "today" it won't recognize "to<i>day</i>".) One possibility might be to have wordbound blanks match "space or epsilon" in lt-proc – then it would recognize all of the above. It seems OK that the wblank then applies to the whole LU, so what comes out from translating "i<i>dag</i>" into English is "<i>today</i>". Though maybe it would be hard to implement – does this happen a lot?
signature.asc
Description: PGP signature
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff