Re: suggestions in Morfologik spelling rule

2013-07-16 Thread Marcin Miłkowski
W dniu 2013-07-16 00:03, Jaume Ortolà i Font pisze: 2013/7/15 Marcin Miłkowski list-addr...@wp.pl: Hi Jaume, W dniu 2013-07-15 21:16, Jaume Ortolà i Font pisze: Hi, Marcin. I have tested the current code (1.8.0-SNAPSHOT) and everything is OK, all the changes are there. Thank you. Great.

Re: suggestions in Morfologik spelling rule

2013-07-16 Thread R.J. Baars
Coding word frequencies as a character is fine. I think it would be classes, logarithmic as far as I am concerned. Ruud W dniu 2013-07-16 00:03, Jaume Ortolà i Font pisze: 2013/7/15 Marcin Miłkowski list-addr...@wp.pl: Hi Jaume, W dniu 2013-07-15 21:16, Jaume Ortolà i Font pisze: Hi,

Re: suggestions in Morfologik spelling rule

2013-07-16 Thread Ruud Baars
By the way, I could help with words frequencies for some langauges. e.g. Portuguese, Spanish, Dutch. Ruud On 16-07-13 14:20, R.J. Baars wrote: Coding word frequencies as a character is fine. I think it would be classes, logarithmic as far as I am concerned. Ruud W dniu 2013-07-16 00:03,

Sentence tokenizer and non-breaking spaces

2013-07-16 Thread Daniel Naber
Hi, a user on the forum reports[1] that sentence boundaries are not detected when the space after the dot is a non-breaking space. I wonder if this is desired behavior. Does anybody have an opinion on that? Regards Daniel