Re: [Languagetool] Release LanguageTool 1.8 in progress

2012-07-01 Thread Marcin Miłkowski
W dniu 2012-06-30 22:01, Daniel Naber pisze: On Samstag, 30. Juni 2012, Marcin Miłkowski wrote: it works for me under linux when I move jar href=hunspell-linux-i386.jar/ Works for me, too - please everybody test whether the Start LanguageTool link on http://www.languagetool.org works for

Re: [Languagetool] Breton speller (and general tokenization issue)

2012-07-01 Thread Ruud Baars
Marcin, For Dutch, tokenisation on - would be really wrong, since it is a real word character, sometime required, sometimes optional. Dutch has the phenomenon of 'klinkerbotsing' (sonant collision) when two single sonants get glued together, and can be mistaken for one of the two-charcter

Re: [Languagetool] Breton speller (and general tokenization issue)

2012-07-01 Thread Marcin Miłkowski
W dniu 2012-07-01 19:46, Ruud Baars pisze: Marcin, For Dutch, tokenisation on - would be really wrong, since it is a real word character, sometime required, sometimes optional. We have it, Ruud. This is a different matter, data is prepared for Breton in a different way than we expect it to

Re: [Languagetool] Release LanguageTool 1.8 in progress

2012-07-01 Thread Yakov Reztsov
I am found encoding problem  of filter text on page http://community.languagetool.org/corpusMatch/list?lang=ru Filter text  (all non-ascii symbol) are displayed as ? symbol. Same problem exist for this page too:  http://community.languagetool.org/corpusMatch/list?lang=uk  -- Yakov Reztsov

Re: [Languagetool] Release LanguageTool 1.8 in progress

2012-07-01 Thread Daniel Naber
On Sonntag, 1. Juli 2012, Yakov Reztsov wrote: I am found encoding problem of filter text on page http://community.languagetool.org/corpusMatch/list?lang=ru Filter text (all non-ascii symbol) are displayed as ? symbol. Thanks, it's fixed (similar for other non-Latin1 languages). Regards

Re: [Languagetool] Breton speller (and general tokenization issue)

2012-07-01 Thread Dominique Pellé
Marcin Miłkowski list-addr...@wp.pl wrote: Hi Dominique, and all, I have started conversion of the Breton speller to the MorfologikSpeller format, and thanks to the new support of UTF-8, it was successful. But there is one outstanding issue: namely, the speller does not recognize any words

Re: [Languagetool] Breton speller (and general tokenization issue)

2012-07-01 Thread Marcin Miłkowski
W dniu 2012-07-01 22:53, Dominique Pellé pisze: Marcin Miłkowski list-addr...@wp.pl wrote: Hi Dominique, and all, I have started conversion of the Breton speller to the MorfologikSpeller format, and thanks to the new support of UTF-8, it was successful. But there is one outstanding issue: