Re: Testing against Tatoeba

2013-11-03 Thread Dominique Pellé
Daniel Naber wrote: > On 2013-11-03 08:29, Dominique Pellé wrote: > >> I see that the character encoding is wrong again >> in the Breton page at: >> >> http://community.languagetool.org/corpusMatch/list?lang=br > > It looks okay to me. The only thing that looks broken ("é") is actually > in the

Re: Testing against Tatoeba

2013-11-03 Thread Daniel Naber
On 2013-11-03 08:29, Dominique Pellé wrote: > I see that the character encoding is wrong again > in the Breton page at: > > http://community.languagetool.org/corpusMatch/list?lang=br It looks okay to me. The only thing that looks broken ("é") is actually in the page like that. Regards Danie

Re: Testing against Tatoeba

2013-11-03 Thread Dominique Pellé
Daniel Naber wrote: > On 2013-10-27 18:30, Dominique Pellé wrote: > > > I think I remember that you set up a cron job for French > > and Breton. Perhaps the locale is not set properly when > > starting from cron, and the process to check Wikipedia > > and Tatoeba depends in the locale (ideally it

Re: Testing against Tatoeba

2013-10-29 Thread Daniel Naber
On 2013-10-29 10:25, Marcin Miłkowski wrote: > As far as I can see, spelling rules are activated, and they made the > diff quite large... They were turned off previously. Indeed. I have just fixed that. Regards Daniel -- http://www.danielnaber.de ---

Re: Testing against Tatoeba

2013-10-29 Thread Marcin Miłkowski
W dniu 2013-10-28 20:56, Daniel Naber pisze: > On 2013-10-23 22:10, Dominique Pellé wrote: > >> Thanks for that! Will sentences from Tatoeba be used in the >> daily diff? (http://www.languagetool.org/regression-tests/?C=M;O=D) > > I have just activated that, if it works we'll get a large diff ton

Re: Testing against Tatoeba

2013-10-28 Thread Daniel Naber
On 2013-10-23 22:10, Dominique Pellé wrote: > Thanks for that! Will sentences from Tatoeba be used in the > daily diff? (http://www.languagetool.org/regression-tests/?C=M;O=D) I have just activated that, if it works we'll get a large diff tonight that includes Tatoeba. Regards Daniel -- h

Re: Testing against Tatoeba

2013-10-28 Thread Daniel Naber
On 2013-10-26 12:32, Dominique Pellé wrote: > The title says "LanguageTool Rule Matches in Wikipedia (81)". > It shoulld be "LanguageTool Rule Matches in Tatoeba (81)" This is fixed now, it requires Transifex to catch up and then maintainers to update the translation though. > Also if we filter

Re: Testing against Tatoeba

2013-10-27 Thread Daniel Naber
On 2013-10-27 18:30, Dominique Pellé wrote: > I think I remember that you set up a cron job for French > and Breton. Perhaps the locale is not set properly when > starting from cron, and the process to check Wikipedia > and Tatoeba depends in the locale (ideally it should not). I've changed the s

Re: Testing against Tatoeba

2013-10-27 Thread Dominique Pellé
Daniel Naber wrote: > On 2013-10-26 10:40, Dominique Pellé wrote: > >> Something else is buggy. The character encoding is >> messed up for sentences coming from Tatoeba. > > Not sure how that happened (it affected only French and Breton), but > it's fixed now after running the checks again. It'

Re: Testing against Tatoeba

2013-10-26 Thread Dominique Pellé
Daniel Naber wrote: > On 2013-10-26 10:40, Dominique Pellé wrote: > >> Something else is buggy. The character encoding is >> messed up for sentences coming from Tatoeba. > > Not sure how that happened (it affected only French and Breton), but > it's fixed now after running the checks again. > > R

Re: Testing against Tatoeba

2013-10-26 Thread Daniel Naber
On 2013-10-26 10:40, Dominique Pellé wrote: > Something else is buggy. The character encoding is > messed up for sentences coming from Tatoeba. Not sure how that happened (it affected only French and Breton), but it's fixed now after running the checks again. Regards Daniel -- http://www.d

Re: Testing against Tatoeba

2013-10-26 Thread Dominique Pellé
Daniel Naber wrote: > On 2013-10-23 22:49, Dominique Pellé wrote: > >> But something looks wrong. Going to... >> http://community.languagetool.org/corpusMatch/list?lang=fr >> ... and clicking on the drop down menu to select which rule to >> filter, I only see 7 rules. Surely I should see more th

Re: Testing against Tatoeba

2013-10-23 Thread Daniel Naber
On 2013-10-23 22:49, Dominique Pellé wrote: > But something looks wrong. Going to... > http://community.languagetool.org/corpusMatch/list?lang=fr > ... and clicking on the drop down menu to select which rule to > filter, I only see 7 rules. Surely I should see more than that > given that a large

Re: Testing against Tatoeba

2013-10-23 Thread Dominique Pellé
On Wed, Oct 23, 2013 at 10:40 PM, Daniel Naber wrote: > On 2013-10-23 13:48, Daniel Naber wrote: > >> editor to protect against false alarms. I will later activate it for >> our >> Wikipedia check (which then isn't a pure Wikipedia check anymore...). > > Tatoeba matches are now also part of our Wi

Re: Testing against Tatoeba

2013-10-23 Thread Daniel Naber
On 2013-10-23 13:48, Daniel Naber wrote: > editor to protect against false alarms. I will later activate it for > our > Wikipedia check (which then isn't a pure Wikipedia check anymore...). Tatoeba matches are now also part of our Wikipedia matches: http://community.languagetool.org/corpusMatch/

Re: Testing against Tatoeba

2013-10-23 Thread Daniel Naber
On 2013-10-23 22:10, Dominique Pellé wrote: > Thanks for that! Will sentences from Tatoeba be used in the > daily diff? (http://www.languagetool.org/regression-tests/?C=M;O=D) Not yet, but it's on my TODO list. In unrelated news: I have changed the link to the bug tracker on our homepage to p

Re: Testing against Tatoeba

2013-10-23 Thread Dominique Pellé
Daniel Naber wrote: > Hi, > > http://tatoeba.org is a free (CC-BY) collection of sentences in a lot of > languages. It complements Wikipedia as it contains a different style of > sentences, for example ones with the personal pronouns I, you, and we, > which rarely occur in the Wikipedia. I have mod

Re: Testing against Tatoeba

2013-10-23 Thread Marco A.G.Pinto
Hello! Moments ago I have translated the new sentences at Transifex and I noticed the difference between "documents" and "sentences" there. Kind regards,    >Marco A.G.Pinto --- On 23/10/2013

Testing against Tatoeba

2013-10-23 Thread Daniel Naber
Hi, http://tatoeba.org is a free (CC-BY) collection of sentences in a lot of languages. It complements Wikipedia as it contains a different style of sentences, for example ones with the personal pronouns I, you, and we, which rarely occur in the Wikipedia. I have modified the data on our serve