Re: [Languagetool] Portuguese support

2012-06-11 Thread Jan Schreiber
Excellent. Welcome on board! Am 10.06.2012 18:44, schrieb Juan Martorell: > Hi All, > > We've added support for Portuguese language > in LanguageTool. With > a total of 236 million speakers, Portuguese is the 6th most spoken > language in the wor

[Languagetool] mailing list downtime

2012-06-11 Thread Daniel Naber
As you may have noticed, the mailing lists where down for 36 hours or so. It was a problem at sourceforge and they fixed it after I sent a bug report. Hopefully this won't happen again, but just in case I'd like to remind you that we have both a twitter account (http://twitter.com/languagetoolo

Re: [Languagetool] color for spelling errors

2012-06-11 Thread Daniel Naber
On Sonntag, 10. Juni 2012, Dominique Pellé wrote: > > I managed to confuse your page by having several typos and checking > > again and again - code from the response showed up in the Text area. > > I think that I just fixed that. Can you try again? > The script was not handling well rules for w

Re: [Languagetool] Hunspell tests for affix files

2012-06-11 Thread Jaume Ortolà i Font
Marcin, I talked to the developers of the Catalan Hunspell dictionary. The WORDCHARS line will be added to the .aff file. LibreOffice ignores this line. By default it takes middle dot and apostrophe characters as word characters but not hyphen character. So this is a limitation of LibreOffice. R

Re: [Languagetool] color for spelling errors

2012-06-11 Thread Daniel Naber
On Montag, 11. Juni 2012, Dominique Pellé wrote: > That won't be easy to use when > using LT from the command line. Line and column > number make more sense to users. I know, but we should probably have those robust offsets for the use cases where they can be used (like your web interface), ad

Re: [Languagetool] color for spelling errors

2012-06-11 Thread Daniel Naber
On Sonntag, 10. Juni 2012, Dominique Pellé wrote: > I think that the following patch fixes it. > But the code is a bit hairy so I'm not 100% sure > it's OK. I have not checked-in: I don't have time to look at it now, but if you can write a test case that only works with your change (and if no o

Re: [Languagetool] Idea to facilitate debugging disambiguation rules

2012-06-11 Thread Marcin Miłkowski
Yeah, not writing a JUnit test makes everything bad. I will test it - it seems fine when I debug the disambiguator, so it must break somewhere else. Regards, Marcin W dniu 2012-06-11 20:43, Dominique Pellé pisze: > Thanks Marcin, that will be very useful for debugging > disambiguation rules. >

Re: [Languagetool] Idea to facilitate debugging disambiguation rules

2012-06-11 Thread Dominique Pellé
Thanks Marcin, that will be very useful for debugging disambiguation rules. There is something which I do not understand though. Take this example with the French sentence "Les avions" (= The planes). Both words have 2 POS tags in the French dictionary: $ egrep "^(les|avions)\s" lexique-dicollect

Re: [Languagetool] color for spelling errors

2012-06-11 Thread Dominique Pellé
Daniel Naber wrote: > On Sonntag, 10. Juni 2012, Dominique Pellé wrote: > >> fromy="1" fromx="5" toy="2" tox="10" >> >> The value tox="10" is wrong. It should be 2. > > Of course we should fix this, but shouldn't we > also just specify the position as from/to positions, > ignoring lines/columns?

Re: [Languagetool] Idea to facilitate debugging disambiguation rules

2012-06-11 Thread Marcin Miłkowski
Now it also supports the multiword chunker. Regards - Marcin W dniu 2012-06-11 19:11, Marcin Miłkowski pisze: > OK, I implemented this today. Note: it works only for the rule-based > disambiguator, any other disambiguators need to add the annotation on > their own. > > Regards, > Marcin > > W dni

Re: [Languagetool] Idea to facilitate debugging disambiguation rules

2012-06-11 Thread Marcin Miłkowski
OK, I implemented this today. Note: it works only for the rule-based disambiguator, any other disambiguators need to add the annotation on their own. Regards, Marcin W dniu 2012-05-28 14:08, Marcin Miłkowski pisze: > That requires some additions to the AnalyzedToken and multiple other > places.

Re: [Languagetool] addition to the Hunspell rule

2012-06-11 Thread Marcin Miłkowski
Actually, this is a limitation of LibreOffice: in principle, we should be able both to say that there is no error in the sentence (effectively, discarding hunspell results from LO), and to return spelling errors to be marked in red. In principle, because there is only definition of these featur

[Languagetool] Portuguese support

2012-06-11 Thread Juan Martorell
Hi All, We've added support for Portuguese language in LanguageTool. With a total of 236 million speakers, Portuguese is the 6th most spoken language in the world, the 3rd most spoken language in the western hemisphere, and the most spoken language

Re: [Languagetool] color for spelling errors

2012-06-11 Thread Dominique Pellé
Daniel Naber wrote: > On Samstag, 9. Juni 2012, Dominique Pellé wrote: > > Hi Dominique, > >> It should highlight spelling mistakes in yellow and >> grammar mistakes in red.  You can hover the mouse >> on the mistakes to see more information about the error. >> >> Let me know if this can be usefu

Re: [Languagetool] addition to the Hunspell rule

2012-06-11 Thread Jaume Ortolà i Font
OK. I know. This will make sense in Libre/OpenOffice only when we use LanguageTool alone and the spelling dictionary is disabled. Jaume 2012/6/10 Jaume Ortolà i Font > There is the possibility that some words that are included in the tagger > dictionary (or are tagged in the disambiguation

[Languagetool] addition to the Hunspell rule

2012-06-11 Thread Jaume Ortolà i Font
There is the possibility that some words that are included in the tagger dictionary (or are tagged in the disambiguation file) are marked as errors by Hunspell, because they are missing in the Hunspell dictionary. In order to avoid it we could add a condition in the Hunspell Java rule: mark as an e

Re: [Languagetool] color for spelling errors

2012-06-11 Thread Daniel Naber
On Sonntag, 10. Juni 2012, Dominique Pellé wrote: > fromy="1" fromx="5" toy="2" tox="10" > > The value tox="10" is wrong. It should be 2. Of course we should fix this, but shouldn't we also just specify the position as from/to positions, ignoring lines/columns? That's more robust. Regards Dan