Re: Duplicate entries in compounds.txt in ru, nl

2014-10-07 Thread Yakov Reztsov
Hi, Mon, 6 Oct 2014 21:47:40 +0200 от Dominique Pellé: Hi I've noticed that the Russian and Dutch compounds.txt files contain duplicate entries. Either the dupes should be removed, or maybe some of the dupe were meant to be the plural form or some other flexions. Can the language maintainers

Re: Duplicate entries in compounds.txt in ru, nl

2014-10-07 Thread Dominique Pellé
Yakov Reztsov yakovr...@mail.ru wrote: Hi, Mon, 6 Oct 2014 21:47:40 +0200 от Dominique Pellé: Hi I've noticed that the Russian and Dutch compounds.txt files contain duplicate entries. Either the dupes should be removed, or maybe some of the dupe were meant to be the plural form or

RE: LT HTTP server could not be started on host localhost port 8081, but netstat shows that port 8081 is not used before LT starts

2014-10-07 Thread Mike Unwalla
Hi Daniel, The problem is corrected. As best I remember, many months ago, when I did the LT server test, LT server worked on the machine. Some observations: 1) Yesterday, I disabled the anti-virus/firewall software and LT server worked fine for a short time. Then the problem started again. 2)

Re: LanguageTool nightly diff test

2014-10-07 Thread Daniel Naber
On 2014-10-06 23:05, dna...@users.sourceforge.net wrote: 1. http://languagetool.org/regression-tests/20141006/result_en_20141006.html FYI, the lower part of the English diff was caused by this Exception, which I unfortunately cannot reproduce: Exception in thread main

workflow optimization

2014-10-07 Thread Daniel Naber
Hi, a question to the developers: are you happy with your workflow when making changes to LT? Is anything too slow? Here's what I mean by workflow: * I typically make a change to the source or the rules * I run ./build.sh de test, which takes 30 seconds on my machine * when everything is fine,

looking for more semantic rules

2014-10-07 Thread Daniel Naber
Hi, our new rule that checks a date and its day of the week is very useful, I used it to find more than 200 errors in the German-language Wikipedia (I think I mentioned that): https://de.wikipedia.org/wiki/Benutzer:Dnaber/WikiCon2014#Weitere_Beispiel-Fehler If you have more ideas for such

Re: looking for more semantic rules

2014-10-07 Thread R.J. Baars
About more semantic rule, what about time consistency? About the date check, I have been looking at the code, wanting to make a Dutch version, but there is no locale that fits Netherlands and Belgium; in fact there is none. Is there a way to work around that? Ruud Hi, our new rule that

Re: looking for more semantic rules

2014-10-07 Thread Dominique Pellé
Hi Ruud You can have a look at the Java files DateCheckFilter.java for Catalan, Breton or Esperanto, for which there is also no Java locale. Dominique PM, R.J. Baars r.j.ba...@xs4all.nl wrote: About more semantic rule, what about time consistency? About the date check, I have been looking

wikicheck Dutch

2014-10-07 Thread R.J. Baars
It is better to disable the rule: OT_EINDE_ZIN_ONVERWACHT [1] And there is still quite a bit of output showing wiki markup. Would it be an idea to: - make a javascript component for the wiki page, interpreting the page layout (which is html then) and checking the texts from that point? Ruud

Re: looking for more semantic rules

2014-10-07 Thread Daniel Naber
On 2014-10-07 13:51, R.J. Baars wrote: It was not too difficult to translate that. I attached my proposal. I am not able however to add it to the Dutch codebase and test it, so if someone has some time to spare ... I have added it. You will still need to add a rule in grammar.xml like the

Re: wikicheck Dutch

2014-10-07 Thread Daniel Naber
On 2014-10-07 14:05, R.J. Baars wrote: Would it be an idea to: - make a javascript component for the wiki page, interpreting the page layout (which is html then) and checking the texts from that point? That's similar to what I posted the other day: the proper solution is to use Parsoid,

Re: looking for more semantic rules

2014-10-07 Thread R.J. Baars
I know. I will check the existing rules for EN and DE. Probably re-usable. Will get the updated build tomorrow. Ruud On 2014-10-07 13:51, R.J. Baars wrote: It was not too difficult to translate that. I attached my proposal. I am not able however to add it to the Dutch codebase and test it,

date checks

2014-10-07 Thread R.J. Baars
Daniel, one of the date checks was commented out. I think it could still be of use, if the ambiguous items were removed, e.g. using antipattern. Somewhat like this? (Was not able to test it yet..) Ruud !-- Monday 31/12/2014 - this can be ambiguous (e.g. 6/7/2014) and is thus

Re: workflow optimization

2014-10-07 Thread Andriy Rysin
That's very similar to my workflow, except before I commit I run my rules on huge (5-10 years) archive of 4 Ukrainian magazines, which shows if my rules really work and don't have false positives and which takes about 40 min. So even if I would not mind for build.sh to be a bit faster it probably

special date stuff in english grammar file

2014-10-07 Thread R.J. Baars
Will the entries below be necessary to have the datechecker working? Ruud !--suppress CheckTagEmptyBody -- !DOCTYPE rules [ !ENTITY weekdays maandag|dinsdag|woensdag|donderdag|vrijdag|zaterdag|zondag !ENTITY abbrevWeekdays ma|di|wo|do|za|zo !ENTITY months

Re: special date stuff in english grammar file

2014-10-07 Thread Daniel Naber
On 2014-10-07 18:39, R.J. Baars wrote: Will the entries below be necessary to have the datechecker working? Ruud !--suppress CheckTagEmptyBody -- !DOCTYPE rules [ !ENTITY weekdays maandag|dinsdag|woensdag|donderdag|vrijdag|zaterdag|zondag Only if you refer to the definitions with

Re: special date stuff in english grammar file

2014-10-07 Thread R.J. Baars
So this is a kind of 'macro'.. Good to know that exists. Might come in handy for some type of exceptions.. Ruud On 2014-10-07 18:39, R.J. Baars wrote: Will the entries below be necessary to have the datechecker working? Ruud !--suppress CheckTagEmptyBody -- !DOCTYPE rules [ !ENTITY