Re: tokenizing numbers

2014-09-27 Thread Andriy Rysin
Having a number split into multiple tokens will definitely make rules around those numbers quite complicated. On Sep 24, 2014 3:04 PM, "R.J. Baars" wrote: > Maybe we agree to disagree.. > > Having them as one token makes detecting patterns easy using regular > expressions.. > > Ruud > > > > For P

Re: Large amount of rules ...

2014-09-27 Thread R.J. Baars
Jaume, I think some of the rules will eventually move to simplereplacerule, because it is one word, and not accepted by the spell checker. Adding it to the ignore list will then remove that error, and give the simplereplacerule alternative (there is no useful alternative from the spell checker sin

Re: Large amount of rules ...

2014-09-27 Thread R.J. Baars
The replace rule has a text that says 'wrong'. While using loan words is more a matter of style, depending on the readers being targetted. A simple list alternative would be nice, But I don't do any Java. I can do a little, making simple tools, but I have never been trained to the level required t

Re: Large amount of rules ...

2014-09-27 Thread Jaume OrtolĂ  i Font
2014-09-27 11:06 GMT+02:00 R.J. Baars : > > It is all about suggesting a Dutch word for a loanword. > Then why don't you use a simple replace rule (in Java)? You can use the existing one (or adapt it) and put the list of words in a text file. Jaume ---

languagetool-devel@lists.sourceforge.net

2014-09-27 Thread R.J. Baars
Okay. It is the only char so far encoded that way then. Ruud > On 2014-09-27 10:13, R.J. Baars wrote: > >> How do I get an & as token? It generates an error: >> >> & > > Use & > > Regards > Daniel > > > -- > Meet

Re: Large amount of rules ...

2014-09-27 Thread R.J. Baars
Okay. I will first have to check the results for frequency of getting triggered. They have to be different rules, maybe be in a rulegroup. It is all about suggesting a Dutch word for a loanword. Ruud > On 2014-09-26 21:53, R.J. Baars wrote: > >> Will adding 5000 rules lead to problems? > > It w

languagetool-devel@lists.sourceforge.net

2014-09-27 Thread Daniel Naber
On 2014-09-27 10:13, R.J. Baars wrote: > How do I get an & as token? It generates an error: > > & Use & Regards Daniel -- Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer Achieve PCI DSS 3.0 Com

Re: Large amount of rules ...

2014-09-27 Thread Daniel Naber
On 2014-09-26 21:53, R.J. Baars wrote: > Will adding 5000 rules lead to problems? It will make checking slower. How much needs to be tested. It would also make the configuration dialog difficult to use, so these words would need to be put in a rulegroup or a few rulegroups. Regards Daniel

languagetool-devel@lists.sourceforge.net

2014-09-27 Thread R.J. Baars
How do I get an & as token? It generates an error: & Exception in thread "main" java.io.IOException: Cannot load or parse '/org/languagetool/rules/nl/grammar.xml' at org.languagetool.XMLValidator.validateWithXmlSchema(XMLValidator.java:130) at org.languagetool.rules.patterns