Re: new language

2015-01-28 Thread R.Baars
An artificial language. Okay. That explains why it is not in Ethologue. Ruud Op 28-01-15 om 15:02 schreef Буковинець: > Hi R.Baars, > > Wednesday, January 28, 2015, 2:07:14 PM, you wrote: > >> What is the name of that language in English, what is its ISO language >> co

Re: new language

2015-01-28 Thread R.Baars
What is the name of that language in English, what is its ISO language code ? Op 28-01-15 om 12:45 schreef Буковинець: > Hi Daniel, > > Wednesday, January 28, 2015, 12:59:29 AM, you wrote: > >> great, welcome the LanguageTool! Which new language would you like to >> write rules for? You can actu

Remark on spell check quality

2015-01-21 Thread R.Baars
I have been looking into a lot of spellcheck files (hunspell) lately. Most of those don't care about case. So most will support a word like th, because there is the chemical element Th. (Same for all other elements). When converting a spellcheck file, it could be worth checking out if those erro

Re: spell checker enhancement

2014-09-16 Thread R.Baars
sentences from internet sources have at least 1 spelling error.) Ruud Op 16-09-14 om 15:31 schreef Jaume Ortolà i Font: 2014-09-16 14:43 GMT+02:00 R.Baars <mailto:baar...@xs4all.nl>>: How is that done? Ruud Do you mean ignoring tagged words in spellchecking (even if they

Re: spell checker enhancement

2014-09-16 Thread R.Baars
How is that done? Ruud Op 16-09-14 om 13:23 schreef Jaume Ortolà i Font: 2014-09-16 13:03 GMT+02:00 R.Baars <mailto:baar...@xs4all.nl>>: I see. This is probably of no use for spellchecking, but it is for postagging. It gives no suggestions, but it can be used for avoid

Re: spell checker enhancement

2014-09-16 Thread R.Baars
rds.txt 2014-09-16 12:33 GMT+02:00 R.Baars <mailto:baar...@xs4all.nl>>: Jaume, thanks, but I am not sure. Depends on its implementation I think. Where can I find more info? Ruud Op 16-09-14 om 12:26 schreef Jaume Ortolà i Font: 2014-09-16 11:21 GMT+02:00 R.J. Baa

Re: spell checker enhancement

2014-09-16 Thread R.Baars
Jaume, thanks, but I am not sure. Depends on its implementation I think. Where can I find more info? Ruud Op 16-09-14 om 12:26 schreef Jaume Ortolà i Font: 2014-09-16 11:21 GMT+02:00 R.J. Baars >: We don't agree. There is a spellchecker, but also a single word

Re: spell checker enhancement

2014-09-16 Thread R.Baars
I know it will be simple to generate ignore rule like this, And I will probably do that, as soon as they pop up in the frequency table. Ruud Op 16-09-14 om 12:01 schreef Marcin Miłkowski: > W dniu 2014-09-16 o 11:21, R.J. Baars pisze: >> Marcin, >> >> We don't agree. There is a spellchecker, but

Re: CompoundRule

2014-09-15 Thread R.Baars
Thanks, I will add that to the comments in the file. In the first option, the order is still the reverse one of what is generally wanted for Dutch. Ruud Op 15-09-14 om 13:27 schreef Daniel Naber: > On 2014-09-15 12:19, R.J. Baars wrote: > >> When 'word1 word2' is seen, I would need to suggest w

Re: Dutch WikiCheck

2014-09-15 Thread R.Baars
My problem is there are enormous amounts of errors generated by the checks where wiki mark-up is met. Especially name= etc. It is not for me, but for any wikipedia user checking pages .. Maybe a built-in parsoid-like routine? What is it we do check? Is it enough when all wiki mark-up is hidden

Re: simplify GenericUnpairedBracketsRule implementation

2014-09-15 Thread R.Baars
I think it would be great to be able to maken multi-sentence rules in XML ... I know that is not what is suggested here, but nevertheless... Ruud Op 15-09-14 om 13:00 schreef Daniel Naber: > Hi, > > GenericUnpairedBracketsRule detects quotes that do not get closed etc. > So what it does isn't ov

include xml

2014-08-23 Thread R.Baars
I am quite sure that I read somewhere the grammar.xml can include another xml, but cannot find the instructions again. Can someone help me on this? Ruud -- Slashdot TV. Video for Nerds. Stuff that matters. http://tv.s

Re: Dump

2014-05-27 Thread R.Baars
I did so. Will have to wait some time until the process will skip to another input file, but I will keep you informed. Ruud op 27-05-14 11:06, Marcin Miłkowski schreef: > Hi, > > maybe it was because of a simple mistake in the isNumberOrDot() method. > I fixed it, so the today's build should run

Re: homophone detection

2014-05-07 Thread R.Baars
Good work has been done using this and more sophisticated tools by the universities of Nijmegen and Tilburg, by A. v d Bosch et al. Their tools are also fully open source. These tools got public as 'valkuil.net' and 'fowl.net'. It requires a quite heavy server. In case you are interested, Prof

Re: Wiki

2014-01-10 Thread R.Baars
Yes, that is what I meant, thanks. I am able to log in, but not to edit. After clicking on 'edit', I get a Permission error: Sorry, you can not edit this page. Only members Ruud op 10-01-14 17:19, Daniel Naber schreef: > On 2014-01-09 16:26, R.J. Baars wrote: > >> Could someone please th

Re: showing example sentences in GUI

2013-12-02 Thread R.Baars
Aha, that makes sense ... Ruud op 02-12-13 13:19, Daniel Naber schreef: > On 2013-12-02 13:10, R.J. Baars wrote: > >> Why show sentences with errors to people that need help getting it >> right? >> It is not an objection, more a question of : is there a reason from the >> user perspective? > Usua

Re: better Wikipedia match filtering

2013-11-26 Thread R.Baars
That is far out of my league. Ruud op 26-11-13 17:35, Daniel Naber schreef: > On 2013-11-26 13:55, R.J. Baars wrote: > >> It is huge (>10 GB). Since it is data captured from sites, it is not >> guaranteed to be as free as is needed for free publication of results. > This makes it difficult to be

Re: planning LT homepage relaunch

2013-10-28 Thread R.Baars
Idea: map the languages on a globe (might be hard for languages that share countries) Or just group them by continent? op 28-10-13 19:40, Marcin Miłkowski schreef: > W dniu 2013-10-28 16:57, Daniel Naber pisze: >> On 2013-10-23 20:08, Daniel Naber wrote: >> >>> I've planned for long to relaunch t

Re: en bloc

2013-10-26 Thread R.Baars
That is french, Why not remove noun tags for this word group in the disambiguator? op 27-10-13 07:08, Kumara Bhikkhu schreef: "They walked out of the _hall en bloc_." is flagged by id="THREE_NN" name="Readability: Three nouns in a row"> Not sure what's the best way to fix this. Using exception

Re: planning LT homepage relaunch

2013-10-24 Thread R.Baars
Good free templates as well as glossaries are available using a CMS like Joomla. It also support multiple languages and switching between these. And it supports contirbutions by multiple people. I could talk to my Joomla provider if I could offer you this service from the server I am already pa