Re: move mailing list to forum?

2016-09-30 Thread Daniel Naber
On 2016-09-25 12:22, Daniel Naber wrote: > I > suggest closing this mailing list and using our web forum instead. This > should help against fragmentation of discussions. Thanks for everyone's feedback. As most people agree with the move to the forum, I'll close this list soon. That means the li

Re: JLanguageTool Cannot differ between Information Technology (IT) and the pronoun 'it'

2016-09-28 Thread Daniel Naber
On 2016-09-27 21:06, Fernand FERDINANDE wrote: > Where do I send unknown Dutch words to? (unknown by language tool or > correct words told by language tool to be mistake) According to the README, you can send them to info at taaltik.nl Regards Daniel -

Re: JLanguageTool Cannot differ between Information Technology (IT) and the pronoun 'it'

2016-09-27 Thread Fernand FERDINANDE
Where do I send unknown Dutch words to? (unknown by language tool or correct words told by language tool to be mistake) > Op 26 sep. 2016, om 16:12 heeft Daniel Naber > het volgende geschreven: > > On 2016-09-26 14:48, Idan Morad wrote: > >> What about the spelling mistakes? > > We don't mai

Re: Languagetool-devel Digest, Vol 121, Issue 11

2016-09-27 Thread Sharad Sinha
u can reach the person managing the list at > languagetool-devel-ow...@lists.sourceforge.net > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of Languagetool-devel digest..." > > Today's Topics: > >1. old

Re: move mailing list to forum?

2016-09-27 Thread Andriy Rysin
I felt like mailing list is used more by developers and users writing to the forum. I personally prefer mailing list, but I understand if maintaining both is too much work. Andriy 2016-09-26 12:14 GMT-04:00 Andre Couture : > I personally like the idea of moving to a forum. > > I have register wi

Re: move mailing list to forum?

2016-09-26 Thread Andre Couture
I personally like the idea of moving to a forum. I have register with many forums and most, if not all, of them have option to subscribe to threads. Doing this allow me to receive notifications when new content is added and even some allow to send the content of the changes so I dont need to ev

Re: move mailing list to forum?

2016-09-26 Thread Silvan Jegen
On Mon, Sep 26, 2016 at 04:14:49PM +0200, Daniel Naber wrote: > On 2016-09-26 15:49, Dmitri Gabinski wrote: > > I find a mail list a better solution generally: you can archive > > threads/messages that you deem relevant and delete ones that you deem > > irrelevant. > > You can also do that with a

Re: move mailing list to forum?

2016-09-26 Thread Daniel Naber
On 2016-09-26 15:49, Dmitri Gabinski wrote: > I find a mail list a better solution generally: you can archive > threads/messages that you deem relevant and delete ones that you deem > irrelevant. You can also do that with a forum, as you can get everything you're interested in via mail too. Reg

Re: JLanguageTool Cannot differ between Information Technology (IT) and the pronoun 'it'

2016-09-26 Thread Daniel Naber
On 2016-09-26 14:48, Idan Morad wrote: > What about the spelling mistakes? We don't maintain the spelling dictionary, but you can suggest words at https://github.com/marcoagpinto/aoo-mozilla-en-dict/ (en-GB) and http://wordlist.aspell.net/ (en-..) Regards Daniel ---

Re: move mailing list to forum?

2016-09-26 Thread Dmitri Gabinski
I find a mail list a better solution generally: you can archive threads/messages that you deem relevant and delete ones that you deem irrelevant. In a forum, you are at admin’s mercy, and virtually in all forums I know, threads are not deleted even when they are obsolete. Thus, a forum accumulates

Re: JLanguageTool Cannot differ between Information Technology (IT) and the pronoun 'it'

2016-09-26 Thread Idan Morad
Hi thanks =) What about the spelling mistakes? Best Reagrds, Idan Morad On Sun, Sep 25, 2016 at 3:28 PM, Idan Morad wrote: > Hi, > > My corpus is mostly comprised of articles that comes from Information > Technology and I'd like to find if the analysts that wrote those articles > have done gra

Re: move mailing list to forum?

2016-09-26 Thread Jan Schreiber
I agree. --Jan Am 26.09.2016 um 11:31 schrieb Mike Unwalla: > I am in favour of this proposed change. > > Mike > -- ___ Languagetool-devel mailing list Languagetool-devel@list

RE: move mailing list to forum?

2016-09-26 Thread Mike Unwalla
I am in favour of this proposed change. Mike -Original Message- From: Daniel Naber [mailto:daniel.na...@languagetool.org] Sent: 25 September 2016 11:22 To: LanguageTool Developer List Subject: move mailing list to forum? Hi, there's not much traffic here on the list. Some discussions h

Re: JLanguageTool Cannot differ between Information Technology (IT) and the pronoun 'it'

2016-09-26 Thread Daniel Naber
On 2016-09-25 14:28, Idan Morad wrote: Hi Idan, > My corpus is mostly comprised of articles that comes from Information > Technology and I'd like to find if the analysts that wrote those > articles have done grammar mistakes. > Now I getting a lot of false positive (made mistakes) with the rules

Re: Inflecting second token with postag from the first

2016-09-13 Thread Andriy Rysin
I did a quick POC for allowing back reference in match lemmas. The (rough) patch is attached. I must say I have mixed feelings about this: the logic in match state is quite complicated, I've modified the code path that works for my rule and also for the new test I've added. But tracing and testing

Re: Inflecting second token with postag from the first

2016-09-13 Thread Dominique Pellé
Jesper wrote: > It looks very strange to me to include ".*" in a replacement expression. I understand that it looks strange. But in some cases, the result of replacement is a regexp. That's why regexp syntax can appear inside the regexp_replace="". I see other examples in: - the Polish gra

Re: Inflecting second token with postag from the first

2016-09-13 Thread Jesper Hertel
Yes, but I think those would be included in the $2 which catches everything after fname (.*). ​​ It looks very strange to me to include ".*" in a replacement expression. But now I stated my observation so it is up to you if you want to go into it. Best, Jesper​ 2016-09-13 23:21 GMT+02:00 Andriy

Re: Inflecting second token with postag from the first

2016-09-13 Thread Andriy Rysin
There are some cases where last name would have extra tags. Regards, Andriy On Sep 13, 2016 5:05 PM, "Jesper Hertel" wrote: > Hi Andriy, > > As a beginner in LanguageTool I know almost nothing about this, but I do > have a few decades of experience in regular expressions, and the .* looks > str

Re: Inflecting second token with postag from the first

2016-09-13 Thread Jesper Hertel
Hi Andriy, As a beginner in LanguageTool I know almost nothing about this, but I do have a few decades of experience in regular expressions, and the .* looks strange to me in a replacement expression: postag_replace="$1lname$2.*" Are you sure it shouldn't simply be postag_replace="$1lname$2" ?

Re: Inflecting second token with postag from the first

2016-09-13 Thread Jaume Ortolà i Font
2016-09-13 22:27 GMT+02:00 Andriy Rysin : > Sorry if this is already written somewhere - I looked at wiki pages but > could not find anything relevant. > > I have two tokens (first name and last name) and in the suggestion I want > to inflect second token the same as the first. I tried to do this:

Re: Inflecting second token with postag from the first

2016-09-13 Thread Daniel Naber
On 2016-09-13 22:27, Andriy Rysin wrote: > postag="(noun.*:m.*:)fname(.*)" > postag_replace="$1lname$2.*">2 > > but it sends the tests into 100% CPU loop and I don't have access to > my Eclipse to try to debug this. You can just run "jstack " to get a stacktrace and see where it's looping. jst

Re: AbstractSimpleReplaceRule: look through multiple lemmas

2016-09-13 Thread Andriy Rysin
Thanks, pushed. Andriy 2016-09-13 3:03 GMT-04:00 Daniel Naber : > On 2016-09-13 00:51, Andriy Rysin wrote: > > > All language tests passed but as it's a core rule used by many I'd > > like to do a review before I push. > > + // short for SENT_START > + if( StringUtils.isEmpty(tokenRead

Re: AbstractSimpleReplaceRule: look through multiple lemmas

2016-09-13 Thread Daniel Naber
On 2016-09-13 00:51, Andriy Rysin wrote: > All language tests passed but as it's a core rule used by many I'd > like to do a review before I push. + // short for SENT_START + if( StringUtils.isEmpty(tokenReadings.getToken()) ) +continue; I suggest you explicitly check for SENT_

RE: Language specific settings

2016-09-08 Thread Mike Unwalla
Hi Marco, My comment is about ISE and IZE in English, not about the idea for language-specific settings. For British English and American English, the use of ISE and IZE is clear (I do not know the rules/conventions for other English variants). But, for British English, for some words, some pe

RE: Processing time is linear, but elapsed time with GUI is not

2016-09-03 Thread Daniel Naber
On 2016-09-02 11:10, Mike Unwalla wrote: Hi Mike, > With text that has no errors, and with the same set of rules selected > as > previously, the elapsed time is linear. As best I can measure with a > stopwatch, it is approximately processor time + 0.2 seconds. so it's indeed just the JTextPane

RE: Processing time is linear, but elapsed time with GUI is not

2016-09-02 Thread Mike Unwalla
...@languagetool.org] Sent: 31 August 2016 21:56 To: development discussion for LanguageTool Subject: Re: Processing time is linear, but elapsed time with GUI is not On 2016-08-31 19:03, Mike Unwalla wrote: Hi Mike, > Do you know why the elapsed time is non-linear? What can I do to make > the elapse

Re: Processing time is linear, but elapsed time with GUI is not

2016-08-31 Thread Daniel Naber
On 2016-08-31 19:03, Mike Unwalla wrote: Hi Mike, > Do you know why the elapsed time is non-linear? What can I do to make > the elapsed time linear (or as linear as possible)? this might be caused by the component we use to show the errors, which supports HTML for formatting but isn't very fast

Re: Fwd: Yakov! (rule still giving error)

2016-08-17 Thread Marco A.G.Pinto
Thanks, Yakov and Jan! Kind regards, >Marco A.G.Pinto --- On 17/08/2016 20:37, Yakov Reztsov wrote: > Another variant of rule: > > > > > > há > regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? > atrás >

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-17 Thread Daniel Naber
On 2016-08-17 14:01, Mike Unwalla wrote: > I am confused. Does 'nightly build' mean the snapshots on > https://languagetool.org/download/snapshots/?C=M;O=D? Yes. There's not really a good solution, but I've now added a parameter "-DdisableHardcodedTests" which you can add when you call Maven to

Re: Fwd: Yakov! (rule still giving error)

2016-08-17 Thread Yakov Reztsov
Another variant of rule:                 há     segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?     atrás     Com o verbo haver não é necessário usar "atrás": .   Há n segundos atrás.     >Среда, 17 августа 2016, 21:49 +03:00 от "Marco A.G.Pinto": >

Re: Fwd: Yakov! (rule still giving error)

2016-08-17 Thread Yakov Reztsov
Marco, I tested this rule with local mashine and community.languagetool.org:               há     [a-záãêéçôłę]{1,6}     segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?     atrás     Com o verbo haver não é necessário usar "atrás": .   Há n segundo

RE: English native speaker help

2016-08-17 Thread Mike Unwalla
find ' I never have been' is useful. A rule that finds only 'never have been' is probably not useful. Regards, Mike -Original Message- From: Jan Schreiber [mailto:jan.schrei...@languagetool.org] Sent: 17 August 2016 13:04 To: languagetool-devel@lists.sourcef

Re[2]: Help improve rule pt_PT

2016-08-17 Thread Yakov Reztsov
Try: >Среда, 17 августа 2016, 15:05 +03:00 от "Marco A.G.Pinto" >: > >Thanks, Jan, I use "1" and "3" but TESTRULES PT gives an error in: >Com o verbo haver não é necessário usar "atrás": \1 \2 \3. > >Can you help? > >Thanks! > >On 17/08/2016 12:36, Jan Schreiber wrote: >>Marco, I

Re: Help improve rule pt_PT

2016-08-17 Thread Jan Schreiber
I would try it like this: há segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? atrás Com o verbo haver não é necessário usar "atrás". Há n segundos atrás. In other words, replace 'at

Re: Help improve rule pt_PT

2016-08-17 Thread Marco A.G.Pinto
Thanks, Jan, I use "1" and "3" but TESTRULES PT gives an error in: Com o verbo haver não é necessário usar "atrás": \1 \2 \3. Can you help? Thanks! On 17/08/2016 12:36, Jan Schreiber wrote: > Marco, > > I think you could replace > > from your original version with the following: > > (Replace 0

Re: English native speaker help

2016-08-17 Thread Jan Schreiber
Hi list, What Athena says is exactly what I learned in school, and they were supposed to teach us British English there. So it's probably safe to add this as a rule for both variants. See also: http://www.englishteachermelanie.com/grammar-present-perfect-have-you-ever-been-to/ Best, Jan Am 17

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-17 Thread Mike Unwalla
e Commission', is the Commission being controlled (audited?) or is it doing the c ontrolling?] expected:<0> but was:<2> Tests run: 64, Failures: 1, Errors: 0, Skipped: 4 [INFO] -------- [INFO] Reactor Summary: [INFO] [INFO] languagetool-p

Re: Help improve rule pt_PT

2016-08-17 Thread Jan Schreiber
Marco, I think you could replace from your original version with the following: (Replace 0 with 1 if necessary.) Best, Jan Am 17.08.2016 um 13:25 schrieb Marco A.G.Pinto: > Yakov, > > It didn't work. > > I tried it in the stand-alone tool after using *TESTRULES PT*, with the > sentences: > *Is

Re: Help improve rule pt_PT

2016-08-17 Thread Marco A.G.Pinto
Yakov, It didn't work. I tried it in the stand-alone tool after using *TESTRULES PT*, with the sentences: *Isso aconteceu à quase 20 anos!** **Isso aconteceu à quase vinte e três anos!** **Há 10 anos atrás** **Há quarenta e dois anos atrás* Rules I changed: *** **** ** ** **

Re: Help improve rule pt_PT

2016-08-17 Thread Yakov Reztsov
Hello! One of the options is add  skip="1":           há         segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?     atrás     Com o verbo haver não é necessário usar "atrás": \1 \2 \3.   Há n segundos atrás.     >Среда, 17 августа 2016, 10:59 +03

RE: Possible English rule

2016-08-17 Thread Mike Unwalla
Yes, the rule would be useful. Aside. The rule I_MOVING finds some related problems: Finds: I as walking. Finds: I as eating. Does not find: He as smiling. Does not find: I as calling. The disambiguator log shows that for the 2 sentences that I_MOVING does not find, the disambiguator uses rule A

Re: English native speaker help

2016-08-17 Thread Athena Santora
Hi there In American English the sentence is awkward. It's not "wrong" to say "I never have been" but a native speaker wouldn't commonly phrase it that way. The other "error" : "been IN" isn't technically an error either but a native speaker wouldn't commonly phrase it this way either (now that I

Re: English native speaker help

2016-08-17 Thread Bruno P. Kinoshita
Hi Daniel, Not a native speaker, but it sounds weird for me. I think I heard/read * I have never been* Never have I been Few variations: * I have never been better* Never have I been better* Never been better* Have never been better But "I never have been better" would still sound weird for me. No

Re[2]: update community.languagetool.org ?

2016-08-16 Thread Yakov Reztsov
Thanks! >Вторник, 16 августа 2016, 11:47 +03:00 от Daniel Naber: > >On 2016-08-16 08:45, Yakov Reztsov wrote: > >> can you update community.languagetool.org to current version of >> LanguageTool? > >The update should happen automatically, but there was a bug with file >permissions. This is fixed

Re: update community.languagetool.org ?

2016-08-16 Thread Daniel Naber
On 2016-08-16 08:45, Yakov Reztsov wrote: > can you update community.languagetool.org to current version of > LanguageTool? The update should happen automatically, but there was a bug with file permissions. This is fixed now. Regards Daniel --

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-15 Thread Daniel Naber
On 2016-08-15 15:50, Mike Unwalla wrote: > Sure enough, the exception for 'something' prevents the Maven error > message. But why? How can I know (without guessing) the cause of a > Maven error message in the context of the test sentences? If you're using a recent nightly build, the message shoul

Re: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-15 Thread Fernand FERDINANDE
Perhaps "shall and Maven" is telling you something but for me it’s telling me nothing. But it makes my happy if I could be of some help. I’m always asking myself how you can correct a language you don’t know. This must be leading to mistakes most of French speaking people do when they speak Dutc

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-15 Thread Mike Unwalla
Thanks. I corrected the rule for 'shall' and Maven gives no errors. For 'establish', I guessed that by adding an exception for the postag JJ, (=established), Maven would not give an error. My guess was correct. (And the original rule was faulty, because it should have been about the verb only.)

Re: Google docs add-on

2016-08-15 Thread Jordi Mas
El 15/08/2016 a les 10:09, Daniel Naber ha escrit: > Hi, > > based on feedback from the Chrome add-on, it seems having a Google Docs > add-on would be very useful for many (potential) LT users. Anybody who'd > like to work on it? Some Javascript knowledge will be required. It > should be possible t

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-12 Thread Daniel Naber
On 2016-08-12 20:41, Mike Unwalla wrote: > The rules are for Gardner's 'Misused terms in EU publications' > (http://euenglish.webs.com), which I want to commit to LT. It says "The use of ‘shall’ in the third person (he/she/it/they) is archaic". > Here is a test rule: > > >

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-12 Thread Mike Unwalla
--- [INFO] Reactor Summary: Regards, Mike -Original Message- From: Daniel Naber [mailto:daniel.na...@languagetool.org] Sent: 12 August 2016 18:52 To: development discussion for LanguageTool Subject: RE: [en] Maven error when approximately 820 lines are adde

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-12 Thread Daniel Naber
On 2016-08-12 10:58, Mike Unwalla wrote: > My rules for these words cause the Maven errors: do, establish, of, > shall. Could you be more specific, i.e. why does the rule match the sentence? Is this a rule you're going to commit to LT or is it a "private" rule? > "Anatomy and geometry are fus

RE: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-12 Thread Mike Unwalla
[mailto:daniel.na...@languagetool.org] Sent: 11 August 2016 20:03 To: development discussion for LanguageTool Subject: Re: [en] Maven error when approximately 820 lines are added to grammar.xml On 2016-08-11 14:57, Mike Unwalla wrote: > org.languagetool.JLanguageToolTest.testEnglish(JLanguageToolTest.j

Re: [en] Maven error when approximately 820 lines are added to grammar.xml

2016-08-11 Thread Daniel Naber
On 2016-08-11 14:57, Mike Unwalla wrote: > org.languagetool.JLanguageToolTest.testEnglish(JLanguageToolTest.java:84) We have a small number of test sentences in the Java code (instead of in the grammar.xml). In this case, this sentence now leads to an error, either because there's an actual err

RE: Inflected matched tokens: suggest only 1 of 2 possibilities

2016-08-08 Thread Mike Unwalla
- From: Daniel Naber [mailto:daniel.na...@languagetool.org] Sent: 08 August 2016 15:39 To: development discussion for LanguageTool Subject: Re: Inflected matched tokens: suggest only 1 of 2 possibilities On 2016-08-08 14:07, Mike Unwalla wrote: > Sometimes, for an inflection, LT offers

Re: Inflected matched tokens: suggest only 1 of 2 possibilities

2016-08-08 Thread Daniel Naber
On 2016-08-08 14:07, Mike Unwalla wrote: > Sometimes, for an inflection, LT offers two different spellings. Thus, > I must include both spellings in a rule: Have you tried suppress_misspelled="yes" on the ? As "canceled" and "cancelled" are only correct in British and American resp., but not in

Re: our Firefox add-on and Firefox 48

2016-08-05 Thread Daniel Naber
On 2016-08-02 23:21, Daniel Naber wrote: > Firefox 48 has been released today with support for WebExtensions. I > have thus linked the new extension > (https://addons.mozilla.org/de/firefox/addon/languagetool/) from our > homepage, which replaces the old extension (LanguageToolFx). Who's using th

Re: our Firefox add-on and Firefox 48

2016-08-03 Thread Yakov Reztsov
Hi! Great! We need have small links to older version of LanguageToolFX for users older versions of Firefox. >Среда, 3 августа 2016, 0:21 +03:00 от Daniel Naber: > >Hi, > >Firefox 48 has been released today with support for WebExtensions. I >have thus linked the new extension >( https://addon

Re: [en] postags for 'haven'

2016-07-29 Thread Marcin Miłkowski
W dniu 29.07.2016 o 13:57, Mike Unwalla pisze: > Marcin, thanks. > > Except for the removal of VBP, I decided to make no changes to the > disambiguation at this time, for these reasons: > > 1. In disambiguation.xml, if I remove the readings MD and VBP from 'haven' > if it is not MD (that is, it is

RE: [en] postags for 'haven'

2016-07-29 Thread Mike Unwalla
Marcin, thanks. Except for the removal of VBP, I decided to make no changes to the disambiguation at this time, for these reasons: 1. In disambiguation.xml, if I remove the readings MD and VBP from 'haven' if it is not MD (that is, it is not part of "haven't"), my problem is not solved. This sugg

Re: [en] postags for 'haven'

2016-07-27 Thread Marcin Miłkowski
W dniu 25.07.2016 o 16:28, Mike Unwalla pisze: > Hello, > > The word 'haven' has these postags: NN, MD, VBP. > > When the word is not a verb (as in "the village is a haven of tranquility"), > I want to remove MD and VBP. Postag VBP causes 'haven' to appear as a > suggestion when I use postag_repl

Re: Update check text on pt_PT

2016-07-25 Thread Daniel Naber
On 2016-07-26 00:00, Marco A.G.Pinto wrote: > I haven't found the file that has the check text example, so I was > wondering if someone could update the pt_PT one: For the homepage, it's https://github.com/languagetool-org/languagetool-website/blob/master/include/default_texts.php Regards Da

Re: [pt-BR] Question about language tool home page.

2016-07-17 Thread Daniel Naber
On 2016-07-16 21:43, Matheus Poletto wrote: > I was looking in https://www.languagetool.org [1] so i found the > different versions of translations like to /pt/, /uk/ ... As im doing > the pt-BR i would like ask you if is possible to translate and create > a pt-BR version, cause some words from /

Re: [pt-BR] Question about language tool home page.

2016-07-16 Thread Bruno P. Kinoshita
Err, was running the site locally, and copied the link from the wrong tab  kinoshita.eti.br/2016/07/17/reasons-for-having-pt-and-ptbr-in-a-software/ Bruno From: Bruno P. Kinoshita To: development discussion for LanguageTool Sent: Sunday, 17 July 2016 4:35 PM Subject: Re: [pt-BR

Re: [pt-BR] Question about language tool home page.

2016-07-16 Thread Bruno P. Kinoshita
k to maintain both pages, then I'm +1 for keeping just /pt/. But if it's easy to maintain, then I'd be +1 to add the page. Just my 0.02 cents Bruno From: Jaume Ortolà i Font To: development discussion for LanguageTool Sent: Sunday, 17 July 2016 9:14 AM Subject: Re: [pt

Re: [pt-BR] Question about language tool home page.

2016-07-16 Thread Jaume Ortolà i Font
2016-07-16 21:43 GMT+02:00 Matheus Poletto : > Hi guys; > I was looking in https://www.languagetool.org so i found the different > versions of translations like to /pt/, /uk/ ... As im doing the pt-BR i > would like ask you if is possible to translate and create a pt-BR version, > cause some word

Re: Help creating rule pt_PT

2016-07-15 Thread Jaume Ortolà i Font
You can list languages: *Latim|**Francês|Inglês|Português*** Or you can do this: ** * * *Latim*** ** which. in fact, is equivalent to writing two different rules. Jaume 2016-07-15 13:15 GMT+02:00 Marco A.G.Pinto : > Jaume, > > just *NCMS000 *creates lots of false positives: > "Traduz

Re: Help creating rule pt_PT

2016-07-15 Thread Marco A.G.Pinto
Jaume, just *NCMS000 *creates lots of false positives: "Traduzir em sexo" I believe I really need to have the rule as I planned it. Do you have any suggestions or should I create two rules with "Latim" in one? Thanks! Kind regards, >Marco A.G.Pinto --- On 15/07

Re: Help creating rule pt_PT

2016-07-15 Thread Jaume Ortolà i Font
Hi, Most languages have the postag NCMS000, including "Latim". Try: * * Regards, Jaume Ortolà 2016-07-15 12:45 GMT+02:00 Marco A.G.Pinto : > Hello! > > I am trying to create the following rule: > traduzir *em *LANG -> traduzir *para *LANG > (translate TO LANG) > > ** > ** > * * >

Re: External file for English rule EN_A_VS_AN

2016-07-12 Thread Daniel Naber
On 2016-07-12 15:07, Mike Unwalla wrote: > English rule EN_A_VS_AN > (http://community.languagetool.org/rule/show/EN_A_VS_AN?lang=en-US) > finds a mismatch between the indefinite article and the next word ('a > hour'). A comment in the Java source code (AvsAnRule.java) is "This > rule loads some e

RE: Suggestion for English rule - masters thesis > master's thesis

2016-07-11 Thread Mike Unwalla
Done. Refer to https://github.com/languagetool-org/languagetool/commit/12d526ac18d2761770719cac99d047ca9ffcab14. Mike -Original Message- Unless I get an objection in the next few days, I will make 2 rules, one for AmE and one for BrE.

RE: Maven shows errors, but why?

2016-07-08 Thread Mike Unwalla
LanguageTool Subject: Re: Maven shows errors, but why? On 2016-07-08 13:49, Mike Unwalla wrote: > eTest) Time elapsed: 0.18 sec <<< ERROR! > java.lang.NoSuchMethodError: > org.languagetool.rules.Example.fixed(Ljava/lang/String;)Ljava/lang/String; Sometimes maven gets confused

Re: Maven shows errors, but why?

2016-07-08 Thread Daniel Naber
On 2016-07-08 13:49, Mike Unwalla wrote: > eTest) Time elapsed: 0.18 sec <<< ERROR! > java.lang.NoSuchMethodError: > org.languagetool.rules.Example.fixed(Ljava/lang/String;)Ljava/lang/String; Sometimes maven gets confused and it's needed to clean the temporary files. For that, add "clean" to t

RE: ignore tags from binary dictionary

2016-07-08 Thread Daniel Naber
On 2016-07-08 11:10, Mike Unwalla wrote: > bugs bugsJJ > comes comeNN > xxxx JJ > xxx xxx JJ > > The format of a line is: fullform baseform postags (tab separated). > > What is the baseform for the non-existent adjective 'bugs'? I guess > that it > is 'bugs', rather than

RE: ignore tags from binary dictionary

2016-07-08 Thread Mike Unwalla
ss that it is 'bugs', rather than 'bug'. Is my guess correct? Regards, Mike Unwalla Contact: www.techscribe.co.uk/techw/contact.htm -Original Message- From: Daniel Naber [mailto:daniel.na...@languagetool.org] Sent: 07 July 2016 19:02 To: development discussion for La

Re: ignore tags from binary dictionary

2016-07-07 Thread Daniel Naber
On 2016-07-07 14:51, Mike Unwalla wrote: > One reason to use a rule in disambiguation.xml is that we can use > examples as a sanity check. Are there other reasons? What is the > preferred method now that we have 2 methods? I'd prefer removed.txt as it's a shorter notation, but I think both ways

Re: I want to be the Galician language maintainer

2016-07-04 Thread Anton Meixome
Thanks for your commitement with galician, Gabriel Grettings, Antón 2016-06-29 14:02 GMT+02:00 Daniel Naber : > On 2016-06-29 11:51, Gabriel André wrote: > > Hi Gabriel, > >> Last week I sent an e-mail to Daniel asking to be the Galician language >> maintainer. I am starting to create new rules

Re: The spell checker issue

2016-07-02 Thread Bruno P. Kinoshita
Some time ago I started a project that would use Hunspell with Java (I think at the time I used that library that uses BridJ). It worked well, but I also needed to have custom dictionaries, and another interesting feature would be merge dictionaries. Never had time to continue investigating it, b

Re: bulk corrections in Wikipedia using LT

2016-06-30 Thread Jaume Ortolà i Font
2016-06-30 13:02 GMT+02:00 Juan Martorell : > Great job, Jaume! > > However I found some too-greedy corrections in change_always.txt for > Spanish: > > "esta formada" > "esta constituida" > > Recently, the rule excluded the diacritic tilde for referrers, so to speak: > > from: > "Quedé con la abog

Re: bulk corrections in Wikipedia using LT

2016-06-30 Thread Juan Martorell
Great job, Jaume! However I found some too-greedy corrections in change_always.txt for Spanish: "esta formada" "esta constituida" Recently, the rule excluded the diacritic tilde for referrers, so to speak: from: "Quedé con la abogada y la fiscal, *é*sta formada en la Universidad Autónoma." goes

Re: Question about rules

2016-06-29 Thread Daniel Naber
On 2016-06-29 18:10, Matheus Poletto wrote: > I want know if is ok i keep writing and improving the rules to the > pt-BR and doing commit on it to the project? Let's do the next two or three changes with pull requests - if that works well, we'll provide you with direct write access to the LT g

Re: I want to be the Galician language maintainer

2016-06-29 Thread Daniel Naber
On 2016-06-29 11:51, Gabriel André wrote: Hi Gabriel, > Last week I sent an e-mail to Daniel asking to be the Galician language > maintainer. I am starting to create new rules and to review the old > ones > now. I am not a programmer myself, but I am a linguist with some NLP > knowledge. welcom

Re: The spell checker issue

2016-06-29 Thread Dominique Pellé
Daniel Naber wrote: > Hi, > > yesterday I tried to update the English dictionary that LT includes. The > details are documented at > https://github.com/languagetool-org/languagetool/issues/329 but in a > nutshell: our spell checking is so complicated that the dictionary > update didn't work. > >

Re: Several in same rule?

2016-06-28 Thread Daniel Naber
On 2016-06-28 20:52, Dominique Pellé wrote: > Is the feature described in this old thread still > on a todo list? Probably, together with maybe 200 other items. So the best way to get this is to develop it yourself... Regards Daniel --

Re: Several in same rule?

2016-06-28 Thread Dominique Pellé
Daniel Naber wrote: > On 2015-10-11 11:58, Dominique Pellé wrote: > >> Would be possible to allow for several tags >> in the same rule? > > I don't think it's very difficult. I could put it on my TODO list, but I > cannot make any promises about when I have time for this. > > Regards > Daniel

Re: I want become a maintainer in PT-BR

2016-06-27 Thread Jan Schreiber
Hi Matheus, Just in case you are struggling with git (as I do), here is a helpful basic guide: http://rogerdudler.github.io/git-guide/ My advice, for what it's worth: As a first step, take a look at the grammar.xml file for European Portuguese or any other language that you happen to be familiar

Re: bulk corrections in Wikipedia using LT

2016-06-27 Thread Jan Schreiber
Hi Jaume, This is so great, thank you! I have been dreaming about a bot that can make the process of correcting Wikipedia semi-automatic for years. As they say, I just never got around to writing it because I had too much on my plate. --Jan Am 27.06.2016 um 14:27 schrieb Jaume Ortolà i Font: > F

Re: I want become a maintainer in PT-BR

2016-06-27 Thread Daniel Naber
On 2016-06-27 19:01, Matheus Poletto wrote: Hi Matheus, >> I found some posts saying you need maintainer for my language PT-BR >> and i love the ideia of becomming one. What i need do to become >> this? that's great, welcome to LanguageTool! Most maintainers spend their time writing new error d

Re: bulk corrections in Wikipedia using LT

2016-06-27 Thread Jaume Ortolà i Font
2016-06-27 16:12 GMT+02:00 Mike Unwalla : > > [2] > https://github.com/jaumeortola/cawiki-roofreading/blob/master/examples/example_Spanish.txt > > I get a 404 not found message. > Sorry, I deleted a character inadvertently. Try this: https://github.com/jaumeortola/cawiki-proofreading/blob/master

RE: bulk corrections in Wikipedia using LT

2016-06-27 Thread Mike Unwalla
> [2] > https://github.com/jaumeortola/cawiki-roofreading/blob/master/examples/example_Spanish.txt I get a 404 not found message. -Original Message- From: Jaume Ortolà i Font [mailto:jaumeort...@gmail.com] Sent: 27 June 2016 13:27 To: languagetool-devel Subject: bulk corrections in Wiki

RE: Suggestion for English rule - masters thesis > master's thesis

2016-06-23 Thread Mike Unwalla
master's master's thesis." Unless I get an objection in the next few days, I will make 2 rules, one for AmE and one for BrE. Regards, Mike Unwalla Contact: www.techscribe.co.uk/techw/contact.htm -Original Message- From: Purodha Blissenbach [mailto:puro...@blissenbach.

Re: Unverified OpenPGP signature in LT mailing list

2016-06-23 Thread Daniel Naber
On 2016-06-23 09:22, Marco A.G.Pinto wrote: > Daniel, is the software being used Mailman? If so, can it be updated? Yes, it's mailman but it can't be updated as it's not hosted by us. Anyway, I'd like to move to discourse (http://discourse.org), but that will only be possible when they provide

Re: Suggestion for English rule - masters thesis > master's thesis

2016-06-22 Thread Purodha Blissenbach
Ouch! Nick, you put it the right way. That is what I wanted to write, but I messed it up. I'm sorry. Puroda On 22.06.2016 23:50, Nick Hough wrote: > It would be the other way around: > > If you wrote it : "my masters thesis”, because it is the thesis from > your “masters" degree > If your master

Re: Suggestion for English rule - masters thesis > master's thesis

2016-06-22 Thread Nick Hough
It would be the other way around: If you wrote it : "my masters thesis”, because it is the thesis from your “masters" degree If your master wrote it : "my master’s thesis”, because the thesis is owned (hence the possessive apostrophe) by the “master" Both could be valid English. Which version i

Re: Suggestion for English rule - masters thesis > master's thesis

2016-06-22 Thread Purodha Blissenbach
Imho, it depends. If you wrote it : "my master's thesis" If your master wrote it : "my masters thesis" but I am not a native English speaker :-) Purodha On 22.06.2016 18:33, Marco A.G.Pinto wrote: > Hello! > > Using MS Word 2016 I typed "blah blah my masters thesis" and Word > suggested me to re

RE: English rule: "Involved on/in"

2016-06-20 Thread Daniel Naber
On 2016-06-20 11:28, Mike Unwalla wrote: > grammar.xml file. Daniel, Marcin, and others, what is the design > principle of LT that results in some rules being Java rules? Some things aren't possible in XML rules, like reading external files or doing calculations, so these need to be implemented

RE: English rule: "Involved on/in"

2016-06-20 Thread Mike Unwalla
Although 'involved on' is not so common, I found 168 instances on COCA (http://corpus.byu.edu/coca/). The problem is semantic. Thus, a grammar rule can find all instances of the terms, but it cannot find only the misused terms. But, all words in English *could be* misused. So, if we start to in

Re: English rule: "Involved on/in"

2016-06-20 Thread Adrián Chaves Fernández
I believe "involved in" is the right phrasing. There is also "involved with", but for a different context. And "involved on" may be OK in very specific contexts, I guess, but "involved in" will be the right choice in most of them. I am not sure whether or not you can write a grammar rule that forc

RE: English rule: "Involved on/in"

2016-06-19 Thread curon
On 18 Jun 2016 2:27 p.m., Mike Unwalla wrote: > The phrasal verb 'involve in' is common in English. It is not an error. Refer > to www.oxfordlearnersdictionaries.com/definition/english/involve-in. Further to this, 'in' is the dependent preposition of 'involve'. Lists of dependent prepositions c

RE: English rule: "Involved on/in"

2016-06-18 Thread Mike Unwalla
Hi Marco, The phrasal verb 'involve in' is common in English. It is not an error. Refer to www.oxfordlearnersdictionaries.com/definition/english/involve-in. The second definition from Macmillan Phrasal Verbs Plus is "to take part in something that other people are doing". Regards, Mike Unwall

  1   2   3   4   5   6   7   8   9   10   >