Re: He tried not to laughs.
Most words have more than one possible POS tag. To determine which one actually applies in a given sentence, the disambiguator looks at the words around it and tries to make a decision. The code for disambiguation is in disambiguation.xml in the resource directory. It is very much like the LT rules.

Ruud

On 04-10-13 05:00, Kumara Bhikkhu wrote:
> Disambiguator. I don't even know what that is.
> Never mind. Thanks.
>
> I'll add rules for "not to (verb)" and "(verb) to
> (verb)", and see how that goes.
>
> kb
>
> Marcin Miłkowski wrote thus at 06:39 PM 03-10-13:
>> No, it's not possible unless you have a perfect rule in
>> the disambiguator for this.

--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
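To make the idea of a disambiguator concrete, here is a minimal Python sketch of what is described above: a word gets all of its possible tags from a lookup, and a context rule then narrows them by looking at the neighbouring word. The tiny lexicon, the function names, and the two context rules are assumptions made up for illustration; LanguageTool's real rules live in disambiguation.xml and are written in XML, not Python.

```python
# Toy lexicon: every word maps to ALL of its possible POS tags.
# Entries and rules are illustrative assumptions, not LanguageTool data.
LEXICON = {
    "the": {"DT"},
    "he": {"PRP"},
    "loud": {"JJ"},
    "laughs": {"NNS", "VBZ"},  # plural noun or 3rd person singular verb
}


def tag(words):
    """Look up every word and return (word, candidate_tags) pairs."""
    return [(w, set(LEXICON.get(w.lower(), {"UNKNOWN"}))) for w in words]


def disambiguate(tagged):
    """Narrow an ambiguous NNS/VBZ reading by looking at the previous word."""
    result = []
    for i, (word, tags) in enumerate(tagged):
        prev_tags = tagged[i - 1][1] if i > 0 else set()
        if tags == {"NNS", "VBZ"}:
            if prev_tags & {"DT", "JJ"}:
                tags = {"NNS"}  # "the laughs", "loud laughs": noun reading
            elif "PRP" in prev_tags:
                tags = {"VBZ"}  # "he laughs": verb reading
        result.append((word, tags))
    return result


if __name__ == "__main__":
    print(disambiguate(tag(["He", "laughs"])))
    print(disambiguate(tag(["the", "laughs"])))
```

Without a matching context rule the word simply keeps all of its readings, which is why a grammar rule that depends on one specific tag can stay silent.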
Re: He tried not to laughs.
Disambiguator. I don't even know what that is.
Never mind. Thanks.

I'll add rules for "not to (verb)" and "(verb) to (verb)", and see how that goes.

kb

Marcin Miłkowski wrote thus at 06:39 PM 03-10-13:
>On 2013-10-03 11:43, Kumara Bhikkhu wrote:
>> Any way of indicating verbs but excepting those that are also nouns?
>
>No, it's not possible unless you have a perfect rule in
>the disambiguator for this.
>
>Best,
>Marcin
Re: Modules for individual supported languages?
Why give a special position to English? LT as a plug-in, with the individual languages as plug-ins on top of that, maybe?

Ruud

On 03-10-13 21:24, Jan Schreiber wrote:
> Maybe we could have a two-step download: In the first step, you download
> the main app, perhaps with English already on board. During install, you
> can choose whatever other languages you may need.
>
> --Jan
Modules for individual supported languages?
Somebody by the name of Łukasz Janik posted this to our Facebook wall:

    prosze kazdy jezyk jako osobno

I don't speak a single word of Polish, but according to Google Translate, this is a feature request to release single-language versions of LT. (Google and I might be wrong here of course.) ;-)

I tend to agree with him. Given that the vast majority of people probably don't actively use more than three languages, we're imposing a huge overhead on our users.

We've discussed this before, but I'm not sure what the outcome was. I think the ideal solution would be if users could configure the languages they want before downloading. If that is not possible, there should be a clean way to remove unwanted languages during or after installation.

Maybe we could have a two-step download: In the first step, you download the main app, perhaps with English already on board. During install, you can choose whatever other languages you may need.

--Jan
testing 2.3.1 maven artifacts
Hi,

I have prepared a 2.3.1 release for Maven Central. It's the same as 2.3 plus Stefan's fixes for the multi-threading problem. As a developer, you can test the artifacts by adding this to your pom.xml:

    <repositories>
      <repository>
        <id>sonatypestaging</id>
        <name>Sonatype Staging</name>
        <releases>
          <enabled>true</enabled>
          <updatePolicy>always</updatePolicy>
          <checksumPolicy>warn</checksumPolicy>
        </releases>
        <snapshots>
          <enabled>false</enabled>
          <updatePolicy>never</updatePolicy>
          <checksumPolicy>fail</checksumPolicy>
        </snapshots>
        <url>https://oss.sonatype.org/content/repositories/orglanguagetool-1003/</url>
        <layout>default</layout>
      </repository>
    </repositories>

If you give it a try, please let me know the results. I'd like to release 2.3.1 to Maven Central tomorrow night.

Regards
Daniel

--
http://www.danielnaber.de
Re: He tried not to laughs.
On 2013-10-03 12:39, Marcin Miłkowski wrote:
> On 2013-10-03 11:43, Kumara Bhikkhu wrote:
>> Any way of indicating verbs but excepting those that are also nouns?
>
> No, it's not possible unless you have a perfect rule in the
> disambiguator for this.

Which I added, so that your case is covered :)

The rule is very conservative: it just looks at "try" (+ "not") + "to" + VBZ/NNS. More verbs should be added to the list, instead of just "try", but I had no time to make such a list. If you want, create one, and I'll add it to the disambiguation rule.

Best,
Marcin
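The pattern described above ("try" (+ "not") + "to" + VBZ/NNS) can be sketched in Python roughly as follows. This is only an illustration: the token dictionaries, the function name, and the `TRIGGER_LEMMAS` set are hypothetical, and the sketch assumes that the rule's effect is to keep only the VBZ reading, so that the NNS exception in the grammar rule no longer blocks a match. The actual rule is XML in disambiguation.xml.

```python
# Verbs that license the pattern; the thread mentions only "try" so far.
TRIGGER_LEMMAS = {"try"}


def disambiguate_try_to(tokens):
    """Keep only VBZ for a VBZ/NNS word after "try" (+ "not") + "to".

    tokens: list of dicts with 'word', 'lemma' and 'tags' (a set).
    This token shape is an assumption made for the sketch.
    Returns a new list; the input is left untouched.
    """
    out = [dict(t, tags=set(t["tags"])) for t in tokens]
    for i, tok in enumerate(out):
        if tok["lemma"] not in TRIGGER_LEMMAS:
            continue
        j = i + 1
        if j < len(out) and out[j]["word"].lower() == "not":
            j += 1  # the "not" is optional
        if j >= len(out) or out[j]["word"].lower() != "to":
            continue
        j += 1
        if j < len(out) and {"VBZ", "NNS"} <= out[j]["tags"]:
            out[j]["tags"] = {"VBZ"}  # drop the plural-noun reading
    return out


SENTENCE = [
    {"word": "He", "lemma": "he", "tags": {"PRP"}},
    {"word": "tried", "lemma": "try", "tags": {"VBD", "VBN"}},
    {"word": "not", "lemma": "not", "tags": {"RB"}},
    {"word": "to", "lemma": "to", "tags": {"TO"}},
    {"word": "laughs", "lemma": "laugh", "tags": {"VBZ", "NNS"}},
]

if __name__ == "__main__":
    for token in disambiguate_try_to(SENTENCE):
        print(token["word"], sorted(token["tags"]))
```

Extending the rule to more verbs, as suggested above, would in this sketch just mean adding lemmas to `TRIGGER_LEMMAS`.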
Re: He tried not to laughs.
On 2013-10-03 11:43, Kumara Bhikkhu wrote:
> Marcin Miłkowski wrote thus at 04:15 PM 03-10-13:
>> Now, maybe we could have a second variant of the rule that takes "not"
>> +"to", and then the exception would not be required. This would have to
>> be tested on a large corpus.
>
> Any way of indicating verbs but excepting those that are also nouns?

No, it's not possible unless you have a perfect rule in the disambiguator for this.

Best,
Marcin
Re: Postag question
Marcin Miłkowski wrote thus at 04:13 PM 03-10-13:
>I think I did write some rules for VBP. Anyway, there is at least one
>difference between VB and VBP -- for "be".

Ahh. You mean like this?

VB: be
VBP: am, are

Anyway, I suggest adding these to resource/en/tagset.txt to distinguish them.

kb
Re: He tried not to laughs.
Marcin Miłkowski wrote thus at 04:15 PM 03-10-13:
>Hi,
>
>On 2013-10-03 06:24, Kumara Bhikkhu wrote:
> > Can the one who created this contact me personally? It's not triggering
> > "He tried not to laugh_s_."
> > I don't know how to correct it.
>
>I'll write this on the list -- "laughs" is also plural of "laugh", which
>is excluded by the exception below (NNS). Unfortunately, without this
>exception, a lot of false alarms are found.

I thought so.

>Now, maybe we could have a second variant of the rule that takes "not"
>+"to", and then the exception would not be required. This would have to
>be tested on a large corpus.

Any way of indicating verbs but excepting those that are also nouns?

kb
Re: He tried not to laughs.
Hi,

On 2013-10-03 06:24, Kumara Bhikkhu wrote:
> Can the one who created this contact me personally? It's not triggering
> "He tried not to laugh_s_."
> I don't know how to correct it.

I'll write this on the list -- "laughs" is also the plural of "laugh", which is excluded by the exception below (NNS). Unfortunately, without this exception, a lot of false alarms are found.

Now, maybe we could have a second variant of the rule that takes "not" + "to", and then the exception would not be required. This would have to be tested on a large corpus.

Regards,
Marcin

> to
> is postag="NNS|NNP|NNPS" postag_regexp="yes">
> You might need to use the base form of the verb here: .
> Grammatical problem
> I was surprised to learn this.
> He spoke to chosen people.
> I was surprised to learns this.
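The effect of the exception can be illustrated with a small Python sketch. It assumes the rule matches "to" followed by a word that has a VBZ reading (the VBZ part is an assumption, since the rule's markup did not survive in the quoted text) and skips any word that also has a reading matching NNS|NNP|NNPS. The function name and the hand-written tag sets are hypothetical.

```python
import re

# The exception from the quoted rule: postag="NNS|NNP|NNPS" as a regexp.
EXCEPTION = re.compile(r"NNS|NNP|NNPS")


def base_form_errors(tokens):
    """Return indexes of words flagged by a "to" + VBZ check.

    tokens: list of (word, set_of_tags) pairs.
    A word is skipped if ANY of its readings matches the exception,
    which is exactly why an ambiguous word like "laughs" is not flagged.
    """
    hits = []
    for i in range(1, len(tokens)):
        prev_word = tokens[i - 1][0]
        _word, tags = tokens[i]
        if prev_word.lower() != "to" or "VBZ" not in tags:
            continue
        if any(EXCEPTION.fullmatch(t) for t in tags):
            continue  # also a noun reading: stay silent
        hits.append(i)
    return hits


LEARNS = [("I", {"PRP"}), ("was", {"VBD"}), ("surprised", {"VBN", "JJ"}),
          ("to", {"TO"}), ("learns", {"VBZ"}), ("this", {"DT"})]
LAUGHS = [("He", {"PRP"}), ("tried", {"VBD", "VBN"}), ("not", {"RB"}),
          ("to", {"TO"}), ("laughs", {"VBZ", "NNS"})]

if __name__ == "__main__":
    print(base_form_errors(LEARNS))  # "learns" has no noun reading
    print(base_form_errors(LAUGHS))  # "laughs" is also NNS
```

Removing the exception would flag "laughs", but it would also flag every plural noun after "to" that happens to look like a verb, which matches the false alarms mentioned above.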
Re: Postag question
On 2013-10-03 08:18, Daniel Naber wrote:
> On 2013-10-03 05:50, Kumara Bhikkhu wrote:
>
>> What's the difference between these two?
>> VB   Verb, base form: eat, jump, believe
>> VBP  Verb, non-3rd ps. sing. present: eat, jump, believe
>
> It's like "eat" in "he had to eat" vs. "I eat" - i.e. the form looks the
> same, but still in one case it's a base form, in the other an inflected
> form that just happens to look like the base form. Anyway, it may well
> be that all VBs are VBPs at the same time because we just look up words
> in a dictionary and give them all their possible tags. LanguageTool can
> only see the difference between the two if someone has written a
> disambiguation rule for that in disambiguation.xml.

I think I did write some rules for VBP. Anyway, there is at least one difference between VB and VBP -- for "be".

Regards,
Marcin
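The dictionary-lookup point can be shown with a few lines of Python. The lexicon below is a hand-written assumption covering only the words mentioned in this thread, not LanguageTool's actual dictionary, and the function names are made up for the sketch.

```python
# Hand-written toy lexicon; the real tagger dictionary is far larger.
LEXICON = {
    "eat": {"VB", "VBP"},
    "jump": {"VB", "VBP"},
    "believe": {"VB", "VBP"},
    "be": {"VB"},    # base form only
    "am": {"VBP"},   # non-3rd person singular present only
    "are": {"VBP"},
}


def readings(word):
    """Return every tag the lexicon lists for a word, sorted for display."""
    return sorted(LEXICON.get(word.lower(), set()))


def distinguishes_vb_vbp(word):
    """True if the word has exactly one of the two readings VB / VBP."""
    tags = LEXICON.get(word.lower(), set())
    return ("VB" in tags) != ("VBP" in tags)


if __name__ == "__main__":
    for w in ("eat", "be", "am", "are"):
        print(w, readings(w), distinguishes_vb_vbp(w))
```

For a word like "eat" the lookup alone cannot tell the two readings apart, so only a rule that looks at context (as in "he had to eat" vs. "I eat") can pick one.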