Re: He tried not to laughs.

2013-10-03 Thread Ruud Baars
Most words have multiple possible postags. To determine which one is the 
actual one in this sentence, the disambiguator looks at the words around 
it, and tries to make a decision.

The code for disambiguation is in disambiguation.xml in the resource 
directory. It is very much like the LT rules.
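
To make the idea concrete, here is a toy sketch in Python. It is not LanguageTool's code (the real rules are XML in disambiguation.xml); the lexicon, the two context rules and the function names are all invented for illustration. Every word first gets all the tags a dictionary lists for it, and a context rule then removes the readings that do not fit.

```python
# Toy illustration only; the lexicon and the rules are invented for this example.
LEXICON = {
    "the": {"DT"},
    "he": {"PRP"},
    "laughs": {"NNS", "VBZ"},   # plural noun or 3rd-person singular verb
}

def tag(words):
    """Give every word all the tags the lexicon lists for it."""
    return [(w, set(LEXICON.get(w.lower(), {"UNKNOWN"}))) for w in words]

def disambiguate(tagged):
    """Use the previous word's tags to drop readings that do not fit."""
    result = []
    for i, (word, tags) in enumerate(tagged):
        prev_tags = tagged[i - 1][1] if i > 0 else set()
        if len(tags) > 1:
            if "DT" in prev_tags:        # after a determiner: drop the verb reading
                tags = tags - {"VBZ"}
            elif "PRP" in prev_tags:     # after a personal pronoun: drop the noun reading
                tags = tags - {"NNS"}
        result.append((word, tags))
    return result

print(disambiguate(tag(["he", "laughs"])))
print(disambiguate(tag(["the", "laughs"])))
```

Real rules have to be far more careful than this, since a wrong decision here hides the word from every later grammar rule.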

Ruud

On 04-10-13 05:00, Kumara Bhikkhu wrote:
> Disambiguator. I don't even know what that is.
> Never mind. Thanks.
>
> I'll add rules for "not to (verb)" and "(verb) to
> (verb)", and see how that goes.
>
> kb
>


--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: He tried not to laughs.

2013-10-03 Thread Kumara Bhikkhu
Disambiguator. I don't even know what that is.
Never mind. Thanks.

I'll add rules for "not to (verb)" and "(verb) to 
(verb)", and see how that goes.

kb

Marcin Miłkowski wrote thus at 06:39 PM 03-10-13:
>On 2013-10-03 11:43, Kumara Bhikkhu wrote:
>> Marcin Miłkowski wrote thus at 04:15 PM 03-10-13:
>>> Hi,
>>>
>>> On 2013-10-03 06:24, Kumara Bhikkhu wrote:
>>>> Can the one who created this contact me personally? It's not
>>>> triggering "He tried not to laugh_s_."
>>>> I don't know how to correct it.
>>>
>>> I'll write this on the list -- "laughs" is also plural of "laugh",
>>> which is excluded by the exception below (NNS). Unfortunately,
>>> without this exception, a lot of false alarms are found.
>>
>> I thought so.
>>
>>> Now, maybe we could have a second variant of the rule that takes
>>> "not" + "to", and then the exception would not be required. This
>>> would have to be tested on a large corpus.
>>
>> Any way of indicating verbs but excepting those that are also nouns?
>
>No, it's not possible unless you have a perfect rule in the
>disambiguator for this.
>
>Best,
>Marcin




Re: Modules for individual supported languages?

2013-10-03 Thread Ruud Baars
Why give a special position to English?

Maybe LT as a plug-in, with the individual languages as plug-ins on top of that?

Ruud

On 03-10-13 21:24, Jan Schreiber wrote:
> Somebody by the name of Łukasz Janik posted this to our Facebook wall:
>
>   prosze kazdy jezyk jako osobno
>
> I don't speak a single word of Polish, but according to Google
> Translator, this is a feature request to release single-language
> versions of LT. (Google and I might be wrong here of course.) ;-)
>
> I tend to agree with him. Given the fact that the vast majority of
> people probably don't actively use more than three languages, we're
> imposing a huge overhead on our users.
>
> We've discussed this before, but I'm not sure what the outcome was. I
> think the ideal solution would be if the users could configure the
> languages they want before downloading. If that is not possible, there
> should be a clean way to remove unwanted languages during or after
> installation.
>
> Maybe we could have a two-step download: In the first step, you download
> the main app, perhaps with English already on board. During install, you
> can choose whatever other languages you may need.
>
> --Jan
>




Modules for individual supported languages?

2013-10-03 Thread Jan Schreiber
Somebody by the name of Łukasz Janik posted this to our Facebook wall:

prosze kazdy jezyk jako osobno

I don't speak a single word of Polish, but according to Google
Translator, this is a feature request to release single-language
versions of LT. (Google and I might be wrong here of course.) ;-)

I tend to agree with him. Given the fact that the vast majority of
people probably don't actively use more than three languages, we're
imposing a huge overhead on our users.

We've discussed this before, but I'm not sure what the outcome was. I
think the ideal solution would be if the users could configure the
languages they want before downloading. If that is not possible, there
should be a clean way to remove unwanted languages during or after
installation.

Maybe we could have a two-step download: In the first step, you download
the main app, perhaps with English already on board. During install, you
can choose whatever other languages you may need.

--Jan



testing 2.3.1 maven artifacts

2013-10-03 Thread Daniel Naber
Hi,

I have prepared a 2.3.1 release for Maven Central. It's the same as 2.3 
plus Stefan's fixes for the multi-threading problem.

As a developer, you can test the artifacts by adding this to your 
pom.xml:

 
 
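  <repositories>
    <repository>
      <id>sonatypestaging</id>
      <name>Sonatype Staging</name>
      <releases>
        <enabled>true</enabled>
        <updatePolicy>always</updatePolicy>
        <checksumPolicy>warn</checksumPolicy>
      </releases>
      <snapshots>
        <enabled>false</enabled>
        <updatePolicy>never</updatePolicy>
        <checksumPolicy>fail</checksumPolicy>
      </snapshots>
      <url>https://oss.sonatype.org/content/repositories/orglanguagetool-1003/</url>
      <layout>default</layout>
    </repository>
  </repositories>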

If you give it a try, please let me know of the results. I'd like to 
release 2.3.1 for Maven Central tomorrow night.

Regards
  Daniel

-- 
http://www.danielnaber.de




Re: He tried not to laughs.

2013-10-03 Thread Marcin Miłkowski
On 2013-10-03 12:39, Marcin Miłkowski wrote:
> On 2013-10-03 11:43, Kumara Bhikkhu wrote:
>> Marcin Miłkowski wrote thus at 04:15 PM 03-10-13:
>>> Hi,
>>>
>>> On 2013-10-03 06:24, Kumara Bhikkhu wrote:
>>>> Can the one who created this contact me personally? It's not triggering
>>>> "He tried not to laugh_s_."
>>>> I don't know how to correct it.
>>>
>>> I'll write this on the list -- "laughs" is also plural of "laugh", which
>>> is excluded by the exception below (NNS). Unfortunately, without this
>>> exception, a lot of false alarms are found.
>>
>> I thought so.
>>
>>
>>> Now, maybe we could have a second variant of the rule that takes "not"
>>> +"to", and then the exception would not be required. This would have to
>>> be tested on a large corpus.
>>
>> Any way of indicating verbs but excepting those that are also nouns?
>
> No, it's not possible unless you have a perfect rule in the
> disambiguator for this.

Which I added, so that your case is covered :)

The rule is very conservative: it just looks at "try" (+ "not") + "to" + 
VBZ/NNS. More verbs should be added to the list, instead of just "try". 
But I had no time to make such a list. If you want, create one, and I'll 
add it to the disambiguation rule.
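
A rough Python sketch of what such a conservative rule does, for readers who have not seen the disambiguator. It assumes the rule keeps the verb reading and drops NNS; the real rule is XML in disambiguation.xml, and the trigger list and function name below are hypothetical.

```python
# Hypothetical sketch; not the actual rule from disambiguation.xml.
TRIGGER_VERBS = {"try", "tries", "tried", "trying"}  # more verbs could be added here

def disambiguate_after_try_to(tokens):
    """tokens: list of (word, set_of_tags).

    Returns a new list in which a VBZ/NNS-ambiguous word that follows
    "try (not) to" keeps only its VBZ reading.
    """
    out = [(w, set(t)) for w, t in tokens]
    for i, (word, _) in enumerate(out):
        if word.lower() not in TRIGGER_VERBS:
            continue
        j = i + 1
        if j < len(out) and out[j][0].lower() == "not":   # optional "not"
            j += 1
        if j < len(out) and out[j][0].lower() == "to":
            j += 1
            if j < len(out) and {"VBZ", "NNS"} <= out[j][1]:
                out[j] = (out[j][0], out[j][1] - {"NNS"})
    return out

sentence = [("He", {"PRP"}), ("tried", {"VBD", "VBN"}), ("not", {"RB"}),
            ("to", {"TO"}), ("laughs", {"VBZ", "NNS"})]
print(disambiguate_after_try_to(sentence)[-1])
```

Once the NNS reading is gone, a grammar rule with an NNS exception is no longer blocked for that word. Extending the rule to other verbs would only mean growing the trigger list.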

Best,
Marcin



Re: He tried not to laughs.

2013-10-03 Thread Marcin Miłkowski
On 2013-10-03 11:43, Kumara Bhikkhu wrote:
> Marcin Miłkowski wrote thus at 04:15 PM 03-10-13:
>> Hi,
>>
>> On 2013-10-03 06:24, Kumara Bhikkhu wrote:
>>> Can the one who created this contact me personally? It's not triggering
>>> "He tried not to laugh_s_."
>>> I don't know how to correct it.
>>
>> I'll write this on the list -- "laughs" is also plural of "laugh", which
>> is excluded by the exception below (NNS). Unfortunately, without this
>> exception, a lot of false alarms are found.
>
> I thought so.
>
>
>> Now, maybe we could have a second variant of the rule that takes "not"
>> +"to", and then the exception would not be required. This would have to
>> be tested on a large corpus.
>
> Any way of indicating verbs but excepting those that are also nouns?

No, it's not possible unless you have a perfect rule in the 
disambiguator for this.

Best,
Marcin



Re: Postag question

2013-10-03 Thread Kumara Bhikkhu
Marcin Miłkowski wrote thus at 04:13 PM 03-10-13:
>I think I did write some rules for VBP. Anyway, there is at least one
>difference between VB and VBP -- for "be".

Ahh. You mean like this?
VB: be
VBP: am, are

Anyway, I suggest adding these to resource/en/tagset.txt to distinguish them.

kb 




Re: He tried not to laughs.

2013-10-03 Thread Kumara Bhikkhu
Marcin Miłkowski wrote thus at 04:15 PM 03-10-13:
>Hi,
>
>On 2013-10-03 06:24, Kumara Bhikkhu wrote:
> > Can the one who created this contact me personally? It's not triggering
> > "He tried not to laugh_s_."
> > I don't know how to correct it.
>
>I'll write this on the list -- "laughs" is also plural of "laugh", which
>is excluded by the exception below (NNS). Unfortunately, without this
>exception, a lot of false alarms are found.

I thought so.


>Now, maybe we could have a second variant of the rule that takes "not"
>+"to", and then the exception would not be required. This would have to
>be tested on a large corpus.

Any way of indicating verbs but excepting those that are also nouns?

kb




Re: He tried not to laughs.

2013-10-03 Thread Marcin Miłkowski
Hi,

On 2013-10-03 06:24, Kumara Bhikkhu wrote:
> Can the one who created this contact me personally? It's not triggering
> "He tried not to laugh_s_."
> I don't know how to correct it.

I'll write this on the list -- "laughs" is also plural of "laugh", which 
is excluded by the exception below (NNS). Unfortunately, without this 
exception, a lot of false alarms are found.

Now, maybe we could have a second variant of the rule that takes "not" 
+"to", and then the exception would not be required. This would have to 
be tested on a large corpus.
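
As a toy Python sketch of why the sentence slips through: the function below mimics a "to" + VBZ pattern with an exception for noun readings. It is illustrative only. The real rule is XML, and the assumption that its second token is tagged VBZ, along with the function name and the sample tags, is mine.

```python
# Illustrative only; mirrors the idea of the rule, not its XML implementation.
EXCEPTION_TAGS = {"NNS", "NNP", "NNPS"}

def base_form_rule_matches(tokens, use_exception=True):
    """tokens: list of (word, set_of_tags).

    Return the words flagged by a "to + VBZ" pattern, optionally
    skipping words that also have one of the excepted noun tags.
    """
    flagged = []
    for (prev, _), (word, tags) in zip(tokens, tokens[1:]):
        if prev.lower() != "to" or "VBZ" not in tags:
            continue
        if use_exception and tags & EXCEPTION_TAGS:
            continue  # the word could be a noun, so the rule stays silent
        flagged.append(word)
    return flagged

learns = [("to", {"TO"}), ("learns", {"VBZ"})]
laughs = [("to", {"TO"}), ("laughs", {"VBZ", "NNS"})]
talks = [("listens", {"VBZ"}), ("to", {"TO"}), ("talks", {"VBZ", "NNS"})]

print(base_form_rule_matches(learns))                       # flagged
print(base_form_rule_matches(laughs))                       # missed because of NNS
print(base_form_rule_matches(talks, use_exception=False))   # false alarm without the exception
```

The last call shows the trade-off: dropping the exception catches "to laughs" but also flags correct phrases such as "listens to talks".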

Regards,
Marcin
>
>
> 
> 
> Pattern token: to
> Exception (remnant): is postag="NNS|NNP|NNPS" postag_regexp="yes">
> Message: You might need to use the base form of the verb here: .
> Short message: Grammatical problem
> Correct example: I was surprised to learn this.
> Correct example: He spoke to chosen people.
> Incorrect example: I was surprised to learns this.
>
>




Re: Postag question

2013-10-03 Thread Marcin Miłkowski
On 2013-10-03 08:18, Daniel Naber wrote:
> On 2013-10-03 05:50, Kumara Bhikkhu wrote:
>
>> What's the difference between these two?
>> VB    Verb, base form: eat, jump, believe
>> VBP   Verb, non-3rd ps. sing. present: eat, jump, believe
>
> It's like "eat" in "he had to eat" vs. "I eat" - i.e. the form looks the
> same, but still in one case it's a base form, in the other an inflected
> form that just happens to look like the base form. Anyway, it may well
> be that all VBs are VBPs at the same time because we just look up words
> in a dictionary and give them all their possible tags. LanguageTool can
> only see the difference between the two if someone has written a
> disambiguation rule for that in disambiguation.xml.

I think I did write some rules for VBP. Anyway, there is at least one 
difference between VB and VBP -- for "be".
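
A small Python sketch of the point, under stated assumptions: the mini-dictionary and the single context rule below are made up for illustration and are not LanguageTool data or code. A plain lookup returns both tags for "eat", and only a context rule can choose between them, whereas "be", "am" and "are" are already distinct in the dictionary.

```python
# Hypothetical mini-dictionary; the entries are examples, not LanguageTool data.
TAGS = {
    "eat": {"VB", "VBP"},
    "jump": {"VB", "VBP"},
    "be": {"VB"},
    "am": {"VBP"},
    "are": {"VBP"},
}

def lookup(word):
    """Return every tag listed for the word, without looking at context."""
    return TAGS.get(word.lower(), set())

def pick_reading(prev_word, word):
    """One deliberately crude context rule: after "to" keep VB, otherwise VBP."""
    tags = lookup(word)
    if {"VB", "VBP"} <= tags:
        return {"VB"} if prev_word.lower() == "to" else {"VBP"}
    return tags

print(sorted(lookup("eat")))
print(pick_reading("to", "eat"), pick_reading("I", "eat"))
print(lookup("be"), lookup("are"))
```

The "otherwise VBP" branch is wrong for cases like "will eat"; it only shows where a disambiguation rule would have to step in.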

Regards,
Marcin


