Bugs item #3524295, was opened at 2012-05-07 01:08
Message generated for change (Comment added) made by janschreiber
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=655717&aid=3524295&group_id=110216

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Problem with German
Group: development version
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Jan Schreiber (janschreiber)
Assigned to: Daniel Naber (dnaber)
Summary: [de] false positives of DE_AGREEMENT rule

Initial Comment:
I attach some false positives in a text file. Most conspicuously, the first two 
or three examples seem to show that 'wir' is treated as an article rather than 
a pronoun by this rule.

----------------------------------------------------------------------

>Comment By: Jan Schreiber (janschreiber)
Date: 2012-05-07 11:55

Message:
Thanks for the fast responses. I'll download a nightly build later this
week and test it on Wikipedia data.
Hitting the right balance between true and false positives will always be a
problem. Maybe we can cover some of the newly introduced false negatives of
the Java rule by an XML rule?
BTW I'm planning to do a small comparison Hunspell+LanguageTool vs. MS Word
(built-in spell-check and grammar capabilities) vs. Duden Korrektor in the
near future.

----------------------------------------------------------------------

Comment By: Daniel Naber (dnaber)
Date: 2012-05-07 11:22

Message:
I fixed the issue with "wir", but we now miss the error in "Erst recht wir
fleißiges Arbeiter." The rule could use a general rewrite anyway.

----------------------------------------------------------------------

Comment By: gulp21 (gulp21-1)
Date: 2012-05-07 10:04

Message:
I fixed some false positives, but there is stil a problem with "wir",
"denselben", and "diejenigen". These words are tagged as "ALG", which might
be a problem; ll. 416-423 seem to contain code to handle that postag.
PRONOUNS_TO_BE_IGNORED contains "wir", but it is commented out for some
reason. 

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=655717&aid=3524295&group_id=110216

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs

Reply via email to