W dniu 2015-01-13 o 09:37, Daniel Naber pisze: > On 2015-01-13 09:10, Marcin Miłkowski wrote: > >>> I've removed about correct 1000 example sentences for German as they >>> were redundant, i.e. they just repeated the incorrect example and its >>> 'correction' attribute. Unless someone objects for their language, I >>> will do the same for all languages (the cleanup effect will probably >>> be >>> much smaller for most other languages). >> >> I usually use correct examples as sanity (regression) checks, so please >> keep this in mind. > > How exactly do you do that? Do you extract the sentences from the XML > first so you have plain text? In that case, we either keep those > sentences or you would need to change the process a bit that extracts > the sentences (building correct sentences from incorrect example plus > its correction). In Polish, only about 60 sentences would be affected.
Ah, now I get what you meant -- if the sentence is already the same as the one tested with the correction, then it's redundant. Right. I just meant I use examples in grammar files to make sure no false alarms are fired. But this is obvious. Regards, Marcin ------------------------------------------------------------------------------ New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel