I could, If I were able to code. I only do things on the XML level.
Ruud > In UkrainianWordTokenizer.java I am replacing "Unicode apostrophes" > U+2019 and U+02BC into old good single quote (') to unify all apostrophe > handling. If Dutch case is similar you could borrow this code. > > Andriy > > On 09/02/2014 08:11 AM, R.J. Baars wrote: >> The Dutch tokenizer is a little bit different from thet otheres, because >> of words with a ' in it. >> >> That works fine, unless the text does not have a ', but a , which >> happens quite often. >> >> Since I am not able to edit the java program (little knowledge), could >> someone have a look at this please? >> >> Ruud >> >> >> ------------------------------------------------------------------------------ >> Slashdot TV. >> Video for Nerds. Stuff that matters. >> http://tv.slashdot.org/ >> _______________________________________________ >> Languagetool-devel mailing list >> Languagetool-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/languagetool-devel > > > ------------------------------------------------------------------------------ > Slashdot TV. > Video for Nerds. Stuff that matters. > http://tv.slashdot.org/ > _______________________________________________ > Languagetool-devel mailing list > Languagetool-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/languagetool-devel > ------------------------------------------------------------------------------ Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/ _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel