> There are other issues where I could need help from an expert. For
> example, results don't get better when we use 4grams instead of 3grams.
I think this is a general property of shingles (n-grams) over any kind of data: as you increase their length, you also increase the sparsity of the model space, since each longer sequence occurs less often in the same amount of data. People typically work with bigrams or trigrams; going beyond that rarely improves the precision of a model.

Dawid

_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel