Hi, for English and German LT can use ngram data to detect errors statistically. This ngram data is stored in a Lucene index that we provide[1]. I've now added an implementation that uses BerkeleyLM files[2,3]. The advantage is that people who use BerkeleyLM language models anyway - maybe trained on their own data - can now use it also for LT. It also makes testing for us easier, comparing our own implementation with BerkeleyLM. I'm planning to run comparisons in the future.
To use BerkeleyLM you have to overwrite getLanguageModel() in your language class, so for now this is something for users who use LT via its API. Regards Daniel [1] http://wiki.languagetool.org/finding-errors-using-n-gram-data [2] https://github.com/adampauls/berkeleylm [3] http://tomato.banatao.berkeley.edu:8080/berkeleylm_binaries/ ------------------------------------------------------------------------------ Go from Idea to Many App Stores Faster with Intel(R) XDK Give your users amazing mobile app experiences with Intel(R) XDK. Use one codebase in this all-in-one HTML5 development environment. Design, debug & build mobile apps & 2D/3D high-impact games for multiple OSs. http://pubads.g.doubleclick.net/gampad/clk?id=254741911&iu=/4140 _______________________________________________ Languagetool-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/languagetool-devel
