support for BerkeleyLM language models

Daniel Naber Sat, 05 Dec 2015 08:03:37 -0800

Hi,

for English and German LT can use ngram data to detect errors 
statistically. This ngram data is stored in a Lucene index that we 
provide[1]. I've now added an implementation that uses BerkeleyLM 
files[2,3]. The advantage is that people who use BerkeleyLM language 
models anyway - maybe trained on their own data - can now use it also 
for LT. It also makes testing for us easier, comparing our own 
implementation with BerkeleyLM. I'm planning to run comparisons in the 
future.


To use BerkeleyLM you have to overwrite getLanguageModel() in your 
language class, so for now this is something for users who use LT via 
its API.

Regards
  Daniel

[1] http://wiki.languagetool.org/finding-errors-using-n-gram-data
[2] https://github.com/adampauls/berkeleylm
[3] http://tomato.banatao.berkeley.edu:8080/berkeleylm_binaries/


------------------------------------------------------------------------------
Go from Idea to Many App Stores Faster with Intel(R) XDK
Give your users amazing mobile app experiences with Intel(R) XDK.
Use one codebase in this all-in-one HTML5 development environment.
Design, debug & build mobile apps & 2D/3D high-impact games for multiple OSs.
http://pubads.g.doubleclick.net/gampad/clk?id=254741911&iu=/4140
_______________________________________________
Languagetool-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

support for BerkeleyLM language models

Reply via email to