I spoke to Ken about using KenLM to train a standard backoff LM with the n-gram corpus. It's not supported yet or recommended.

I'm not sure whether the moses' SRILM wrapper will support the count-based LM. And how much memory it will consume. Try it and please let us know.

People have also been using the Common Crawl corpus to build huge backoff LM. They're very difficult to use as it consumes a lot of memory

On 25/04/2015 20:24, Alla Rozovskaya wrote:
Hello,

I have built an interpolated count-based LM on the Google Web N-gram corpus using SRILM toolkit, as specified here: http://www.speech.sri.com/projects/srilm/manpages/srilm-faq.7.html

Is it possible to use it in moses? In particular, since this model uses count files and a file specifying weights, what is the right way to specify the path in moses.ini?

Thank you,

Alla



_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

--
Hieu Hoang
Researcher
New York University, Abu Dhabi
http://www.hoang.co.uk/hieu

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to