I spoke to Ken about using KenLM to train a standard backoff LM with the
n-gram corpus. It's not supported yet or recommended.
I'm not sure whether the moses' SRILM wrapper will support the
count-based LM. And how much memory it will consume. Try it and please
let us know.
People have also been using the Common Crawl corpus to build huge
backoff LM. They're very difficult to use as it consumes a lot of memory
On 25/04/2015 20:24, Alla Rozovskaya wrote:
Hello,
I have built an interpolated count-based LM on the Google Web N-gram
corpus using SRILM toolkit, as specified here:
http://www.speech.sri.com/projects/srilm/manpages/srilm-faq.7.html
Is it possible to use it in moses? In particular, since this model
uses count files and a file specifying weights, what is the right way
to specify the path in moses.ini?
Thank you,
Alla
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
--
Hieu Hoang
Researcher
New York University, Abu Dhabi
http://www.hoang.co.uk/hieu
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support