Re: [Moses-support] Adding a language model built on Google Web

Marcin Junczys-Dowmunt Tue, 28 Apr 2015 07:16:01 -0700

Hi,

On 28.04.2015 at 16:06, Hieu Hoang wrote:

People have also been using the Common Crawl corpus to build huge backoff LMs. They're very difficult to use because they consume a lot of memory.

That's what I added pruning to KenLM for :) Also, if you combine that with some domain filtering, you get nice models from the Common Crawl data. You might need a couple of TB of free disk space though.
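
For readers unfamiliar with count-based pruning, here is a minimal Python sketch of the idea on a toy corpus. It is an illustration only, not KenLM's implementation: the function names, the toy sentences, and the use of raw counts are all assumptions made for this example. KenLM's `lmplz` exposes pruning through its `--prune` option, which takes one threshold per n-gram order; the sketch mimics that convention by dropping n-grams whose count is less than or equal to the threshold for their order, with the last threshold applying to any higher orders.

```python
from collections import Counter


def count_ngrams(sentences, max_order):
    """Count all n-grams of order 1..max_order in whitespace-tokenized text."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        for order in range(1, max_order + 1):
            for i in range(len(tokens) - order + 1):
                counts[tuple(tokens[i:i + order])] += 1
    return counts


def prune(counts, thresholds):
    """Keep only n-grams whose count exceeds the threshold for their order.

    thresholds[k-1] applies to order k; the last value is reused for any
    higher order. A threshold of 0 means "do not prune this order".
    """
    def threshold_for(order):
        return thresholds[min(order, len(thresholds)) - 1]

    return {ng: c for ng, c in counts.items() if c > threshold_for(len(ng))}


# Hypothetical toy data, just to show the effect.
toy_corpus = ["the cat sat", "the cat ran", "a dog sat"]
counts = count_ngrams(toy_corpus, max_order=3)

# Keep all unigrams and bigrams, drop trigrams seen only once.
pruned = prune(counts, thresholds=[0, 0, 1])
```

On a web-scale corpus most high-order n-grams are singletons, so even a threshold of 1 on the higher orders removes a large share of the entries, which is where the memory savings come from.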

Best,
Marcin

On 25/04/2015 20:24, Alla Rozovskaya wrote:
Hello,
I have built an interpolated count-based LM on the Google Web N-gram corpus using the SRILM toolkit, as specified here: http://www.speech.sri.com/projects/srilm/manpages/srilm-faq.7.html
Is it possible to use it in Moses? In particular, since this model uses count files and a file specifying weights, what is the right way to specify the path in moses.ini?
Thank you,

Alla



_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
--
Hieu Hoang
Researcher
New York University, Abu Dhabi
http://www.hoang.co.uk/hieu


