Re: [Moses-support] Character ngrams using KenLM

Nat Gillin Wed, 09 Nov 2016 20:18:07 -0800

Dear Kenneth and Moses community,

@Kenneth, Thank you for the tip!


Regards,
Nat

On Wed, Nov 9, 2016 at 4:46 PM, Kenneth Heafield <mo...@kheafield.com>
wrote:

> No. Tokenizer and LM are separate tools. You can of course replace space
> with a token like <spc> or something.
>
> On November 9, 2016 6:04:07 AM GMT+00:00, Nat Gillin <nat.gil...@gmail.com>
> wrote:
>
>> Dear Moses community,
>>
>> Other than manually replacing space with an unused character and adding
>> spaces to each character before training a language model with KenLM. Is it
>> possible for KenLM to generate character ngrams and output in arpa format
>> without altering the input file?
>>
>> Regards,
>> Nat
>>
>> ------------------------------
>>
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Re: [Moses-support] Character ngrams using KenLM

Reply via email to