Hi
what I have done is to download the last version of kenlm, built it and run
the build_binary. Then I call a 2010 version of moses using the lm.

I run the ./test.sh and I got these messages:
util/bit_packing_test.cc:14: error: expected constructor, destructor, or
type conversion before â(â token
util/bit_packing_test.cc:21: error: expected constructor, destructor, or
type conversion before â(â token
util/bit_packing_test.cc:59: error: expected `}' at end of input
util/bit_packing_test.cc:59: error: expected `}' at end of input

I guess, I need to reinstall the full package moses+kenlm and biuld
everything together.

Thanks a lot
Marco

On Mon, Oct 10, 2011 at 6:46 PM, Kenneth Heafield <mo...@kheafield.com>wrote:

> **
> Hi,
>
>     Number 8 means prefault and number 9 means lazy mmap.  It's an option
> and orthogonal to the data structure.
>
>     Since the binary file is the in-memory representation, I do paranoid
> checks to make sure your machine represents floats, 64-bit integers, and
> such in the same way.  For example a 32-bit build will have different
> alignment than a 64-bit build.  This check is complaining.
>
>     Please try build_binary and moses from the same build.  If that doesn't
> work, please send me the first kilobyte of your binary file.
>
>     Also, if you have Boost, can you cd kenlm && make clean && ./test.sh
> and complain if there are any test failures?
>
> Kenneth
>
>
> On 10/10/11 17:31, marco turchi wrote:
>
> Hi Kenneth,
> which number shall I use in the moses.ini 8 or 9 if i build my lm with
> these parameters -q 8 -b 8?
>
> I got this error when I run moses:
> In LanguageModelKen::Load: nGramOrder = 5 will be ignored.  Using whatever
> the file has.
> terminate called after throwing an instance of 'lm::FormatLoadException'
>   what():  File looks like it should be loaded with mmap, but the test
> values don't match.  Was it built on a different machine or with a different
> compiler?
>
> I have the feeling that my moses version needs to be updated!
>
> Thanks a lot
> Marco
>
> On Sat, Oct 8, 2011 at 1:02 PM, marco turchi <marco.tur...@gmail.com>wrote:
>
>> Thanks!
>> I'm going to update my version.
>>
>> Cheers
>> Marco
>>
>>
>> On Sat, Oct 8, 2011 at 1:01 PM, Kenneth Heafield <mo...@kheafield.com>wrote:
>>
>>>  Fixed in revision 4314.  There's still an issue with some SRILM models
>>> failing to build that I'll get to soon.
>>>
>>> On 10/08/11 11:52, marco turchi wrote:
>>>
>>> Hi,
>>> thanks a lot for the answer.
>>> Great, so I can use -m 2048 to build it. Do you think it is enough?
>>>
>>> Thanks again
>>> Marco
>>>
>>> On Sat, Oct 8, 2011 at 12:46 PM, Kenneth Heafield 
>>> <mo...@kheafield.com>wrote:
>>>
>>>>  Hi,
>>>>
>>>>     This looks like a bug in the trie implementation due to some recent
>>>> changes I made for left state minimization.  I'll fix it soon.  A 
>>>> workaround
>>>> is to pass a large -m option to build_binary.
>>>>
>>>> Sorry,
>>>>
>>>> Kenneth
>>>>
>>>>
>>>> On 10/08/11 11:34, marco turchi wrote:
>>>>
>>>>  Dear All,
>>>> I'm trying to build a lm using a large dataset (> 11 M sentences). I
>>>> have generated the Arpa format with irstlm and now I'd like to binarize it
>>>> using kenlm.
>>>>
>>>> I have called the build_binary to estimate memory usage, and I got this
>>>>
>>>> Memory estimate:
>>>> type       MB
>>>> probing 16129 assuming -p 1.5
>>>> trie     7462 without quantization
>>>> trie     4361 assuming -q 8 -b 8 quantization
>>>> trie     6440 assuming -a 22 array pointer compression
>>>> trie     3339 assuming -a 22 -q 8 -b 8 array pointer compression and
>>>> quantization
>>>>
>>>> then I run the binarization in this way:
>>>>
>>>> /nfs/staging/turchmo/moses/kenlmNew/build_binary -i -t /tmp/ -q 8 -b 8
>>>> trie irstLM.ARPA.txt irstLanguageModel.binary.lm
>>>>
>>>> but I got this error:
>>>>
>>>> lm/search_trie.cc:409 in void
>>>> lm::ngram::trie::<unnamed>::SanityCheckCounts(const std::vector<long
>>>> unsigned int, std::allocator<long unsigned int> >&, const std::vector<long
>>>> unsigned int, std::allocator<long unsigned int> >&) threw util::Exception'.
>>>> Longest count should be constant but it changed from 289546423 to
>>>> 289546405 Byte: 37297517525
>>>>
>>>> I have had a look into the mailing list, but I do not find any post with
>>>> the same error.
>>>>
>>>> Any ideas?
>>>>
>>>> Thanks a lot
>>>> Marco
>>>>
>>>>
>>>> _______________________________________________
>>>> Moses-support mailing 
>>>> listMoses-support@mit.eduhttp://mailman.mit.edu/mailman/listinfo/moses-support
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Moses-support mailing list
>>>> Moses-support@mit.edu
>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>
>>>>
>>>
>>>
>>
>
>
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to