ok i spoke too soon.  using STDIN as input seems unreliable.  using a
named file and reading from it seems ok:

~/projects/randlm/randlm/bin> ./buildlm -struct BloomMap -falsepos 8
-values 8 -output-prefix model -input-type corpus -input-path
~/group/project/statmt9/josh/experiments/multi-europarl/data/lm/europarl-v3.en.gz

(in the next release i'll try to fix this, or at least make sure it is
documented)

Miles

2009/2/19 Miles Osborne <mi...@inf.ed.ac.uk>:
> that might be it.  but i seem to have it working here, using a
> non-gzipped version of Europarl.
>
> in any case, Michael:  tell us if it works when the corpus is gzipped
>
> Miles
>
> 2009/2/19 Barry Haddow <bhad...@inf.ed.ac.uk>:
>> Hi
>>
>> I've seen this error before. The short answer is that you need to use a
>> gzipped version of the corpus.
>>
>> The reason is that randlm uses gzip to decompress/compress when you have a
>> gzipped corpus, which is fine because gzip takes a -d argument for
>> decompressing. If presented with a non-gzipped version of the corpus, randlm
>> attempts to fake gzip with cat, which fails because cat doesn't accept -d.
>>
>> This has come up on the mailing list before, as far as I recall.
>>
>> regards
>> Barry
>>
>> On Thursday 19 February 2009 13:53, Michael Zuckerman wrote:
>>> Hi,
>>>
>>> We are trying to run RandLM on our files. We use the command:
>>> $ ./buildlm -struct BloomMap -falsepos 8 -values 8 -output-prefix model
>>> -input-type corpus < ../../europarl.lower.token.en
>>>
>>> And we get the following errors:
>>> cat: invalid option -- d
>>> Try `cat --help' for more information.
>>> rm ./model.tokens
>>> buildlm: RandLMStats.cpp:312: virtual bool
>>> randlm::CountStats::observe(const randlm::Word*, randlm::Value, int):
>>> Assertion `len > 0' failed.
>>> Aborted
>>>
>>> Are you familiar with these errors ? Do you have an idea about how to solve
>>> them ?
>>>
>>> Thanks,
>>>      Michael.
>>
>> --
>> The University of Edinburgh is a charitable body, registered in
>> Scotland, with registration number SC005336.
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>
>
>
> --
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
>



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to