Thank you very much. Reading from a named file solved the problem.

Michael.
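
For anyone landing on this thread later, here is a minimal sketch of the workaround discussed in the quoted messages: gzip the corpus and hand it to buildlm by name through -input-path instead of redirecting STDIN. The buildlm flags are the ones quoted in this thread. The helper names compress_corpus and build_model, and the CORPUS and BUILDLM variables, are made up for illustration and are not part of RandLM.

```shell
#!/bin/sh
# Sketch only. CORPUS and BUILDLM are illustrative variables (assumptions);
# the default corpus name is the one from the original post.
CORPUS="${CORPUS:-europarl.lower.token.en}"
BUILDLM="${BUILDLM:-./buildlm}"

# Hypothetical helper: write a gzipped copy next to the corpus, keep the
# original file, and check that the archive is intact.
compress_corpus() {
    gzip -c "$1" > "$1.gz" && gzip -t "$1.gz"
}

# Hypothetical helper: run buildlm on a named, gzipped corpus file.
# The flags are copied from the commands quoted in this thread.
build_model() {
    "$BUILDLM" -struct BloomMap -falsepos 8 -values 8 \
        -output-prefix model -input-type corpus \
        -input-path "$1"
}

# Only run when both the corpus and the buildlm binary are present.
if [ -f "$CORPUS" ] && [ -x "$BUILDLM" ]; then
    compress_corpus "$CORPUS" && build_model "$CORPUS.gz"
fi
```

Compressing first follows Barry's advice about gzip, and passing the file by name follows Miles's observation that STDIN input is unreliable; whether both steps are needed may depend on the RandLM release.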

On Thu, Feb 19, 2009 at 8:30 PM, Miles Osborne <mi...@inf.ed.ac.uk> wrote:

> ok i spoke too soon. using STDIN as input seems unreliable. using a
> named file and reading from it seems ok:
>
> ~/projects/randlm/randlm/bin> ./buildlm -struct BloomMap -falsepos 8
> -values 8 -output-prefix model -input-type corpus -input-path
> ~/group/project/statmt9/josh/experiments/multi-europarl/data/lm/europarl-v3.en.gz
>
> (in the next release i'll try to fix this, or at least make sure it is
> documented)
>
> Miles
>
> 2009/2/19 Miles Osborne <mi...@inf.ed.ac.uk>:
> > that might be it. but i seem to have it working here, using a
> > non-gzipped version of Europarl.
> >
> > in any case, Michael: tell us if it works when the corpus is gzipped
> >
> > Miles
> >
> > 2009/2/19 Barry Haddow <bhad...@inf.ed.ac.uk>:
> >> Hi
> >>
> >> I've seen this error before. The short answer is that you need to use a
> >> gzipped version of the corpus.
> >>
> >> The reason is that randlm uses gzip to decompress/compress when you have a
> >> gzipped corpus, which is fine because gzip takes a -d argument for
> >> decompressing. If presented with a non-gzipped version of the corpus, randlm
> >> attempts to fake gzip with cat, which fails because cat doesn't accept -d.
> >>
> >> This has come up on the mailing list before, as far as I recall.
> >>
> >> regards
> >> Barry
> >>
> >> On Thursday 19 February 2009 13:53, Michael Zuckerman wrote:
> >>> Hi,
> >>>
> >>> We are trying to run RandLM on our files. We use the command:
> >>> $ ./buildlm -struct BloomMap -falsepos 8 -values 8 -output-prefix model
> >>> -input-type corpus < ../../europarl.lower.token.en
> >>>
> >>> And we get the following errors:
> >>> cat: invalid option -- d
> >>> Try `cat --help' for more information.
> >>> rm ./model.tokens
> >>> buildlm: RandLMStats.cpp:312: virtual bool
> >>> randlm::CountStats::observe(const randlm::Word*, randlm::Value, int):
> >>> Assertion `len > 0' failed.
> >>> Aborted
> >>>
> >>> Are you familiar with these errors ? Do you have an idea about how to solve
> >>> them ?
> >>>
> >>> Thanks,
> >>> Michael.
> >>
> >> --
> >> The University of Edinburgh is a charitable body, registered in
> >> Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support