Thank you very much, reading from a named file solved the problem.

Michael.

On Thu, Feb 19, 2009 at 8:30 PM, Miles Osborne <mi...@inf.ed.ac.uk> wrote:

> ok i spoke too soon.  using STDIN as input seems unreliable.  using a
> named file and reading from it seems ok:
>
> ~/projects/randlm/randlm/bin> ./buildlm -struct BloomMap -falsepos 8
> -values 8 -output-prefix model -input-type corpus -input-path
>
> ~/group/project/statmt9/josh/experiments/multi-europarl/data/lm/europarl-v3.en.gz
>
> (in the next release i'll try to fix this, or at least make sure it is
> documented)
>
> Miles
>
> 2009/2/19 Miles Osborne <mi...@inf.ed.ac.uk>:
> > that might be it.  but i seem to have it working here, using a
> > non-gzipped version of Europarl.
> >
> > in any case, Michael:  tell us if it works when the corpus is gzipped
> >
> > Miles
> >
> > 2009/2/19 Barry Haddow <bhad...@inf.ed.ac.uk>:
> >> Hi
> >>
> >> I've seen this error before. The short answer is that you need to use a
> >> gzipped version of the corpus.
> >>
> >> The reason is that randlm uses gzip to decompress/compress when you have
> a
> >> gzipped corpus, which is fine because gzip takes a -d argument for
> >> decompressing. If presented with a non-gzipped version of the corpus,
> randlm
> >> attempts to fake gzip with cat, which fails because cat doesn't accept
> -d.
> >>
> >> This has come up on the mailing list before, as far as I recall.
> >>
> >> regards
> >> Barry
> >>
> >> On Thursday 19 February 2009 13:53, Michael Zuckerman wrote:
> >>> Hi,
> >>>
> >>> We are trying to run RandLM on our files. We use the command:
> >>> $ ./buildlm -struct BloomMap -falsepos 8 -values 8 -output-prefix model
> >>> -input-type corpus < ../../europarl.lower.token.en
> >>>
> >>> And we get the following errors:
> >>> cat: invalid option -- d
> >>> Try `cat --help' for more information.
> >>> rm ./model.tokens
> >>> buildlm: RandLMStats.cpp:312: virtual bool
> >>> randlm::CountStats::observe(const randlm::Word*, randlm::Value, int):
> >>> Assertion `len > 0' failed.
> >>> Aborted
> >>>
> >>> Are you familiar with these errors ? Do you have an idea about how to
> solve
> >>> them ?
> >>>
> >>> Thanks,
> >>>      Michael.
> >>
> >> --
> >> The University of Edinburgh is a charitable body, registered in
> >> Scotland, with registration number SC005336.
> >>
> >> _______________________________________________
> >> Moses-support mailing list
> >> Moses-support@mit.edu
> >> http://mailman.mit.edu/mailman/listinfo/moses-support
> >>
> >
> >
> >
> > --
> > The University of Edinburgh is a charitable body, registered in
> > Scotland, with registration number SC005336.
> >
>
>
>
> --
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to