Hi Renu

If you run without "--text yes" then you will get an IRSTLM binary file, as the error message shows, so you don't have to run step 6. This file is different from a KenLM binary file. You can also use the IRSTLM binary file in Moses, as long as you adjust moses.ini to tell Moses what type of LM you have.
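As a rough illustration of "telling Moses what type of LM you have": the baseline instructions pass the language model to train-model.perl as -lm factor:order:path:type, and that type ends up in moses.ini. The sketch below only builds that string. The helper name lm_arg is made up for illustration. The type codes (0 = SRILM, 1 = IRSTLM, 8 = KenLM), the order of 3 and the file name lm.blm are assumptions, so check them against your Moses version and your own files.

```shell
# Build the -lm argument for train-model.perl (factor:order:path:type).
# Hypothetical helper, not part of Moses. The type codes are assumptions
# (0 = SRILM, 1 = IRSTLM, 8 = KenLM); verify them for your Moses version.
lm_arg() {
    kind=$1
    order=$2
    path=$3
    case $kind in
        srilm)  code=0 ;;
        irstlm) code=1 ;;
        kenlm)  code=8 ;;
        *) echo "unknown LM kind: $kind" >&2; return 1 ;;
    esac
    # Factor 0 (surface form) is assumed, as in the baseline instructions.
    echo "0:$order:$path:$code"
}

# Example: an IRSTLM binary file instead of a KenLM one.
# lm_arg irstlm 3 lm.blm    # prints 0:3:lm.blm:1
```

If the file is an IRSTLM binary, the type passed here has to say IRSTLM rather than KenLM; otherwise Moses will try to load it with the wrong reader.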

However, I was suggesting running with just the "--text" argument (i.e. just remove the "yes") to see if that still works,

cheers - Barry

On 06/12/13 04:27, Renu Balyan wrote:
Hi,
Thanks for the response.
Yes, I am using the Moses baseline instructions (http://www.statmt.org/moses/?n=Moses.Baseline). I tried this option too: if I run the command without the '--text yes' option, it runs fine. However, I wanted to ask one thing: does this give me an ARPA file or a binarized one? I ask because when I run the next command in the manual:

6. /home/renu/Desktop/mosesdecoder/bin/build_binary news-commentary-v8.fr-en.arpa.en news-commentary-v8.fr-en.blm.en
*I get the following output:*

Reading news-commentary-v8.fr-en.arpa.en
----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
****************************************************************************************************
lm/read_arpa.cc:63 in void lm::ReadARPACounts(util::FilePiece&, std::vector<long long unsigned int>&) threw FormatLoadException because `line.size() >= 4 && StringPiece(line.data(), 4) == "blmt"'.
*This looks like an IRSTLM binary file. Did you forget to pass --text yes to compile-lm? Byte: 40 File: news-commentary-v8.fr-en.arpa.en*
ERROR

The second-to-last line, shown in bold, indicates that the file I am using is a binary file. Does that mean I already have a binary file and do not need step 6 above (which is in fact for converting from ARPA to binary)? I have not yet tried keeping '--text' and removing 'yes' from the command. I will try this as well.
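One way to answer the "ARPA or binary?" question without running build_binary is to look at the first bytes of the file. The sketch below assumes that an IRSTLM binary starts with "blmt" (as the KenLM check in the error above suggests), that a standard ARPA file starts with \data\ (possibly after a blank line), and that the IRSTLM intermediate format starts with "iARPA". The function name lm_format is made up for illustration, and gzipped files are not handled.

```shell
# Guess the format of a language-model file from its first non-empty line.
# Hypothetical helper, not part of Moses, KenLM or IRSTLM. The header
# strings are assumptions: "blmt" comes from the KenLM error message,
# "\data\" is the usual ARPA header, "iARPA" the IRSTLM intermediate one.
lm_format() {
    file=$1
    # First non-empty line, cut to 16 bytes, with any NUL bytes removed.
    first=$(LC_ALL=C grep -a -m 1 . "$file" | head -c 16 | tr -d '\0')
    case $first in
        blmt*)     echo "irstlm-binary" ;;
        iARPA*)    echo "irstlm-iarpa" ;;
        '\data\'*) echo "arpa" ;;
        *)         echo "unknown" ;;
    esac
}

# Example:
# lm_format news-commentary-v8.fr-en.arpa.en
```

If this reports irstlm-binary for the .arpa.en file, the file is not something build_binary can read, whatever its name says.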
Thanks
Renu

    ----- Original Message -----
    *From:* Barry Haddow <mailto:bhad...@staffmail.ed.ac.uk>
    *To:* renubalyan <mailto:renubal...@cdac.in> ;
    moses-support@mit.edu <mailto:moses-support@mit.edu>
    *Sent:* Friday, December 06, 2013 2:49 AM
    *Subject:* Re: [Moses-support] Warning: Too many arguments while
    IRSTLM language model Training

    Hi

    It looks like you are following the Moses baseline instructions
    (http://www.statmt.org/moses/?n=Moses.Baseline). It's not
    explained, but step 5 should convert the IRSTLM iARPA file
    produced by step 4 to a (standard) ARPA file. The following step
    will then binarise it with KenLM.

    The command you ran is

    /home/renu/Desktop/irstlm/bin/compile-lm --text yes
    news-commentary-v8.fr-en.lm.en.gz news-commentary-v8.fr-en.arpa.en

    I notice that someone added a "yes" to this command in the
    documentation recently (November 13th). Does it work if you don't
    include "yes"?

    IRSTLM folks - can you clarify? Does the '--text' parameter
    require a 'yes' argument? The usage for the command suggests it
    does, but it used to work without,

    cheers - Barry

    On 04/12/13 15:58, renubalyan wrote:
    Hi,
    I am building the baseline system following the Moses manual
    instructions.
    I have installed Moses, GIZA++ and IRSTLM as described in the
    manual.
    The corpus preparation steps (tokenization, ... cleaning) also
    go well.
    However, when I move on to language model training, I run into a
    problem.
    I am following these steps:
    1. mkdir ~/lm

    2. cd ~/lm

    3. /home/renu/Desktop/irstlm/bin/add-start-end.sh <
    /home/renu/Desktop/corpus/news-commentary-v8.fr-en.true.en >
    news-commentary-v8.fr-en.sb.en

    4. export IRSTLM=/home/renu/Desktop/irstlm;
    /home/renu/Desktop/irstlm/bin/build-lm.sh -i
    news-commentary-v8.fr-en.sb.en -t ./tmp -p -s improved-kneser-ney
    -o news-commentary-v8.fr-en.lm.en

    5. /home/renu/Desktop/irstlm/bin/compile-lm --text yes
    news-commentary-v8.fr-en.lm.en.gz news-commentary-v8.fr-en.arpa.en
    Steps 1-4 work well, but step 5 gives me the message "Warning: Too
    many parameters".

    I have searched the web for any possible solution but could not
    find any.
    I am not able to move ahead, kindly help.
    Thanks
    Renu

    
-------------------------------------------------------------------------------------------------------------------------------

    This e-mail is for the sole use of the intended recipient(s) and may
    contain confidential and privileged information. If you are not the
    intended recipient, please contact the sender by reply e-mail and
    destroy
    all copies and the original message. Any unauthorized review, use,
    disclosure, dissemination, forwarding, printing or copying of
    this email
    is strictly prohibited and appropriate legal action will be taken.
    
-------------------------------------------------------------------------------------------------------------------------------



    _______________________________________________
    Moses-support mailing list
    Moses-support@mit.edu
    http://mailman.mit.edu/mailman/listinfo/moses-support


