Re: [Moses-support] accessing the compact format of the phrase-table

2016-09-23 Thread Dimitar Shterionov
Hello Tom,

Yes, I guess I will have to rebuild. I have the corpus so it will be ok...

Thanks a lot.

Cheers,
Dimitar.

Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation Researcher

www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
Software. No Hardware. No Hassle MT.

<https://www.facebook.com/KantanMT>
<https://plus.google.com/+Kantanmt_cloudmachinetranslation>
<https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
<http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
<http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>



On 23 September 2016 at 12:10, Tom Hoar  wrote:

> Sorry Dimitar,
>
> I understand. I can't think of a way. If you have any of the output stages
> of train-model.perl (assuming that's what you're using), you could resume
> at a downstream step, for example `--do-steps 4-9`. Since you're saving
> disk space, I wouldn't expect you saved things like the GIZA word alignment
> files, etc. Rerunning train-model.perl from the beginning might be your
> only way, assuming you saved the original corpus.
> Best regards,
>
> Tom Hoar
> Chief Executive Officer
> *Precision Translation Tools Pte Ltd*
> Singapore/Thailand
> Web: www.precisiontranslationtools.com
> Thailand Mobile: +66 87 345-1875
> Skype call: tahoar
> Skype chat: tahoar
>
>
> On 9/23/2016 4:45 PM, moses-support-requ...@mit.edu wrote:
>
> Date: Fri, 23 Sep 2016 10:44:57 +0100
> From: Dimitar Shterionov  
> Subject: Re: [Moses-support] accessing the compact format of the
>   phrase-table
> To: Tom Hoar  
> Cc: moses-support@mit.edu
>
> Hello Tom,
>
> When I store the phrase-table for later use I store only the compact
> version - it's simply smaller.
>
> Is there another way to get the original text phrase-table without
> rebuilding?
>
> Cheers,
> Dimitar.
>
> Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation Researcher
> www.KantanMT.com <http://www.kantanmt.com/> <http://www.kantanmt.com/> | Easy 
> Translation - No
> Software. No Hardware. No Hassle MT.
> <https://www.facebook.com/KantanMT> 
> <https://www.facebook.com/KantanMT><https://plus.google.com/+Kantanmt_cloudmachinetranslation>
>  
> <https://plus.google.com/+Kantanmt_cloudmachinetranslation><https://twitter.com/KantanMT>
>  <https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt> 
> <https://www.linkedin.com/company/kantanmt><http://www.slideshare.net/kantanmt>
>  <http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT> 
> <https://www.youtube.com/user/KantanMT><http://kantanmtblog.com/> 
> <http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php> 
> <https://kantanmt.com/rssfeeds.php> 
>
>
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] accessing the compact format of the phrase-table

2016-09-23 Thread Dimitar Shterionov
Hello Tom,

When I store the phrase-table for later use I store only the compact
version - it's simply smaller.

Is there another way to get the original text phrase-table without
rebuilding?

Cheers,
Dimitar.

Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation Researcher

www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
Software. No Hardware. No Hassle MT.

<https://www.facebook.com/KantanMT>
<https://plus.google.com/+Kantanmt_cloudmachinetranslation>
<https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
<http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
<http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>



On 23 September 2016 at 10:37, Tom Hoar  wrote:

> Dimitar, is there any reason you can't go back to the original text
> phrase-table that was used to create the compact format?
>
>
> On 9/23/2016 4:26 PM, moses-support-requ...@mit.edu wrote:
>
> Date: Fri, 23 Sep 2016 10:24:51 +0100
> From: Dimitar Shterionov  
> Subject: Re: [Moses-support] accessing the compact format of the
>   phrase-table
> To: Marcin Junczys-Dowmunt  
> Cc: moses-support@mit.edu
>
> Hi Marcin and thanks for the quick reply.
>
> With the queryPhraseTableMin do I always need to provide exact string to
> match an entry or can I actually provide partial string? or regex?
> Basically I want to extract all 1-grams. Any suggestions?
>
> Thanks once again.
> Dimitar.
>
> Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation Researcher
> www.KantanMT.com <http://www.kantanmt.com/> <http://www.kantanmt.com/> | Easy 
> Translation - No
> Software. No Hardware. No Hassle MT.
> <https://www.facebook.com/KantanMT> 
> <https://www.facebook.com/KantanMT><https://plus.google.com/+Kantanmt_cloudmachinetranslation>
>  
> <https://plus.google.com/+Kantanmt_cloudmachinetranslation><https://twitter.com/KantanMT>
>  <https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt> 
> <https://www.linkedin.com/company/kantanmt><http://www.slideshare.net/kantanmt>
>  <http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT> 
> <https://www.youtube.com/user/KantanMT><http://kantanmtblog.com/> 
> <http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php> 
> <https://kantanmt.com/rssfeeds.php> 
>
>
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] accessing the compact format of the phrase-table

2016-09-23 Thread Dimitar Shterionov
Thanks a lot Marcin.

I was fearing that would be the case :).

Cheers,
Dimitar.

Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation Researcher

www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
Software. No Hardware. No Hassle MT.

<https://www.facebook.com/KantanMT>
<https://plus.google.com/+Kantanmt_cloudmachinetranslation>
<https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
<http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
<http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>



On 23 September 2016 at 10:26, Marcin Junczys-Dowmunt 
wrote:

> Hi Dimitar,
> full string unfortunately. This is a hash table, so no partial methods
> possible. The source string itself isn't stored anywhere.
> Best,
> Marcin
>
> W dniu 23/09/16 o 10:24, Dimitar Shterionov pisze:
>
> Hi Marcin and thanks for the quick reply.
>
> With the queryPhraseTableMin do I always need to provide exact string to
> match an entry or can I actually provide partial string? or regex?
> Basically I want to extract all 1-grams. Any suggestions?
>
> Thanks once again.
> Dimitar.
>
> Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation
> Researcher
>
> www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
> Software. No Hardware. No Hassle MT.
>
> <https://www.facebook.com/KantanMT>
> <https://plus.google.com/+Kantanmt_cloudmachinetranslation>
> <https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
> <http://www.slideshare.net/kantanmt>
> <https://www.youtube.com/user/KantanMT> <http://kantanmtblog.com/>
> <https://kantanmt.com/rssfeeds.php> 
>
>
> On 23 September 2016 at 10:19, Marcin Junczys-Dowmunt 
> wrote:
>
>> Hi,
>> If you want a complete dump of the phrase table as text, this is not
>> possible. The compact phrase table is not reversible. You can use
>> queryPhraseTableMin to ask for the translations of specific phrases.
>> Best,
>> Marcin
>>
>> W dniu 23/09/16 o 10:16, Dimitar Shterionov pisze:
>>
>> Dear all,
>>
>> I want to extract some specific data from the table after the model has
>> been built. Before it is compacted I have it in raw format and it is easy.
>> But I don't know how to process and read the compacted format of the
>> phrase-table. Is there a way to read the phrase-table.minphr format as
>> text? Any suggestions?
>>
>> Thank you very much.
>> Kind regards,
>> Dimitar.
>>
>>
>> ___
>> Moses-support mailing 
>> listMoses-support@mit.eduhttp://mailman.mit.edu/mailman/listinfo/moses-support
>>
>> ___ Moses-support mailing
>> list Moses-support@mit.edu http://mailman.mit.edu/mailman
>> /listinfo/moses-support
>
>
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] accessing the compact format of the phrase-table

2016-09-23 Thread Dimitar Shterionov
Hi Marcin and thanks for the quick reply.

With the queryPhraseTableMin do I always need to provide exact string to
match an entry or can I actually provide partial string? or regex?
Basically I want to extract all 1-grams. Any suggestions?

Thanks once again.
Dimitar.

Dimitar Shterionov | dimit...@kantanmt.com | Machine Translation Researcher

www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
Software. No Hardware. No Hassle MT.

<https://www.facebook.com/KantanMT>
<https://plus.google.com/+Kantanmt_cloudmachinetranslation>
<https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
<http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
<http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>



On 23 September 2016 at 10:19, Marcin Junczys-Dowmunt 
wrote:

> Hi,
> If you want a complete dump of the phrase table as text, this is not
> possible. The compact phrase table is not reversible. You can use
> queryPhraseTableMin to ask for the translations of specific phrases.
> Best,
> Marcin
>
> W dniu 23/09/16 o 10:16, Dimitar Shterionov pisze:
>
> Dear all,
>
> I want to extract some specific data from the table after the model has
> been built. Before it is compacted I have it in raw format and it is easy.
> But I don't know how to process and read the compacted format of the
> phrase-table. Is there a way to read the phrase-table.minphr format as
> text? Any suggestions?
>
> Thank you very much.
> Kind regards,
> Dimitar.
>
>
> ___
> Moses-support mailing 
> listMoses-support@mit.eduhttp://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] accessing the compact format of the phrase-table

2016-09-23 Thread Dimitar Shterionov
Dear all,

I want to extract some specific data from the table after the model has
been built. Before it is compacted I have it in raw format and it is easy.
But I don't know how to process and read the compacted format of the
phrase-table. Is there a way to read the phrase-table.minphr format as
text? Any suggestions?

Thank you very much.
Kind regards,
Dimitar.
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] after tuning UnknownWordPenalty parameter is missing

2016-09-20 Thread Dimitar Shterionov
Hello,

I am trying to tune an engine using mert. I run the mert-moses.pl script
without filtering (i.e., --no-filter-phrase-table). At the end I use
substitute-weights.perl to have a new ini file.

I noticed that in the new ini file I don't have the UnknownWordPenalty
paramter. The original file looks like this (some lines omitted):

...
# feature functions
[feature]
UnknownWordPenalty
WordPenalty
PhrasePenalty
...

# dense weights for feature functions
[weight]
UnknownWordPenalty0= 1
WordPenalty0= -1
PhrasePenalty0= 0.2
TranslationModel0= 0.2 0.2 0.2 0.2
LexicalReordering0= 0.3 0.3 0.3 0.3 0.3 0.3
Distortion0= 0.3
LM0= 0.5



And the new ini file looks like this:
...
# feature functions
[feature]
UnknownWordPenalty
WordPenalty
PhrasePenalty
...

...
[weight]
LexicalReordering0= 0.0762169 0.00608827 0.0268392 0.123948 0.0511544
0.127807
Distortion0= 0.0304069
LM0= 0.0635183
WordPenalty0= -0.268702
PhrasePenalty0= 0.0536959
TranslationModel0= 0.0525019 0.0459397 0.000797115 0.0723839


Does anyone have any idea why is the UnknownWordPenalty missing?

Thank you for your support.

Kind regards,
Dimitar.
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] mosesserver vs moses

2016-08-12 Thread Dimitar Shterionov
Dear all,

I have noticed that moses and mosesserver return different translations
sometimes.
I am wondering what are the differences between moses and mosesserver with
respect to decoding? Are there different parameters set up during
compilation? Can it be ensured that moses and mosesserver will return the
same results?

Thank you,
Kind regards,
Dimitar
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] moses 2.1 and mosesserver different result

2016-07-21 Thread Dimitar Shterionov
Hello,

I have a question regarding the differences between Moses and mosesserver.
I have noticed that sometimes Moses server translates segments differently
from Moses. Is that normal? or is there some good explanation?

I use the same parameters when invoking both.

Thank you,
Kind regards,
Dimitar.
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support