Although, those old successful runs might have been with the old Moses kbmira. 
I'll look into this and report back.

matt


> On Feb 27, 2015, at 12:19 PM, Matt Post <p...@cs.jhu.edu> wrote:
> 
> Hi Barry — Thanks for the response. I don't think that's it, because I use 
> the exact same approach for lots of other tuning runs. Isn't it the header 
> line of the features file that lists dense features? I've been using this 
> format, where dense features are listed in each header line, and then sparse 
> features in the individual lines:
> 
>       FEATURES_TXT_BEGIN_0 0 300 9 lm_0 lm_1 tm_pt_1 tm_pt_3 tm_pt_0 tm_pt_2 
> WordPenalty PhrasePenalty Distortion 
>       -82.183 -72.639 -79.162 -41.493 -60.118 -28.509 -10.857 19 -8 
>       -82.183 -72.639 -79.162 -41.493 -60.118 -28.509 -10.857 19 -8 
> OOVPenalty=-100
> 
> This works in lots of places (although, it also raises a separate question, 
> of whether kbmira actually distinguishes between sparse and dense features? I 
> seem to remember Colin once saying that there is a single group weight 
> between the two groups, but I've never been able to find this in the code).
> 
> matt
> 
> 
>> On Feb 26, 2015, at 5:35 PM, Barry Haddow <bhad...@staffmail.ed.ac.uk 
>> <mailto:bhad...@staffmail.ed.ac.uk>> wrote:
>> 
>> Hi Matt
>> 
>> When mert-moses.pl runs kbmira, it always supplies a list of the dense 
>> features (and their initial values) using the --dense-init parameter. I 
>> think this is your problem. I've attached a typical file used for this 
>> feature list.
>> 
>> Of course, kbmira should have a sensible message rather than a segfault. 
>> This is probably my doing,
>> 
>> cheers - Barry
>> 
>> On 26/02/15 22:18, Matt Post wrote:
>>> kbmira segfaults on the following command:
>>> 
>>> 
>>>         kbmira run --ffile run1.features.dat --scfile run1.scores.dat -o 
>>> mert.out
>>> 
>>> Where run1.features.dat (30 MB) and run1.scores.dat (14 MB) can be 
>>> downloaded here:
>>> 
>>> 
>>>         https://www.dropbox.com/s/yim7ub1bmq5jv2g/run1.features.dat?dl=0 
>>> <https://www.dropbox.com/s/yim7ub1bmq5jv2g/run1.features.dat?dl=0>
>>> 
>>>         https://www.dropbox.com/s/kkek36o7aflgzuu/run1.scores.dat?dl=0 
>>> <https://www.dropbox.com/s/kkek36o7aflgzuu/run1.scores.dat?dl=0>
>>> 
>>> I tracked it down to this line of mert/FeatureStats.cpp.
>>> 
>>> std::string SparseVector::decode(std::size_t id)
>>> {
>>>   return m_id_to_name[id];
>>> }
>>> 
>>> Any obvious ideas before I go down this rabbit hole? I verified there are 
>>> no blank lines or anything else funny with the formatting, at least as far 
>>> as I can tell (all dense features, plus one sparse feature, 
>>> OOVPenalty=-100, showing up occasionally).
>>> 
>>> matt
>>> 
>>> 
>>> 
>>> 
>>> 
>>> _______________________________________________
>>> Moses-support mailing list
>>> Moses-support@mit.edu <mailto:Moses-support@mit.edu>
>>> http://mailman.mit.edu/mailman/listinfo/moses-support 
>>> <http://mailman.mit.edu/mailman/listinfo/moses-support>
>> 
>> <run1.dense>
> 
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to