Maybe it because of the mismatch since I don't know French.
I will check it,
tks!
On Tue, Nov 13, 2012 at 12:06 AM, Cuong Hoang <hoangcuong2...@gmail.com>wrote:

> Dear Prof. Koehn,
> The fuzzy thing is there is no mismatch between those files!
> The only mismatch is the output of the error!
>
> >> !alignment point (42,23) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (33,24) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (28,26) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (30,28) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (29,29) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (32,29) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (31,30) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (34,31) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (37,34) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (42,35) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (41,36) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (35,37) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (42,37) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (36,38) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (43,39) out of range (0-31,0-25) in line 1, ignoring
>
>
>
>  On Tue, Nov 13, 2012 at 12:01 AM, Philipp Koehn <pko...@inf.ed.ac.uk>wrote:
>
>> Hi,
>>
>> can you take a look at the files corpus.lowercased.* ?
>> There seems to be a mismatch between those files and
>> the GIZA++ output.
>>
>> -phi
>>
>> On Mon, Nov 12, 2012 at 6:58 AM, Cuong Hoang <hoangcuong2...@gmail.com>
>> wrote:
>> > I attach the output when I run MOSES.
>> > I want to make a note that the quality of my toolkit is good since I
>> > reference GIZA++
>> > from every aspect before coding.
>> > To besides, the format of the word alignment output is correct.
>> > However, I stuck with the problem here!
>> >
>> > On Mon, Nov 12, 2012 at 10:42 PM, Cuong Hoang <hoangcuong2...@gmail.com
>> >
>> > wrote:
>> >>
>> >> Hi all,
>> >> These days I've been stuck with a very fuzzy error.
>> >> I've been coding an IBM models training toolkit (1-3) and now finishing
>> >> IBM Model 4.
>> >> The output of my toolkit is exactly as the MOSES specification as
>> >> described: http://www.statmt.org/moses/?n=FactoredTraining.RunGIZA
>> >> For example:
>> >>
>> >> # Sentence pair (1) source length 4 target length 3 alignment score :
>> >> 0.00643931
>> >>  wiederaufnahme der sitzungsperiode
>> >>  NULL ({ }) resumption ({ 1 }) of ({ }) the ({ 2 }) session ({ 3 })
>> >>  # Sentence pair (2) source length 17 target length 18 alignment score
>> :
>> >> 1.74092e-26
>> >>  ich erklaere die am donnerstag , den 28. maerz 1996 unterbrochene
>> >> sitzungsperiode
>> >>    des europaeischen parlaments fuer wiederaufgenommen .
>> >>
>> >>
>> >>
>> >>
>> >> Now, I've been stuck with the very strange error alignment point out of
>> >> range. For example:
>> >>
>> >> !alignment point (42,23) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (33,24) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (28,26) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (30,28) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (29,29) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (32,29) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (31,30) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (34,31) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (37,34) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (42,35) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (41,36) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (35,37) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (42,37) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (36,38) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (43,39) out of range (0-31,0-25) in line 1, ignoring
>> >>
>> >> However, the outputs of my toolkit of the pair 1 are:
>> >> From English-French:
>> >> # Sentence pair (1) source length 40 target length 44 alignment score :
>> >> 1.0
>> >> à partir de la fin du xixe siècle , la découverte du spectre
>> >> électromagnétique et du monde de l 'atome va aussi mener à l
>> 'apparition d
>> >> 'une nouvelle branche de l 'astronomie , la plus importante de nos
>> jours : l
>> >> 'astrophysique .
>> >> NULL ({ }) from ({ 1 }) the ({ 4 }) 19th ({ 41 }) century ({ 8 })
>> onwards
>> >> ({ 2 5 7 }) , ({ 9 }) the ({ 6 }) discovery ({ 11 }) of ({ 12 }) the
>> ({ 10
>> >> }) electromagnetic ({ 14 }) spectrum ({ 13 }) and ({ 15 }) the ({ 16 })
>> >> world ({ 17 }) of ({ 18 }) the ({ 19 }) atom ({ 20 39 }) spurred ({ 21
>> 22 40
>> >> }) on ({ 24 }) the ({ 25 }) development ({ }) of ({ 3 }) astrophysics
>> ({ 23
>> >> 26 43 }) , ({ 34 }) a ({ 27 28 }) new ({ 29 }) discipline ({ }) in ({
>> 31 })
>> >> astronomy ({ 30 33 }) that ({ 32 }) is ({ 35 }) now ({ }) considered
>> ({ })
>> >> to ({ 38 }) be ({ }) the ({ 42 }) most ({ 36 }) important ({ 37 }) .
>> ({ 44
>> >> })
>> >>
>> >> From French-English
>> >> # Sentence pair (1) source length 44 target length 40 alignment score :
>> >> 1.0
>> >> from the 19th century onwards , the discovery of the electromagnetic
>> >> spectrum and the world of the atom spurred on the development of
>> >> astrophysics , a new discipline in astronomy that is now considered to
>> be
>> >> the most important .
>> >> NULL ({ }) à ({ }) partir ({ 1 }) de ({ }) la ({ 2 }) fin ({ }) du ({
>> 7 })
>> >> xixe ({ 3 5 }) siècle ({ 4 }) , ({ 6 }) la ({ 10 }) découverte ({ 8 })
>> du ({
>> >> 9 }) spectre ({ 12 }) électromagnétique ({ 11 34 }) et ({ 13 }) du ({
>> 14 })
>> >> monde ({ 15 }) de ({ 16 }) l ({ 17 }) 'atome ({ 18 19 33 }) va ({ 20
>> 32 })
>> >> aussi ({ }) mener ({ }) à ({ }) l ({ 21 }) 'apparition ({ 22 }) d ({
>> 23 })
>> >> 'une ({ 26 }) nouvelle ({ 27 }) branche ({ }) de ({ 29 }) l ({ })
>> >> 'astronomie ({ 28 30 }) , ({ 25 }) la ({ 31 }) plus ({ }) importante
>> ({ 39
>> >> }) de ({ 35 }) nos ({ }) jours ({ }) : ({ }) l ({ 37 }) 'astrophysique
>> ({ 24
>> >> 36 38 }) . ({ 40 })
>> >>
>> >> So, I just wonder what's the exactly problem here since the output of
>> word
>> >> alignment is very normal?
>> >> Thanks,
>> >> best regards,
>> >> C. Hoang
>> >>
>> >> --
>> >> Hoàng Cường
>> >> SMTNerd
>> >>
>> >
>> >
>> >
>> > --
>> > Hoàng Cường
>> > SMTNerd
>> >
>> >
>> > _______________________________________________
>> > Moses-support mailing list
>> > Moses-support@mit.edu
>> > http://mailman.mit.edu/mailman/listinfo/moses-support
>> >
>>
>
>
>
> --
> Hoàng Cường
> SMTNerd
>
>


-- 
Hoàng Cường
SMTNerd
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to