Maybe it is because of the mismatch, since I don't know French. I will check it, thanks!

On Tue, Nov 13, 2012 at 12:06 AM, Cuong Hoang <hoangcuong2...@gmail.com> wrote:
> Dear Prof. Koehn,
> The puzzling thing is that there is no mismatch between those files!
> The only mismatch is the one reported in the error output!
>
> >> !alignment point (42,23) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (33,24) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (28,26) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (30,28) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (29,29) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (32,29) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (31,30) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (34,31) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (37,34) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (42,35) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (41,36) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (35,37) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (42,37) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (36,38) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (43,39) out of range (0-31,0-25) in line 1, ignoring
>
> On Tue, Nov 13, 2012 at 12:01 AM, Philipp Koehn <pko...@inf.ed.ac.uk> wrote:
>
>> Hi,
>>
>> can you take a look at the files corpus.lowercased.* ?
>> There seems to be a mismatch between those files and
>> the GIZA++ output.
>>
>> -phi
>>
>> On Mon, Nov 12, 2012 at 6:58 AM, Cuong Hoang <hoangcuong2...@gmail.com> wrote:
>> > I attach the output I get when I run MOSES.
>> > I want to note that the quality of my toolkit is good, since I referred to GIZA++ in every aspect before coding.
>> > Besides, the format of the word alignment output is correct.
>> > However, I am stuck with this problem!
>> >
>> > On Mon, Nov 12, 2012 at 10:42 PM, Cuong Hoang <hoangcuong2...@gmail.com> wrote:
>> >>
>> >> Hi all,
>> >> These days I've been stuck with a very puzzling error.
>> >> I've been coding an IBM models training toolkit (Models 1-3) and am now finishing IBM Model 4.
>> >> The output of my toolkit follows exactly the format that MOSES expects, as described at:
>> >> http://www.statmt.org/moses/?n=FactoredTraining.RunGIZA
>> >> For example:
>> >>
>> >> # Sentence pair (1) source length 4 target length 3 alignment score : 0.00643931
>> >> wiederaufnahme der sitzungsperiode
>> >> NULL ({ }) resumption ({ 1 }) of ({ }) the ({ 2 }) session ({ 3 })
>> >> # Sentence pair (2) source length 17 target length 18 alignment score : 1.74092e-26
>> >> ich erklaere die am donnerstag , den 28. maerz 1996 unterbrochene sitzungsperiode des europaeischen parlaments fuer wiederaufgenommen .
>> >>
>> >> Now I am stuck with a very strange "alignment point out of range" error.
>> >> For example:
>> >>
>> >> !alignment point (42,23) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (33,24) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (28,26) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (30,28) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (29,29) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (32,29) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (31,30) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (34,31) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (37,34) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (42,35) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (41,36) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (35,37) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (42,37) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (36,38) out of range (0-31,0-25) in line 1, ignoring
>> >> alignment point (43,39) out of range (0-31,0-25) in line 1, ignoring
>> >>
>> >> However, the output of my toolkit for sentence pair 1 is as follows.
>> >> From English-French:
>> >> # Sentence pair (1) source length 40 target length 44 alignment score : 1.0
>> >> à partir de la fin du xixe siècle , la découverte du spectre électromagnétique et du monde de l 'atome va aussi mener à l 'apparition d 'une nouvelle branche de l 'astronomie , la plus importante de nos jours : l 'astrophysique .
>> >> NULL ({ }) from ({ 1 }) the ({ 4 }) 19th ({ 41 }) century ({ 8 }) onwards ({ 2 5 7 }) , ({ 9 }) the ({ 6 }) discovery ({ 11 }) of ({ 12 }) the ({ 10 }) electromagnetic ({ 14 }) spectrum ({ 13 }) and ({ 15 }) the ({ 16 }) world ({ 17 }) of ({ 18 }) the ({ 19 }) atom ({ 20 39 }) spurred ({ 21 22 40 }) on ({ 24 }) the ({ 25 }) development ({ }) of ({ 3 }) astrophysics ({ 23 26 43 }) , ({ 34 }) a ({ 27 28 }) new ({ 29 }) discipline ({ }) in ({ 31 }) astronomy ({ 30 33 }) that ({ 32 }) is ({ 35 }) now ({ }) considered ({ }) to ({ 38 }) be ({ }) the ({ 42 }) most ({ 36 }) important ({ 37 }) . ({ 44 })
>> >>
>> >> From French-English:
>> >> # Sentence pair (1) source length 44 target length 40 alignment score : 1.0
>> >> from the 19th century onwards , the discovery of the electromagnetic spectrum and the world of the atom spurred on the development of astrophysics , a new discipline in astronomy that is now considered to be the most important .
>> >> NULL ({ }) à ({ }) partir ({ 1 }) de ({ }) la ({ 2 }) fin ({ }) du ({ 7 }) xixe ({ 3 5 }) siècle ({ 4 }) , ({ 6 }) la ({ 10 }) découverte ({ 8 }) du ({ 9 }) spectre ({ 12 }) électromagnétique ({ 11 34 }) et ({ 13 }) du ({ 14 }) monde ({ 15 }) de ({ 16 }) l ({ 17 }) 'atome ({ 18 19 33 }) va ({ 20 32 }) aussi ({ }) mener ({ }) à ({ }) l ({ 21 }) 'apparition ({ 22 }) d ({ 23 }) 'une ({ 26 }) nouvelle ({ 27 }) branche ({ }) de ({ 29 }) l ({ }) 'astronomie ({ 28 30 }) , ({ 25 }) la ({ 31 }) plus ({ }) importante ({ 39 }) de ({ 35 }) nos ({ }) jours ({ }) : ({ }) l ({ 37 }) 'astrophysique ({ 24 36 38 }) . ({ 40 })
>> >>
>> >> So I just wonder what exactly the problem is here, since the word alignment output looks perfectly normal?
>> >> Thanks,
>> >> best regards,
>> >> C. Hoang
>> >>
>> >> --
>> >> Hoàng Cường
>> >> SMTNerd

--
Hoàng Cường
SMTNerd
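
A note for anyone hitting the same "alignment point out of range" error: the reported range (0-31,0-25) presumably corresponds to a corpus line of 32 and 26 tokens, while the records quoted above declare lengths of 40 and 44, so it may help to check each alignment record against its own lengths and against the matching corpus line. Below is a minimal Python sketch of such a check. It is not part of MOSES or GIZA++; the names check_record and matches_corpus are made up for illustration, and the reading of "source length" and "target length" is an assumption taken from the sample record in this thread.

```python
import re

# One "word ({ i j ... })" group from the third line of a record.
GROUP_RE = re.compile(r"(\S+) \(\{([\d ]*)\}\)")
HEADER_RE = re.compile(r"source length (\d+) target length (\d+)")


def check_record(header, sentence_line, aligned_line):
    """Return a list of problems found in one three-line alignment record."""
    problems = []
    sentence = sentence_line.split()
    groups = GROUP_RE.findall(aligned_line)
    words = [w for w, _ in groups[1:]]  # groups[0] is the NULL word

    # Assumption (read off the sample in this thread): "source length" counts
    # the words on the aligned line, "target length" the plain sentence line.
    m = HEADER_RE.search(header)
    if m:
        if int(m.group(1)) != len(words):
            problems.append(f"header says {m.group(1)} aligned words, found {len(words)}")
        if int(m.group(2)) != len(sentence):
            problems.append(f"header says {m.group(2)} sentence tokens, found {len(sentence)}")

    # Every index must point at a token of the plain sentence line (1-based).
    for word, indices in groups:
        for i in map(int, indices.split()):
            if not 1 <= i <= len(sentence):
                problems.append(f"index {i} on {word!r} outside 1..{len(sentence)}")
    return problems


def matches_corpus(sentence_line, corpus_line):
    """True if the record's sentence has as many tokens as the corpus line."""
    return len(sentence_line.split()) == len(corpus_line.split())


# The first sample record from this thread.
header = "# Sentence pair (1) source length 4 target length 3 alignment score : 0.00643931"
sentence = "wiederaufnahme der sitzungsperiode"
aligned = "NULL ({ }) resumption ({ 1 }) of ({ }) the ({ 2 }) session ({ 3 })"
print(check_record(header, sentence, aligned))  # prints [] (no problems found)
```

Calling matches_corpus with the sentence line of record 1 and line 1 of the corresponding corpus.lowercased.* file would show whether the two disagree on token count, which is the kind of mismatch suggested earlier in the thread.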
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support