Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
or even see our own ACL paper from this year, which applies MC techniques correctly:
http://aclweb.org/anthology-new/P/P09/P09-1088.pdf

(a problem with the paper you mentioned is that they only ran the sampler for 100 rounds -- that is barely enough to move from the initial distribution)

Miles

2009/10/28 Adam Lopez:
> See this paper (which I believe is the current state of the art for direct
> alignment of phrases) and references therein:
> http://aclweb.org/anthology-new/D/D08/D08-1033.pdf
>
> This strand of research goes back at least as far as this paper:
> http://aclweb.org/anthology-new/W/W02/W02-1018.pdf

--
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
See this paper (which I believe is the current state of the art for direct alignment of phrases) and references therein:
http://aclweb.org/anthology-new/D/D08/D08-1033.pdf

This strand of research goes back at least as far as this paper:
http://aclweb.org/anthology-new/W/W02/W02-1018.pdf

On Tue, Oct 27, 2009 at 10:51 PM, Catalin Braescu wrote:
> Then I wonder how aligning can be done automatically for phrases? And
> what's the accuracy of such a process?
>
> Catalin Braescu
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
There's nothing unusual about hand aligning data. If there were, there wouldn't be so many tools for it! While it's true that you wouldn't do it for a million sentences to throw into a large-scale NIST system, it's quite common to hand align small sets for evaluation of techniques, etc. This paper reports hand aligning over 10K sentences, which, while not much for Arabic-English, would represent a substantial portion of some impoverished data sets:
http://aclweb.org/anthology-new/H/H05/H05-1012.pdf

Adam

On Tue, Oct 27, 2009 at 10:36 PM, Miles Osborne wrote:
> well, alignment is a task that is really done en masse and not
> sentence-by-sentence. apart from say teaching, there isn't really a
> need for a GUI to do it.
>
> (convince me that you are ready to use this to align 8 million
> sentence pairs and i'd be impressed)
>
> Miles
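As a rough illustration of how a small hand-aligned set can be used to evaluate an automatic aligner, here is a minimal Python sketch of alignment error rate (AER), one widely used measure. The thread itself does not name a metric, so picking AER is an assumption, and the function name and toy links below are made up for the example. Links are (source index, target index) pairs; "sure" links are the ones the annotator is confident about, "possible" links are the optional ones.

```python
def alignment_error_rate(predicted, sure, possible):
    """AER = 1 - (|A & S| + |A & P|) / (|A| + |S|).

    predicted: links proposed by the automatic aligner (A)
    sure:      hand-annotated links marked as certain (S)
    possible:  hand-annotated links marked as optional; S is folded into P
    """
    possible = possible | sure  # by convention every sure link is also possible
    denominator = len(predicted) + len(sure)
    if denominator == 0:
        return 0.0  # nothing predicted and nothing annotated: no error
    matched = len(predicted & sure) + len(predicted & possible)
    return 1.0 - matched / denominator


# Hypothetical hand annotation for one three-word sentence pair.
sure_links = {(0, 0), (1, 2)}
possible_links = {(2, 1)}

# Hypothetical aligner output: one correct link and one wrong link.
aligner_output = {(0, 0), (1, 1)}

print(alignment_error_rate(aligner_output, sure_links, possible_links))
```

Lower is better; an aligner that returns exactly the sure links (with or without the possible ones) scores 0.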
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
"phrases" are not usually directly aligned. instead, words are (this is what Giza++ does for example). phrases are usually extracted using heuristics.

the accuracy of word alignment is a function of the number of sentence pairs and also the actual language pair. for example, you need a lot more data to do well at Chinese-English than with Spanish-English.

Miles

2009/10/27 Catalin Braescu:
> Then I wonder how aligning can be done automatically for phrases? And
> what's the accuracy of such a process?
>
> Catalin Braescu
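To make the point that phrases are extracted from word alignments by heuristics more concrete, here is a minimal Python sketch. It is not the code used by Giza++ or Moses; it assumes the word alignment is already available as a set of (source index, target index) links and keeps every phrase pair that contains at least one link and that no link crosses, which is one common consistency criterion. The function name, the max_len limit and the toy sentence pair are all made up for illustration.

```python
def extract_phrase_pairs(src, tgt, links, max_len=4):
    """Return phrase pairs that are consistent with a word alignment.

    src, tgt: lists of tokens
    links:    set of (source_index, target_index) pairs, 0-based
    max_len:  longest phrase (in tokens) considered on either side
    """
    pairs = []
    for s1 in range(len(src)):
        for s2 in range(s1, min(s1 + max_len, len(src))):
            for t1 in range(len(tgt)):
                for t2 in range(t1, min(t1 + max_len, len(tgt))):
                    # Links with both ends inside the candidate box.
                    inside = [(s, t) for s, t in links
                              if s1 <= s <= s2 and t1 <= t <= t2]
                    # Links with exactly one end inside the box.
                    crossing = [(s, t) for s, t in links
                                if (s1 <= s <= s2) != (t1 <= t <= t2)]
                    if inside and not crossing:
                        pairs.append((" ".join(src[s1:s2 + 1]),
                                      " ".join(tgt[t1:t2 + 1])))
    return pairs


# Toy Spanish-English pair with a hypothetical word alignment.
src = "la casa verde".split()
tgt = "the green house".split()
links = {(0, 0), (1, 2), (2, 1)}  # la-the, casa-house, verde-green

for source_phrase, target_phrase in extract_phrase_pairs(src, tgt, links):
    print(source_phrase, "|||", target_phrase)
```

The brute-force loop is only meant to show the idea on a single short sentence; it would be far too slow for a real corpus.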
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
Then I wonder how aligning can be done automatically for phrases? And what's the accuracy of such a process?

Catalin Braescu

On Wed, Oct 28, 2009 at 12:36 AM, Miles Osborne wrote:
> well, alignment is a task that is really done en masse and not
> sentence-by-sentence. apart from say teaching, there isn't really a
> need for a GUI to do it.
>
> (convince me that you are ready to use this to align 8 million
> sentence pairs and i'd be impressed)
>
> Miles
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
well, alignment is a task that is really done en masse and not sentence-by-sentence. apart from say teaching, there isn't really a need for a GUI to do it.

(convince me that you are ready to use this to align 8 million sentence pairs and i'd be impressed)

Miles

2009/10/27 Catalin Braescu:
> Big thanks for the links!
>
> But I have to say I cannot believe my eyes... most of these programs
> are jar files launched with parameters from the command line... and
> the way they work could be a textbook case of user unfriendliness :-(
>
> How can people stand such primitive and bizarre apps? I am not bashing
> their authors, I am only surprised there weren't any authors of better
> programs...
>
> Catalin Braescu
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
Big thanks for the links!

But I have to say I cannot believe my eyes... most of these programs are jar files launched with parameters from the command line... and the way they work could be a textbook case of user unfriendliness :-(

How can people stand such primitive and bizarre apps? I am not bashing their authors, I am only surprised there weren't any authors of better programs...

Catalin Braescu

On Tue, Oct 27, 2009 at 9:57 PM, Adam Lopez wrote:
> There are several of these around. Note that I have not used any of them.
>
> http://www.cs.utah.edu/~hal/HandAlign/
> http://www.umiacs.umd.edu/~nmadnani/alignment/forclip.htm
> http://www.d.umn.edu/~tpederse/parallel.html
> http://www.let.rug.nl/~tiedeman/Uplug/
>
> Ulrich Germann also demonstrated such an editor at last year's ACL,
> although it does not seem to be online; perhaps email him.
>
> Adam
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
There are several of these around. Note that I have not used any of them.

http://www.cs.utah.edu/~hal/HandAlign/
http://www.umiacs.umd.edu/~nmadnani/alignment/forclip.htm
http://www.d.umn.edu/~tpederse/parallel.html
http://www.let.rug.nl/~tiedeman/Uplug/

Ulrich Germann also demonstrated such an editor at last year's ACL, although it does not seem to be online; perhaps email him.

Adam

On Tue, Oct 27, 2009 at 6:25 PM, Catalin Braescu wrote:
> Ok, so what I'm looking for is a non-CLI alignment editor. Any ideas?
>
> Catalin Braescu
> Omlulu.com
Re: [Moses-support] Looking for non-CLI tool for aligning parallel text
Ok, so what I'm looking for is a non-CLI alignment editor. Any ideas?

Catalin Braescu
Omlulu.com

On Tue, Oct 27, 2009 at 1:41 PM, Catalin Braescu wrote:
> I am asking in advance for your forgiveness if my question is trivial
> (or, rather, the answer).
>
> I am looking for a non-CLI tool that a not-very-technical person can
> use to align 2 documents in different languages.
>
> When I'm saying "non-CLI" I mean anything that has a window and a
> "visual" way of handling things: anything between a dual-pane Notepad,
> a PHP-backed web form, a Java applet, whatever. As in, not a command
> line "thing" - our newly hired PC operators won't be able to handle
> it.
>
> Any suggestions?
>
> Catalin Braescu
> Omlulu.com
[Moses-support] Looking for non-CLI tool for aligning parallel text
I am asking in advance for your forgiveness if my question is trivial (or, rather, the answer).

I am looking for a non-CLI tool that a not-very-technical person can use to align 2 documents in different languages.

When I'm saying "non-CLI" I mean anything that has a window and a "visual" way of handling things: anything between a dual-pane Notepad, a PHP-backed web form, a Java applet, whatever. As in, not a command line "thing" - our newly hired PC operators won't be able to handle it.

Any suggestions?

Catalin Braescu
Omlulu.com