ah thanks, just what i was looking for.
Hieu Hoang Researcher New York University, Abu Dhabi http://www.hoang.co.uk/hieu On 6 May 2015 at 11:18, joerg <[email protected]> wrote: > > Go to http://opus.lingfil.uu.se > Select the language pair your interested in and click on the language ID's > in the column "mono" (or "raw" next to it) to download the data you like to > use. Europarl is version 7 in the list. You can also take them from here: > http://opus.lingfil.uu.se/Europarl/mono/ > > Best, > Jörg > > > ********************************************************************************** > Jörg Tiedemann > http://stp.lingfil.uu.se/~joerg/ > > > > On May 6, 2015, at 7:25 AM, Hieu Hoang wrote: > > ah thx. The tgz file only has data for a subset of the languages. > > It would be useful to be able to download them all, or at least know how > to extract them from the raw data. > > > > Hieu Hoang > Researcher > New York University, Abu Dhabi > http://www.hoang.co.uk/hieu > > On 6 May 2015 at 03:04, Ulrich Germann <[email protected]> wrote: > >> Extract it from commoncrawl, of course! ;-) >> >> ... or get it here: >> http://www.statmt.org/wmt13/training-monolingual-europarl-v7.tgz >> >> - Uli >> >> On Mon, May 4, 2015 at 5:46 AM, Hieu Hoang <[email protected]> wrote: >> >>> What's the easiest way get the single-language data from the Europarl >>> corpus as described in the 1st table in: >>> http://statmt.org/europarl/ >>> >>> I tried downloading the xml source >>> http://statmt.org/europarl/v7/europarl.tgz >>> stripping the xml and running split-sentence.perl, but this takes an >>> unfathomably long time >>> >>> Hieu Hoang >>> Researcher >>> New York University, Abu Dhabi >>> http://www.hoang.co.uk/hieu >>> >>> _______________________________________________ >>> Moses-support mailing list >>> [email protected] >>> http://mailman.mit.edu/mailman/listinfo/moses-support >>> >>> >> >> >> -- >> Ulrich Germann >> Senior Researcher >> School of Informatics >> University of Edinburgh >> > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support > > >
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
