ah thanks, just what i was looking for.


Hieu Hoang
Researcher
New York University, Abu Dhabi
http://www.hoang.co.uk/hieu

On 6 May 2015 at 11:18, joerg <[email protected]> wrote:

>
> Go to http://opus.lingfil.uu.se
> Select the language pair your interested in and click on the language ID's
> in the column "mono" (or "raw" next to it) to download the data you like to
> use. Europarl is version 7 in the list. You can also take them from here:
> http://opus.lingfil.uu.se/Europarl/mono/
>
> Best,
> Jörg
>
>
> **********************************************************************************
> Jörg Tiedemann
> http://stp.lingfil.uu.se/~joerg/
>
>
>
> On May 6, 2015, at 7:25 AM, Hieu Hoang wrote:
>
> ah thx. The tgz file only has data for a subset of the languages.
>
> It would be useful to be able to download them all, or at least know how
> to extract them from the raw data.
>
>
>
> Hieu Hoang
> Researcher
> New York University, Abu Dhabi
> http://www.hoang.co.uk/hieu
>
> On 6 May 2015 at 03:04, Ulrich Germann <[email protected]> wrote:
>
>> Extract it from commoncrawl, of course! ;-)
>>
>> ... or get it here:
>> http://www.statmt.org/wmt13/training-monolingual-europarl-v7.tgz
>>
>> - Uli
>>
>> On Mon, May 4, 2015 at 5:46 AM, Hieu Hoang <[email protected]> wrote:
>>
>>> What's the easiest way get the single-language data from the Europarl
>>> corpus as described in the 1st table in:
>>>   http://statmt.org/europarl/
>>>
>>> I tried downloading the xml source
>>>    http://statmt.org/europarl/v7/europarl.tgz
>>> stripping the xml and running split-sentence.perl, but this takes an
>>> unfathomably long time
>>>
>>> Hieu Hoang
>>> Researcher
>>> New York University, Abu Dhabi
>>> http://www.hoang.co.uk/hieu
>>>
>>> _______________________________________________
>>> Moses-support mailing list
>>> [email protected]
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>
>>>
>>
>>
>> --
>> Ulrich Germann
>> Senior Researcher
>> School of Informatics
>> University of Edinburgh
>>
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to