Räxmät, Mansur. You are right. It seems I got a wrong dump with too many
information that is irrelevant for me. The problem were not the 15 GB, but
the needed space for decompressing them. I think now, it's ok.

2018-05-12 18:40 GMT+03:00 mansur <[email protected]>:

> Hello!
>
> If you mean this corpus [1], it is not so big - 5,4Gb. Or I am wrong? I
> can download it and give you some part of it, if you want.
>
> There are also smaller dumps [2], for example [3]
>
> 1. http://dumps.wikimedia.your.org/frwiki/latest/frwiki-
> latest-pages-meta-current.xml.bz2
> 2. http://dumps.wikimedia.your.org/frwiki/latest/
> 3. http://dumps.wikimedia.your.org/frwiki/latest/frwiki-
> latest-pages-meta-current1.xml-p3p412301.bz2
>
> With best wishes,
> Mansur
>
> 2018-05-12 16:22 GMT+03:00 Hèctor Alòs i Font <[email protected]>:
>
>> 2018-05-12 14:40 GMT+03:00 Kartik Mistry <[email protected]>:
>>
>>> On Sat, May 12, 2018 at 2:51 PM, Hèctor Alòs i Font
>>> <[email protected]> wrote:
>>> > I'd like to create a French Wikipedia corpus, but I wouldn't like to
>>> > download the whole Wikipedia dump. I'm not sure I have enough disk
>>> space for
>>> > decompressing it. Is there somewhere maybe a 10% dump?
>>>
>>> This can be useful too: https://dumps.wikimedia.org/ot
>>> her/contenttranslation/
>>
>>
>> Thanks, Kartik. It is too little and not enough random for what I'm
>> looking for, but this is an important indeed information for improving the
>> translators. A GSoC Apertium project is working on it :)
>>
>> ------------------------------------------------------------
>> ------------------
>> Check out the vibrant tech community on one of the world's most
>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
>> _______________________________________________
>> Apertium-stuff mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>>
>>
>
> ------------------------------------------------------------
> ------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to