Hello!
If you mean this corpus [1], it is not that big: 5.4 GB. Or am I wrong? I can
download it and send you a part of it, if you want.
There are also smaller dumps [2], for example [3].
1. http://dumps.wikimedia.your.org/frwiki/latest/frwiki-latest-pages-meta-current.xml.bz2
2. http://dumps.wikimedia.your.org/frwiki/latest/
3. http://dumps.wikimedia.your.org/frwiki/latest/frwiki-latest-pages-meta-current1.xml-p3p412301.bz2
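
If disk space for decompressing is the worry, a dump such as [3] can be read
as a compressed stream, keeping only every Nth page, so the full XML never has
to be written to disk. Below is a minimal sketch of that idea, not a tested
tool. The function name sample_pages and the step parameter are made up for
illustration, and the code assumes that <page> and </page> each sit on their
own line in the dump. The output is a plain sequence of <page> elements
without the enclosing root element, so it is meant for text extraction rather
than for a strict XML parser.

```python
import bz2


def sample_pages(dump_path, out_path, step=10):
    """Keep every `step`-th <page> element from a bz2-compressed dump.

    The dump is decompressed line by line in memory and only the kept
    pages are written to out_path. Returns (pages_seen, pages_kept).
    Assumption: <page> and </page> each appear alone on a line.
    """
    seen = 0
    kept = 0
    buffer = None  # lines of the page being read, or None outside a page
    with bz2.open(dump_path, "rt", encoding="utf-8") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        for line in src:
            tag = line.strip()
            if tag == "<page>":
                buffer = []
            if buffer is not None:
                buffer.append(line)
            if tag == "</page>" and buffer is not None:
                # Pages 0, step, 2*step, ... are written; the rest are dropped.
                if seen % step == 0:
                    dst.writelines(buffer)
                    kept += 1
                seen += 1
                buffer = None
    return seen, kept
```

For example, sample_pages("frwiki-latest-pages-meta-current1.xml-p3p412301.bz2",
"frwiki-sample.xml", step=10) would keep roughly 10% of the pages in that file.
Taking every 10th page is a systematic sample rather than a truly random one,
which may or may not be random enough for the corpus.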
With best wishes,
Mansur
2018-05-12 16:22 GMT+03:00 Hèctor Alòs i Font <[email protected]>:
> 2018-05-12 14:40 GMT+03:00 Kartik Mistry <[email protected]>:
>
>> On Sat, May 12, 2018 at 2:51 PM, Hèctor Alòs i Font
>> <[email protected]> wrote:
>> > I'd like to create a French Wikipedia corpus, but I'd rather not
>> > download the whole Wikipedia dump. I'm not sure I have enough disk
>> > space for decompressing it. Is there maybe a 10% dump somewhere?
>>
>> This can be useful too:
>> https://dumps.wikimedia.org/other/contenttranslation/
>
>
> Thanks, Kartik. It is too small and not random enough for what I'm
> looking for, but it is indeed important information for improving the
> translators. A GSoC Apertium project is working on it :)
>
> ------------------------------------------------------------------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>