Lars Aronsson <l...@aronsson.se> writes: > On 09/13/2013 02:54 AM, Gang Chen wrote: >> 1) Is it possible to make some kind of Wikipedia dump? >> >> This tool works fine for extracting the main text from Wikipedia, >> http://wiki.apertium.org/wiki/User:Gang_Chen/Wikipedia_Extractor > > Wikipedia very rarely has good translations between languages.
Fortunately, training the tagger doesn't require parallel text, just monolingual text. -- Kevin Brubeck Unhammer GPG: 0x766AC60C
pgpZ_Es9K5PEL.pgp
Description: PGP signature
------------------------------------------------------------------------------ How ServiceNow helps IT people transform IT departments: 1. Consolidate legacy IT systems to a single system of record for IT 2. Standardize and globalize service processes across IT 3. Implement zero-touch automation to replace manual, redundant tasks http://pubads.g.doubleclick.net/gampad/clk?id=51271111&iu=/4140/ostg.clktrk
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff