Yes, it was safe in my case (import of Russian and English Wiktionary). See http://meta.wikimedia.org/wiki/Talk:Xml2sql and example of script or shell command to strip out the <redirect />
-- Andrew. On Fri, Feb 5, 2010 at 6:38 AM, Eric Sun <e...@cs.stanford.edu> wrote: > Would it be safe to strip out the <redirect /> tags from the xml and > reimport, or will that cause other problems? > > Thanks, > Eric > > On Thu, Feb 4, 2010 at 6:24 PM, Chad <innocentkil...@gmail.com> wrote: > >> On Thu, Feb 4, 2010 at 9:12 PM, Eric Sun <e...@cs.stanford.edu> wrote: >> > Hi, >> > >> > I saw this thread back in October where someone was having trouble >> > importing the English Wikipedia XML dump: >> > http://lists.wikimedia.org/pipermail/wikitech-l/2009-October/045594.html >> > The thread back in October seemed to end without resolution, and the >> > tools still seem to be broken, so has anyone found a solution in the >> > meantime? >> > >> > I'm using mediawiki-1.15.1 and attempting to import >> > enwiki-20100130-pages-articles.xml.bz2. >> > >> > None of these options seem to work: >> > 1) importDump.php >> > fails by spewing "Warning: xml_parse(): Unable to call handler in_() >> > in ./includes/Import.php on line 437" repeatedly >> > >> > 2) xml2sql (http://meta.wikimedia.org/wiki/Xml2sql): >> > Fails with error: >> > xml2sql: parsing aborted at line 33 pos 16. >> > due to the new <redirect> tag introduced in the new dumps? >> > >> > 3) mwdumper (http://www.mediawiki.org/wiki/MWDumper): >> > Current XML is schema v0.4, but the documentation says that it's for 0.3 >> > >> > 4) mwimport (http://meta.wikimedia.org/wiki/Data_dumps/mwimport): >> > Fails immediately: >> > siteinfo: untested generator 'MediaWiki 1.16alpha-wmf', expect trouble >> ahead >> > page: expected closing tag in line 35 >> > >> > Any tips? >> > Thanks! >> > Eric >> > >> > _______________________________________________ >> > Wikitech-l mailing list >> > Wikitech-l@lists.wikimedia.org >> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l >> > >> >> Most of these errors are caused by the new(ish) <redirect /> tag >> within <page> elements. 0.4 is the correct version of the schema, >> but unfortunately the schema was updated and dumps were >> produced using them before the changes made it into a release. >> >> 1.15.1 cannot import pages with <redirect />, we should probably >> backport that. That, and we should rewrite the importers to not barf >> terribly when they encounter an unknown element. >> >> -Chad >> >> _______________________________________________ >> Wikitech-l mailing list >> Wikitech-l@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikitech-l >> > _______________________________________________ > Wikitech-l mailing list > Wikitech-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikitech-l > _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l