Just wanted to tell you, that http://aarddict.org users and dictionary creators also stumbled about these missing namespaces and are now suggesting to continue scraping these. So is scraping the expected approach? See here: https://groups.google.com/g/aarddict/c/WssxfWQYsto
Regards, Erik Am 17.03.22 um 21:39 schrieb Jan Berkel: >>> Can they be found somewhere else? In N6 or N14? For me it seems that >>> articles/pages that have a colon like Anexo: or Conjugaison: are not >>> part. >> These are not namespace 0. Perhaps the export process forgot to respect >> $wgContentNamespaces? > I don't think this these namespaces are included in $wgContentNamespaces on > the Wiktionaries. > > I've created a phabricator ticket to request more namespaces to be included > in the dump, not sure if this is the correct process/project tag: > > https://phabricator.wikimedia.org/T303652 > > –Jan > _______________________________________________ > Xmldatadumps-l mailing list -- [email protected] > To unsubscribe send an email to [email protected] _______________________________________________ Xmldatadumps-l mailing list -- [email protected] To unsubscribe send an email to [email protected]
