Just wanted to tell you, that http://aarddict.org users and dictionary creators also stumbled about these missing namespaces and are now suggesting to continue scraping these. So is scraping the expected approach? See here: https://groups.google.com/g/aarddict/c/WssxfWQYsto
Regards, Erik
Am 17.03.22 um 21:39 schrieb Jan Berkel:
Can they be found somewhere else? In N6 or N14? For me it seems that articles/pages that have a colon like Anexo: or Conjugaison: are not part.
These are not namespace 0. Perhaps the export process forgot to respect $wgContentNamespaces?
I don't think this these namespaces are included in $wgContentNamespaces on the Wiktionaries.
I've created a phabricator ticket to request more namespaces to be included in the dump, not sure if this is the correct process/project tag:
https://phabricator.wikimedia.org/T303652
–Jan _______________________________________________ Xmldatadumps-l mailing list -- xmldatadumps-l@lists.wikimedia.org To unsubscribe send an email to xmldatadumps-l-leave@lists.wikimedia.org