Just wanted to let you know that
http://aarddict.org users and dictionary
creators have also stumbled upon these missing namespaces and are now
suggesting to keep scraping them. So is scraping the expected
approach? See here:
https://groups.google.com/g/aarddict/c/WssxfWQYsto
Regards,
Erik
On 17.03.22 at 21:39, Jan Berkel wrote:
>>> Can they be found somewhere else? In N6 or N14? It seems to me that
>>> articles/pages that have a colon in the title, like Anexo: or
>>> Conjugaison:, are not included.
>> These are not namespace 0. Perhaps the export process forgot to respect
>> $wgContentNamespaces?
> I don't think these namespaces are included in $wgContentNamespaces on the
> Wiktionaries.
>
> I've created a Phabricator ticket to request that more namespaces be included
> in the dump; I'm not sure if this is the correct process/project tag:
>
> https://phabricator.wikimedia.org/T303652
>
> –Jan