We're basically mirroring all the generated dumps, extracting them,
harvesting data, etc. Recently I examined some of the more exotic
languages and, to my surprise, they were even more exotic than I
thought. I propose to ditch them.
Afar (aa) Wikipedia
The latest on our servers is aar-20141223.xml.bz with 22974 bytes
(we convert the language codes into ISO 639-3).
It seems the wiki has been closed or moved to the Incubator:
Nevertheless, this wiki keeps showing up in the xmldumps, pretending
something is there. I believe we'd all be better off if dumps of this
Basically the same applies to the Ndonga Wikipedia
But the xmldumps keep pouring in:
etc. Same story with several other Wikimedia projects in other languages.
So in general: Could we stop dumping closed projects?
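
Until then, a mirror can filter on its own side. Below is a minimal
sketch of how that could look, not what we actually run. It assumes
file names follow the pattern of the one quoted above (three-letter
ISO 639-3 code, dash, eight-digit date). The names CLOSED_CODES,
MIN_BYTES, parse_dump and should_skip are made up for illustration,
and the size threshold is an arbitrary guess.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical list of ISO 639-3 codes whose wikis we treat as closed
# (aar = Afar, ndo = Ndonga). Maintained by hand in this sketch.
CLOSED_CODES = {"aar", "ndo"}

# Hypothetical threshold: smaller dumps are treated as effectively empty.
MIN_BYTES = 100_000

# Pattern inferred from a single example file name; an assumption.
DUMP_NAME = re.compile(r"^(?P<code>[a-z]{3})-(?P<date>\d{8})\.xml\.bz2?$")


class Dump(NamedTuple):
    code: str  # ISO 639-3 language code
    date: str  # dump date as YYYYMMDD
    size: int  # file size in bytes


def parse_dump(name: str, size: int) -> Optional[Dump]:
    """Split a mirrored file name into code and date, or return None."""
    match = DUMP_NAME.match(name)
    if match is None:
        return None
    return Dump(match.group("code"), match.group("date"), size)


def should_skip(dump: Dump) -> bool:
    """Skip dumps of known-closed wikis and dumps that are nearly empty."""
    return dump.code in CLOSED_CODES or dump.size < MIN_BYTES
```

With the Afar file from above, parse_dump("aar-20141223.xml.bz", 22974)
yields a Dump that should_skip flags on both counts. This only hides the
problem on the mirror side; the dumps are still generated upstream.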
Dipl.-Inf. Univ. Richard C. Jelinek
PetaMem GmbH - www.petamem.com Geschäftsführer: Richard Jelinek
Language Technology - We Mean IT! Sitz der Gesellschaft: Fürth
2.58921 * 10^8 Mind Units Registergericht: AG Fürth, HRB-9201
Xmldatadumps-l mailing list