I just wanted to say thank you! That's truly amazing work.
As far as I can tell, more than 200 Million lines of wikitext have so far been removed from the Wikipedias. That's 200 Million lines that do not have to maintained anymore.
(I have not run the actual analysis yet, I have been waiting for the bots to finish their job, but maybe I should as it is pretty much exactly a year since I run the analysis on the pre-Wikidata age Wikipedia dumps).
You are amazing!
Cheers, Denny
2013/7/8 addshorewiki addshorewiki@gmail.com
For the bot removing interwiki links that are redirects etc my new code should be ready by this weekend (I hope) and this should give the lists I have a big clear out! :)
Addshore On 8 Jul 2013 04:32, "Romaine Wiki" romaine_wiki@yahoo.com wrote:
Today we reached at nl-wiki the situation that + 64% of the interwikiconflicts have been solved. A lot of this work has been done by the Dutch community, but also a lot of work is done by users form other projects, thank you very much for the help!
I have checked the complete template namespace and category namespace for local interwiki's and all are removed from these pages, so these namespaces are now clean on nl-wiki. If users from especially smaller Wikipedia's want to know on what pages of their wiki are local interwikis left, you can use AWB, download the latest databasedump and do a query on that dump. If you want to know what query you need exactly, e-mail me personally as the string of the query is a bit long. But it is even for noobs on bots and codes easy to do. (I can also do it for you.)
With doing all this solving of interwikiconflicts, we came across several things:
- A lot of biological conflicts are in our list of interwikiconflicts.
Certain genus do only have one species under it, what makes some Wikipedias make that together one article, while others want two articles as it are two layers in the taxonomical tree. One article on the English Wikipedia that created hundreds of interwikiconflicts was a list to which many redirects were linking which were used for interwikis. All have been removed with a bot.
- Another thing we notice is that a lot of renamings of articles to make
place for a disambiguation page haven't been proparly executed, as on Wikidata in an item of a group of articles, one of the links was to a disambiguation page. (It would be nice if a bot could check for disambiguation pages (based on the presence of a template from [[MediaWiki:Disambiguationspage]] on that wiki in it) so that we know where we need to fix this.)
- Another thing we see is that a lot of interwikis are still local
because the local interwiki links to a page that is a redirect because the page was renamed, while this wasn't changed by a bot. Most interwikibots do not recognize that the redirect is the same page as the one added to Wikidata. So we need a bot to remove all interwikis that link to a redirect linking to a page that is in the same item as the page where the local interwikis are in.
Let's clean this mess up!
Romaine
http://www.wikidata.org/wiki/User:Romaine
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l