Hi Bence,
yes, I am thinking about rerunning the script and removing comments
first. Didn't work out yesterday, I had a bug in my script, and
noticed that only in the morning when it was already almost finished.
Let's see when I have the time for the update. (Or if someone else
picks up the code and does it).
Thanks for the comments,
Cheers,
Denny
2012/6/25 Bence Damokos <bdamokos(a)gmail.com>om>:
Hi Denny,
This is a really interesting list.
Looking at the Hungarian list, I find that in many instances the duplicate
interwiki link is actually commented out (in the form of "<!-- Source:
[[en: something]] --> or <!-- wrong interwikis: [[en: ..] [[fr: ..]] -->),
and not real duplicate links. (In some cases there are indeed duplicate
links, where one concept covers two concepts in other languages.)
Maybe you could refine your search algorithm to exclude commented out
links, and improve your listing page by including not only the second
interwiki link found for a given language, but also the first one, so it is
easier to assess without having to check the article pages or source codes?
In any case, the village pumps might be a good place to post a link to the
lists. The "Global message delivery" system might help you in that:
http://meta.wikimedia.org/wiki/Global_message_delivery
Best regards,
Bence
On Mon, Jun 25, 2012 at 12:29 PM, Denny Vrandečić <
denny.vrandecic(a)wikimedia.de> wrote:
Hi all,
I ran some analysis last week, to get some numbers out of the
Wikipedia language links. One type of reports that were generated was
the list of all articles in the main namespaces of the Wikipedias that
link to more than one article in another language edition of Wikipedia
(so called double language links). There are not that many of them
(about 19,000 in total), split by language, all available here:
<http://simia.net/languagelinks/>
Double language links are not errors per se, but they contain a few
nuisances
* they lead to two links in the language links list that just look the
same (you have to hover over them to see that they link to different
languages), which is not really optimal from the user experience side
* they are not saved in the langlinks table and thus are ignored in
certain reports and also in the respective export
I am not sure how to reach out to the respective Wikipedia
communities, or if I should at all. Should I post to their respective
version of the village pump? Remembering from the time I was active on
the Croatian Wikipedia, I would have appreciated that list to check
the entries. I reckoned the wikipedia-l list would be the right place,
but that list looks rather dead.
Cheers,
Denny
--
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 |
http://wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
_______________________________________________
Wikimedia-l mailing list
Wikimedia-l(a)lists.wikimedia.org
Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
_______________________________________________
Wikimedia-l mailing list
Wikimedia-l(a)lists.wikimedia.org
Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
--
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 |
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.