Dear Wikimedia community,
My name is Fidel Gil. I am a master's student at the Technical University of Kaiserslautern in Germany, and I am currently running some experiments with the enwiki XML data dumps, in which I recreate the linking structure between articles. While doing so for the article https://en.wikipedia.org/wiki/Animation
I found that it has a link labelled 'Walt Disney Studios' in the subsection 'Animated Features CGI' that resolves to 'Walt Disney Animation Studios'.
In the XML file, however, the entry for Animation references 'Walt Disney Studios', a disambiguation page, rather than 'Walt Disney Animation Studios'.
A small excerpt from the line in question: 'In 1937, [[Walt Disney Studios]] premiered their first-ever animated feature'.
Does the XML dump use the displayed link names, rather than some other form of URL resolution, to create these [[<name of article>]] tags?
Looking forward to your reply,
Fidel Gil
Fidel Sergio Gil Guevara, 10/06/20 15:25:
Does the XML dump use the displayed link names, rather than some other form of URL resolution, to create these [[<name of article>]] tags?
No. The dumps contain the wikitext as is. Maybe you have an older version of the dump, before this edit? https://en.wikipedia.org/w/index.php?title=Animation&diff=954685921&oldid=953535522
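To illustrate what "the wikitext as is" means for link extraction, here is a minimal Python sketch. It assumes a simplified regular expression for [[...]] links and a hypothetical helper name, extract_link_targets; neither comes from the dump tooling itself. It returns the literal target written in the wikitext, so any redirect or disambiguation handling would have to be done separately.

```python
import re

# Simplified pattern (an assumption, not the full MediaWiki link grammar):
# matches [[Target]], [[Target|label]] and [[Target#Section|label]].
WIKILINK = re.compile(r"\[\[([^\[\]|#]+)(?:#[^\[\]|]*)?(?:\|[^\[\]]*)?\]\]")


def extract_link_targets(wikitext):
    """Return the literal link targets found in a piece of wikitext.

    Hypothetical helper for illustration: it does not resolve redirects
    or disambiguation pages, it only reads what the wikitext says.
    """
    return [match.group(1).strip() for match in WIKILINK.finditer(wikitext)]


# The excerpt quoted in the question: the target is the title as written.
line = "In 1937, [[Walt Disney Studios]] premiered their first-ever animated feature"
print(extract_link_targets(line))

# A piped link: the target comes before the "|", the label after it.
piped = "[[Walt Disney Animation Studios|Walt Disney Studios]] premiered"
print(extract_link_targets(piped))
```

With the quoted excerpt the sketch yields the title 'Walt Disney Studios', because that is the target stored in that revision. For the piped form it yields 'Walt Disney Animation Studios', with 'Walt Disney Studios' being only the displayed label.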
Federico
xmldatadumps-l@lists.wikimedia.org