Behzad,
The XML dumps should be complete and reflect the full history of pages in
Wikipedia. Could you give an example of a page from the XML dump that
doesn't have the full set of revisions?
-Aaron
On Tue, Feb 24, 2015 at 12:49 PM, Jeremy Baron <jeremy(a)tuxmachine.com>
wrote:
On Feb 24, 2015 1:44 PM, "Behzad Tabibian"
<btabibian(a)gmail.com> wrote:
I am new to working with Wikipedia dumps. I am
trying to obtain full
revision history of all the articles on Wikipedia. I
downloaded enwiki-20140707-pages-meta-history1.xml-*.7z from
https://dumps.wikimedia.org/enwiki/20140707/. However, by looking at the
xml files revision history of individual articles do not match with
revision history one may see from history page on Wikipedia website. It
seems the dump contains significantly smaller number of revisions than what
can be found on Wikipedia.
This may be a decent place to ask (actually I don't read this list too
much so just guessing) but probably more relevant at
xmldatadumps-l(a)lists.wikimedia.org . FYI
-Jeremy
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l