I don't want all the history. I just want the current articles, so I am downloading pages-meta-current.xml.bz2 and pages-articles.xml.bz2. When I try to download pages-meta-current, the file is 1.5GB instead of 5.4GB. When I download pages-articles, it is 3GB, as it should be. Where else can I get pages-meta-current? From the previous dump? When I look for the previous one, I can only find a status.html file. Maybe I don't really need pages-meta-current if I only want the current articles?
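One way to tell whether the 1.5GB file is a cut-off download rather than a complete dump is to read it through to the end and see whether the bzip2 stream finishes cleanly. The following is only a minimal sketch using Python's standard bz2 module; the function name is_complete_bz2 and the chunk size are made up for illustration, and it says nothing about why the sizes differ.

```python
import bz2


def is_complete_bz2(path, chunk_size=1024 * 1024):
    """Return True if the bzip2 file at `path` can be read to its end-of-stream marker.

    Hypothetical helper for illustration; not part of any dump tooling.
    """
    try:
        with bz2.open(path, "rb") as f:
            # Decompress chunk by chunk and discard the output,
            # so memory use stays small even for multi-gigabyte dumps.
            while f.read(chunk_size):
                pass
    except EOFError:
        # The compressed data ended before the end-of-stream marker: truncated file.
        return False
    except OSError:
        # The data is not valid bzip2 at all (for example, an HTML error page).
        return False
    return True
```

Calling is_complete_bz2("pages-meta-current.xml.bz2") on the downloaded file would return False for a truncated download. Reading a file of this size takes a while, since the whole archive has to be decompressed.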
-----Original Message-----
From: wikitech-l-bounces@lists.wikimedia.org [mailto:wikitech-l-bounces@lists.wikimedia.org] On Behalf Of Fawad Nazir
Sent: Monday, October 29, 2007 11:57 AM
To: Wikimedia developers
Subject: Re: [Wikitech-l] Dump is small
The last pages-meta-current.xml.bz2 in 20071018 is said to be 5.4GB, but when I downloaded it, it was only 1.5GB. Why is that? Is there a problem with this dump? I saw there was a lot of discussion about it. What should I do? Thanks
Are you looking for enwiki-20070908-stub-meta-history.xml.gz? That file is 5.4GB. Try downloading that one.
fawad@cyprus:~/wiki$ ls -lh
-r-------- 1 fawad fawad 5.4G 2007-10-21 09:29 enwiki-20070908-stub-meta-history.xml.gz