I don't want all the history. I just want the current articles, so I am
downloading pages-meta-current.xml.bz2 and pages-articles.xml.bz2.
When I try to download pages-meta-current the size is 1.5GB, instead of
5.4GB. When I download pages-articles the size is 3GB like it should be.
Where else can I get the pages-meta-current? The previous dump? When I
look for the previous one, I can only find a status.html file.
Maybe I don't really need the pages-meta-current if I only want the
current articles?
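
One way to tell whether the 1.5GB file is a cut-off download, rather than the listed size being wrong, is to decompress it all the way to the end and see whether the stream finishes cleanly. Below is a minimal Python sketch of that check, assuming only the standard bz2 module. The function name bz2_is_complete and the file name in the usage line are made up for illustration, and a read through a multi-gigabyte dump will take a while. Running `bzip2 -t` on the file should give an equivalent answer.

```python
import bz2
import sys


def bz2_is_complete(path, chunk_size=1024 * 1024):
    """Return True if the .bz2 file at `path` decompresses to a clean end.

    Hypothetical helper for illustration: it reads the whole file in
    chunks and throws the decompressed data away, so memory use stays small.
    """
    try:
        with bz2.open(path, "rb") as f:
            # Keep reading until read() returns an empty bytes object.
            while f.read(chunk_size):
                pass
    except EOFError:
        # The file ended before the end-of-stream marker: truncated download.
        return False
    except OSError:
        # The data is not valid bzip2 (corrupt, or not a .bz2 file at all).
        return False
    return True


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (file name is an example): python check_dump.py pages-meta-current.xml.bz2
    ok = bz2_is_complete(sys.argv[1])
    print("complete" if ok else "truncated or corrupt")
```

If this reports a truncated file, the download itself was cut short and downloading again (possibly with a different client) is worth trying before assuming the dump is broken.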
-----Original Message-----
From: wikitech-l-bounces(a)lists.wikimedia.org
[mailto:wikitech-l-bounces@lists.wikimedia.org] On Behalf Of Fawad Nazir
Sent: Monday, October 29, 2007 11:57 AM
To: Wikimedia developers
Subject: Re: [Wikitech-l] Dump is small
The last pages-meta-current.xml.bz2 in 20071018 is said to be 5.4GB, but
when I downloaded it, it was only 1.5 GB.
Why is that? Is there a problem with this dump? I saw there was a lot of
discussion about it. What should I do?
Thanks
Are you looking for enwiki-20070908-stub-meta-history.xml.gz? That one is
5.4GB. Try downloading it.
fawad@cyprus:~/wiki$ ls -lh
-r-------- 1 fawad fawad 5.4G 2007-10-21 09:29 enwiki-20070908-stub-meta-history.xml.gz
--
Fawad Nazir
http://www.geocities.com/nazir_fawad/
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
http://lists.wikimedia.org/mailman/listinfo/wikitech-l