The https://dumps.wikimedia.org web interface for downloading various
dump files is currently offline. The rsync service for external
mirroring is as well. Local network NFS consumers may or may not be
working depending on which server the consumer is attached to.

This unexpected outage is the result of hardware issues following a
short planned maintenance. We are currently investigating the root
cause of the outage and will post additional updates as they become
available. Thanks for your patience.

Bryan
--
Bryan Davis Wikimedia Foundation <bd808(a)wikimedia.org>
[[m:User:BDavis_(WMF)]] Manager, Cloud Services Boise, ID USA
irc: bd808 v:415.839.6885 x6855
(Forwarding to the Xmldatadumps-l@ list,
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l , and CCing
wikitech-ambassadors@ so the thread is resolved here.)
On Mon, Jun 4, 2018 at 10:55 PM, Nikhil Prakash <nikhil07prakash(a)gmail.com>
wrote:
> Hi there,
>
> This is Nikhil, an undergraduate student from India. I'm trying to
> understand the Wikipedia data dumps provided by Wikimedia.
>
> I'm working with the 20180520 dumps. They contain many sections, each
> holding different data, and I would like to know what each section's
> data represents. Each section is described briefly, but the
> descriptions are not clear to me.
>
> For example, in the section "All pages, current versions only.", is the
> current version of each and every article present in this data? I just
> downloaded "enwiki-20180520-pages-meta-current1.xml-p10p30303.bz2", and
> the first page is "AccessibleComputing", but it does not seem to
> contain the complete article's information.
>
> Hoping to get a quick reply.
>
> Thanks.
> Nikhil
>
--
Nick Wilson (Quiddity)
Community Liaison, Wikimedia Foundation
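
For anyone who wants to check what a "pages-meta-current" file actually
holds for each page, here is a minimal sketch in Python. It streams the
<page> elements out of a dump file and reports, for each one, the title,
the redirect target (if the page carries a <redirect> element), and the
length of the revision text. The function names are made up for
illustration, and the idea that a page which looks incomplete may simply
be a redirect is an assumption to test, not something confirmed in this
thread.

```python
import bz2
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, redirect_target, text_length) for each <page> element.

    `stream` is any binary file-like object holding MediaWiki export XML.
    `redirect_target` is None when the page has no <redirect> element.
    """
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title = None
        redirect_target = None
        text_length = 0
        for node in elem.iter():
            name = local_name(node.tag)
            if name == "title":
                title = node.text
            elif name == "redirect":
                redirect_target = node.get("title")
            elif name == "text":
                text_length = len(node.text or "")
        yield title, redirect_target, text_length
        elem.clear()  # release this page's children; dump files are large


def iter_dump_pages(path):
    """Same as iter_pages, but reads from a .bz2 dump file on disk."""
    with bz2.open(path, "rb") as stream:
        yield from iter_pages(stream)
```

Calling iter_dump_pages() on the downloaded file and printing the first
few tuples should show whether the first page has a redirect target and
only a very short text, or a full article body.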