Xmldatadumps-l July 2010

xmldatadumps-l@lists.wikimedia.org

10 participants
9 discussions

by Andreas Meier

Hello, the actualy running dump for enwiki takes a lot of time for Wiki page-to-page link records. At http://download.wikipedia.org/enwiki/20100130/ it was only one and a half hour. Best regards Andreas

13 years, 9 months

xml dumps resumed

by Ariel T. Glenn

XML dumps have been resumed, one thread only; if they look ok in another 12 hours or so I'll start multiple batches and they will run as usual. Old bad dump files have been moved out of the way. There is no reason to think that the dumps running now won't look fine; this is just me being cautious. Ariel

13 years, 9 months

dumps halted for 1-2 days

by Ariel T. Glenn

Dumps have been halted for 1-2 days while code fixes get merged into the deployment branch of our code (so that we need not have the host that runs them removed from getting regular updates). We'll also be running a job to locate and remove the broken dumps that have accumulated in the past 10 (?) days, priior to restarting them. Ariel

13 years, 9 months

enwiki dump progress on 20100622 - failed again

by Dmitry Chichkov

Subj: http://download.wikimedia.org/enwiki/20100622/ Is there anything that can be done to alleviate that problem? By the way, what's the point of producing .bz2 version of the pages-meta-history.xml dump? Is it easier on the system to produce .bz2 first and .7z after that? From the user's perspective I can tell that .7z is all I need, there is simply no point in working with .bz2 (if .7z is available). -- Regards, Dmitry

13 years, 9 months

Database dumps are stopped ?

by Nicolas Vervelle

Hi, The database dump progress page ( http://dumps.wikimedia.org/backup-index.html) seems to indicate that no dump has been made for more than a week for any Wikipedia. The first line is about the enwiki dump which is still in progress and seems to be updated. But all the other lines are dated back to 2010-07-06 or older. Nico

13 years, 9 months

elwiki and simplewiki stopped

by Andreas Meier

Hello, look at http://download.wikipedia.org/simplewiki/20100705/ and http://download.wikipedia.org/elwiki/20100705/ Best regards Andreas

13 years, 9 months

by jcms

-- Este mensaje le ha llegado mediante el servicio de correo electronico que ofrece Infomed para respaldar el cumplimiento de las misiones del Sistem a Nacional de Salud. La persona que envia este correo asume el compromiso de usar el servicio a tales fines y cumplir con las regulaciones establecidas Infomed: http://www.sld.cu/

13 years, 9 months

Order of pages/revisions

by Chrisil J. Arackaparambil

Hello folks, I had some questions about the order or pages and revisions in the dump. As I understand, the order is according to the respective IDs. But where do these IDs come from? Are they the keys of the corresponding table in the database? So then they are more or less in order of creation? If that's the case, why does the dump begin with pages with titles mostly beginning with "A"? Thank you, Chrisil

13 years, 9 months

Number of pages on Wikipedia

by Chrisil J. Arackaparambil

Hello everybody, I was doing a bit of analysis of the dump enwiki-20100130-pages-meta-history.xml.7z. What I found to my surprise is that there are (at least) 7 million pages in the main namespace. I got this figure by grepping for page titles that do not contain a ":" character. Is this really the case or am I missing something? I'd seen some Wikimedia stats that said the number of articles currently is about 3.2 million, so I'm not sure why I'm seeing so many pages in the dump. Thank you, Chrisil

13 years, 9 months

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

Xmldatadumps-l July 2010