On 3/10/11 6:29 AM, Paul Houle wrote:
I can say, positively, that you'll get the job done faster by
downloading the dump file and cracking into it directly. I've got scripts that can download and extract stuff from the XML dump in an hour or so. I still have some processes that use the API, but I'm increasingly using the dumps because it's faster and easier.
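For a sense of what "cracking into the dump directly" can look like: below is a minimal sketch (not Paul's actual scripts, which aren't shown here) that streams pages out of a MediaWiki XML export with only the Python standard library. `iterparse` avoids loading the multi-gigabyte file into memory; for the compressed dump you'd open it with `bz2.open(path, "rt")` and pass that handle in. The sample XML at the bottom stands in for a real dump file.

```python
# Sketch: stream (title, wikitext) pairs from a MediaWiki XML export
# without loading the whole file into memory.
import io
import xml.etree.ElementTree as ET

def iter_pages(source):
    """Yield (title, wikitext) pairs from a MediaWiki export stream."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        # Match the namespaced </page> tag; the {*} wildcard in the
        # path queries works across export schema versions (Python 3.8+).
        if elem.tag.endswith("}page") or elem.tag == "page":
            title = elem.findtext("{*}title")
            text = elem.findtext("{*}revision/{*}text")
            yield title, text
            elem.clear()  # free the subtree we just consumed

# Tiny inline sample standing in for a real dump file:
sample = io.StringIO(
    '<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">'
    '<page><title>Example</title>'
    '<revision><text>Hello world</text></revision></page>'
    '</mediawiki>'
)
print(list(iter_pages(sample)))  # [('Example', 'Hello world')]
```

The same loop works unchanged on the full enwiki `pages-articles.xml.bz2` dump, since the element structure is identical; only the file handle changes.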
You're likely correct. Also, I've recently been exposed to the 'wikipedia offline patch' extension (http://code.google.com/p/wikipedia-offline-patch/), which I believe allows you to use a compressed dump as your db storage, saving you the pain/space of uncompressing the dump file. Probably worth a look.
Arthur