Re: [Wikitech-l] Static HTML Dumps

5 Apr 2011


On 4/5/2011 4:00 PM, Platonides wrote:
...
I think he is better off parsing the articles, though.
For linguistic research you don't need things such as the contents of
templates, so simple wikitext stripping would do. And it will be much,
much, much, much faster than parsing the whole wiki.
Could be true, but what's fascinating for me about Wikipedia is
all of the unscrambled eggs that can be found in the middle of otherwise
unstructured text.
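
For reference, the kind of "simple wikitext stripping" suggested above might look roughly like the sketch below. It is only an illustration, not anyone's actual tool: the function name strip_wikitext and the exact set of regular expressions are assumptions, and it ignores tables, image links, and many other wikitext constructs.

```python
import re


def strip_wikitext(text: str) -> str:
    """Crude wikitext-to-plain-text stripping (hypothetical helper)."""
    # Remove HTML comments.
    text = re.sub(r"<!--.*?-->", "", text, flags=re.DOTALL)
    # Remove self-closing <ref ... /> tags first, then <ref>...</ref> pairs.
    text = re.sub(r"<ref[^>/]*/>", "", text)
    text = re.sub(r"<ref[^>]*>.*?</ref>", "", text, flags=re.DOTALL)
    # Drop templates, innermost first, so nested {{...}} are handled.
    template = re.compile(r"\{\{[^{}]*\}\}")
    while template.search(text):
        text = template.sub("", text)
    # [[target|label]] becomes label; [[target]] becomes target.
    text = re.sub(r"\[\[(?:[^\[\]|]*\|)?([^\[\]|]*)\]\]", r"\1", text)
    # Remove bold/italic apostrophe markup.
    text = re.sub(r"'{2,}", "", text)
    # Remove any remaining HTML-like tags.
    text = re.sub(r"<[^>]+>", "", text)
    # Collapse runs of spaces and tabs.
    return re.sub(r"[ \t]+", " ", text).strip()


sample = (
    "'''Paris''' is the capital of [[France]].<ref>cite</ref> "
    "{{Infobox|a={{b}}}}It lies on the [[Seine|river Seine]]."
)
print(strip_wikitext(sample))
```

Note that a stripper like this throws away the template contents entirely, which is exactly where much of the structured data embedded in the text lives.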
