On Wed, Jan 20, 2016 at 1:06 AM, Aaron Halfaker aaron.halfaker@gmail.com wrote:
On Tue, Jan 19, 2016 at 1:10 AM, John Mark Vandenberg jayvdb@gmail.com wrote:
On Tue, Jan 19, 2016 at 4:13 PM, Aaron Halfaker aaron.halfaker@gmail.com wrote:
Here's an example using regular expressions and `mwxml` (a new offshoot of mediawiki-utilities referenced above) https://tools.wmflabs.org/paws/public/EpochFail/examples/mwxml.py.ipynb
The example extracts image links from English Wikipedia, but I imagine it would work for you with little modification.
Well, other languages have different namespace names..., so that script is English only.
Surely the regular expressions are editable ;)
Your walk through uses Dutch Wikipedia as its example, and currently it does not support Dutch Wikipedia. Would you please fix that.