Hi,
I am using the following Perl modules to extract data from Wikipedia and Wikitravel respectively:
- WWW::Wikipedia
- MediaWiki::API
From both of these modules, and also by looking at the MediaWiki API itself, I seem to get the entire page text in the web service response. To extract individual sections of a wiki entry, I have to rely on pattern matching and regular expressions.
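For illustration, the regex approach described above might look roughly like this (Python rather than Perl, purely for brevity; the wikitext here is made up, and real pages have more heading variants, templates, and edge cases to handle):

```python
import re

# Hypothetical wikitext, standing in for the full page text the API returns.
wikitext = """Intro paragraph.

== History ==
Some history.

== Geography ==
Some geography.
"""

def split_sections(text):
    """Split raw wikitext into {heading: body} using == Heading == markers."""
    sections = {"_lead": []}
    current = "_lead"
    for line in text.splitlines():
        m = re.match(r"^==+\s*(.*?)\s*==+\s*$", line)
        if m:
            current = m.group(1)
            sections[current] = []
        else:
            sections[current].append(line)
    return {k: "\n".join(v).strip() for k, v in sections.items()}

parts = split_sections(wikitext)
print(parts["History"])  # -> Some history.
```

This works for simple pages, but it is exactly the kind of hand-rolled parsing the question is trying to avoid.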
Is there a better way to achieve this? Is there some sample code in any language (preferably Perl) that anyone can share, or some tool which does this out of the box?
Any help would be appreciated.
Regards, Ashish
MediaWiki::DumpFile has some facilities for this, although they were very basic the last time I checked.
Its developer is active and responsive to bug reports, enhancement requests and patches.
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Thanks, Amir.
Do the dumps provide data at a granular level for a wiki entry?
- Ashish
The XML dumps give the complete text of every page, in the same wikitext format that you see when you edit it. They also include metadata, such as the title, authors, timestamp and namespace.
The MediaWiki::DumpFile module also provides some functions that let you analyze page info even if it doesn't necessarily come from a dump, but these functions are relatively basic. See the module's documentation and check whether it has the particular thing that you need.
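To give a feel for the dump format: each page sits in a `<page>` element with `<title>`, `<ns>` and a `<revision>` containing the wikitext. A minimal streaming sketch in Python (the inline XML is a hand-made miniature; real dumps are huge and carry an XML namespace on the root element, which this sketch ignores):

```python
import io
import xml.etree.ElementTree as ET

# A tiny stand-in for a real pages dump.
dump = io.BytesIO(b"""<mediawiki>
  <page>
    <title>Pittsburgh</title>
    <ns>0</ns>
    <revision>
      <timestamp>2012-03-01T00:00:00Z</timestamp>
      <text>'''Pittsburgh''' is a city ...</text>
    </revision>
  </page>
</mediawiki>""")

def iter_pages(stream):
    """Stream (title, wikitext) pairs from a dump without loading it all into memory."""
    for event, elem in ET.iterparse(stream, events=("end",)):
        if elem.tag == "page":
            title = elem.findtext("title")
            text = elem.findtext("revision/text")
            yield title, text
            elem.clear()  # free memory for pages already processed

pages = list(iter_pages(dump))
print(pages[0][0])  # -> Pittsburgh
```

The streaming `iterparse` plus `elem.clear()` pattern matters in practice, since the English Wikipedia dump is far too large to parse as one tree.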
I used this module quite a lot; the biggest thing I did with it is here: http://perlwikibot.svn.sourceforge.net/viewvc/perlwikibot/trunk/no-interwiki...
I haven't maintained it in a long while, but it should still be functional, and you are welcome to recycle the functions and the regular expressions there.
If there's any particular kind of data that you need, let me know; maybe I already have code that can extract it.
-- Amir
MaxSem on IRC gave a solution that may help you.
Using the following call, you can get section titles, numbers and offsets from the beginning of the page: https://en.wikipedia.org/w/api.php?action=parse&page=Pittsburgh&prop...
Using the following call, you can get a section's text by its number: https://en.wikipedia.org/w/api.php?action=parse&page=Pittsburgh&prop...
You can tweak your calls using the API sandbox: https://en.wikipedia.org/wiki/Special:ApiSandbox
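A sketch of how those two calls might be built and consumed (Python; the `prop=sections` and `prop=wikitext` parameter names and the shape of the sample JSON below are from memory, so verify them in the API sandbox before relying on them):

```python
from urllib.parse import urlencode

API = "https://en.wikipedia.org/w/api.php"

def section_list_url(page):
    # action=parse with prop=sections returns the page's section index as JSON
    return API + "?" + urlencode({
        "action": "parse", "page": page,
        "prop": "sections", "format": "json",
    })

def section_text_url(page, number):
    # action=parse with prop=wikitext and a section number returns just that section
    return API + "?" + urlencode({
        "action": "parse", "page": page, "section": number,
        "prop": "wikitext", "format": "json",
    })

# A trimmed, hand-written example of what prop=sections returns, so the
# lookup below runs without a network round trip:
sample = {"parse": {"sections": [
    {"index": "1", "line": "History", "byteoffset": 1234},
    {"index": "2", "line": "Geography", "byteoffset": 5678},
]}}

def find_section_number(response, heading):
    """Map a section heading to its number for use in the second call."""
    for s in response["parse"]["sections"]:
        if s["line"] == heading:
            return s["index"]
    return None

print(find_section_number(sample, "Geography"))  # -> 2
```

Chaining the two calls (first look up the section number by heading, then fetch that section's wikitext) avoids the regex-over-the-whole-page approach entirely.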
-- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com “We're living in pieces, I want to live in peace.” – T. Moore