Hi,
I have a quick question: What is the best way to integrate wikipedia into other applications without using the html interface? I would like to integrate the search functionality and the content docs into my app. Is there an xml interface to the wikipedia? I would like to bypass screen scraping. Ideal would be a REST style interface.
The application is a public project where I try to connect webservices in a random way: http://konnexion.sourceforge.net/
I would love to integrate wikipedia.
Thanks for any feedback, Stan Wiechers
On Mon, 10 Jan 2005 12:00:07 -0500, Stan Wiechers Stan@rga.com wrote:
I have a quick question: What is the best way to integrate wikipedia into other applications without using the html interface? I would like to integrate the search functionality and the content docs into my app. Is there an xml interface to the wikipedia? I would like to bypass screen scraping. Ideal would be a REST style interface.
Well, there's "&action=raw" to get the raw wikitext of a page. Or look at http://pywikipediabot.sf.net for a Python bot framework for interacting with the site. Or download the database from http://download.wikimedia.org and do really crazy stuff to it without loading the wikimedia servers any.
But no, no XML interface. There's been discussion of code to make bot access easier, but I can't find any of it, and I don't think it resulted in much in the way of new code, I'm afraid.
There is a special page at wikipedia called Special:Export that gives the page Wiki-text and date in XML. You can access to it either by submitting a form at Special:Export with several needed pages, or "get" it page by page this way:
http://en.wikipedia.org/wiki/Special:Export/Main_page
Use this with care, the wiki servers are overloaded.
Petko
На понеделник 10 януари 2005 18:00 Stan Wiechers пише:
Hi,
I have a quick question: What is the best way to integrate wikipedia into other applications without using the html interface? I would like to integrate the search functionality and the content docs into my app. Is there an xml interface to the wikipedia? I would like to bypass screen scraping. Ideal would be a REST style interface.
The application is a public project where I try to connect webservices in a random way: http://konnexion.sourceforge.net/
I would love to integrate wikipedia.
Thanks for any feedback, Stan Wiechers
wikitech-l@lists.wikimedia.org