I am writing a Java program to extract the abstract of a Wikipedia page
given the title of the page. I have done some research and found
out that the abstract will be in rvsection=0.
So, for example, if I want the abstract of the 'Eiffel Tower' wiki page, I
query the API in the following way,
and parse the XML data we get back and take the wikitext in the tag <rev
xml:space="preserve">, which represents the abstract of the Wikipedia page.
But this wikitext also contains the infobox data, which I do not need. I
would like to know if there is any way I can remove the infobox data
and get only the wikitext related to the page's abstract, or if there is an
alternative method by which I can get the abstract of the page directly.
Looking forward to your help.
Thanks in advance
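One alternative, assuming the wiki has the TextExtracts extension installed (Wikipedia does): prop=extracts with the exintro and explaintext parameters returns only the lead section as plain text, so there is no infobox or template markup to strip at all. A minimal sketch of building that request in Java:

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class AbstractUrl {
    // Builds a query against the TextExtracts extension (prop=extracts).
    // exintro limits the result to the lead section; explaintext strips
    // wiki markup, so no infobox data comes back.
    static String buildExtractUrl(String title) {
        String encoded = URLEncoder.encode(title, StandardCharsets.UTF_8);
        return "https://en.wikipedia.org/w/api.php"
             + "?action=query&prop=extracts&exintro&explaintext"
             + "&format=xml&titles=" + encoded;
    }

    public static void main(String[] args) {
        System.out.println(buildExtractUrl("Eiffel Tower"));
    }
}
```

Fetching that URL with any HTTP client returns the abstract inside an <extract> element instead of raw wikitext, which avoids the infobox problem entirely.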
I have an application that every five minutes or so comes up with a list of
50-500 individual Wikipedia documents that it wants to request via curl. It
wants to get them as quickly and as completely as possible, with minimal
missed documents or dropped connections.
1) What is the etiquette for how to do this politely?
2) What are the best practices for using curl to submit API requests?
* is it best to use a single curl command with a long list of URLs to get,
or a long list of individual curl commands?
* compressed or not?
* retry values?
* other parameters?
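On the single-command-versus-many question, one technique worth knowing (assuming the standard action=query API rather than raw page fetches) is that MediaWiki accepts up to 50 titles per request, joined with "|", so 500 pages need only 10 API calls. A sketch of the batching step:

```java
import java.util.ArrayList;
import java.util.List;

public class BatchTitles {
    // MediaWiki's query API accepts up to 50 titles per request, joined
    // with "|"; each returned string is one value for the titles= parameter.
    static List<String> batchTitles(List<String> titles, int batchSize) {
        List<String> batches = new ArrayList<>();
        for (int i = 0; i < titles.size(); i += batchSize) {
            int end = Math.min(i + batchSize, titles.size());
            batches.add(String.join("|", titles.subList(i, end)));
        }
        return batches;
    }

    public static void main(String[] args) {
        System.out.println(batchTitles(List.of("A", "B", "C", "D", "E"), 2));
        // → [A|B, C|D, E]
    }
}
```

On etiquette, the API documentation asks clients to make requests serially rather than in parallel, send a descriptive User-Agent, accept gzip (Accept-Encoding: gzip), and pass the maxlag parameter so the servers can shed load politely when replication lags.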
Has there been any movement toward API account creation?  I
maintain several projects that use the API to embed a MediaWiki-type
interface into a CMS. As it stands now, I need to tell my users to go
to the MediaWiki interface to create an account -- a discombobulating
experience for many.
I want to add a widget to a web page that will allow users to enter search
terms and search Wikimedia for images that match the terms. I have
implemented a similar widget for Flickr, using their API, but am having
trouble doing the same with Wikimedia.
Basically, I would like to replicate the functionality of the
commons.wikimedia.org search page. Ideally I would like to be able to get a
Category listing (ex. http://commons.wikimedia.org/wiki/Chartres_Cathedral)
or true search results (ex.
but at this point I would be happy with either.
I've tried using the allimages list, but that is not adequate. Is there any
other way to search images using the API?
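Two API queries may get closer than allimages, assuming the standard Commons action API: list=search restricted to the File namespace (srnamespace=6) behaves roughly like the commons.wikimedia.org search page for images, and list=categorymembers gives a category listing. A sketch of both request URLs:

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class CommonsSearch {
    private static final String API = "https://commons.wikimedia.org/w/api.php";

    // Full-text search limited to the File: namespace (srnamespace=6),
    // which is approximately what the Commons search page does for images.
    static String imageSearchUrl(String terms) {
        return API + "?action=query&list=search&format=xml"
                   + "&srnamespace=6&srsearch="
                   + URLEncoder.encode(terms, StandardCharsets.UTF_8);
    }

    // Lists the members of a category, e.g. "Category:Chartres Cathedral".
    static String categoryListingUrl(String category) {
        return API + "?action=query&list=categorymembers&format=xml"
                   + "&cmtitle="
                   + URLEncoder.encode(category, StandardCharsets.UTF_8);
    }
}
```

Each hit in the search results is a File: page title; a follow-up query with prop=imageinfo can then resolve the actual image URL and thumbnail for the widget.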
I have also been looking at Freebase and DBPedia. These seem like they
might do what I want, but RDF is completely new to me and I'm still trying
to figure out the basics of it. If anyone can point me in the right
direction for either of those resources, I would appreciate it.
I am using the following perl modules to extract data from Wikipedia and
Wikitravel respectively -
From both these APIs, and also by looking at the MediaWiki APIs, I seem to
get the entire chunk of text in the Web Service response. To extract
different sections of the Wiki entry, I have to rely on pattern matching
and regular expressions.
Is there a better way to achieve this? Is there some sample code in any
language (preferably Perl) that anyone can share, or some tool that does
this out of the box?
Any help would be appreciated.
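One way to avoid the regex matching, assuming the standard action API is reachable: action=parse with prop=sections returns the page's section list (titles, numbers, and index values) as structured data, and the index from that response can then be fed to rvsection to fetch just one section's wikitext. A sketch of the two request URLs (in Java rather than Perl, but the parameters carry over):

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class SectionUrls {
    private static final String API = "https://en.wikipedia.org/w/api.php";

    // Step 1: list the page's sections as structured XML (each <s> element
    // carries the section's title and its numeric index).
    static String sectionListUrl(String page) {
        return API + "?action=parse&prop=sections&format=xml&page="
                   + URLEncoder.encode(page, StandardCharsets.UTF_8);
    }

    // Step 2: fetch only one section's wikitext, using an index from step 1,
    // instead of pattern-matching the whole page.
    static String sectionContentUrl(String page, int sectionIndex) {
        return API + "?action=query&prop=revisions&rvprop=content"
                   + "&format=xml&rvsection=" + sectionIndex
                   + "&titles="
                   + URLEncoder.encode(page, StandardCharsets.UTF_8);
    }
}
```

The two-step lookup replaces the regular expressions with an XML parse of a small, well-defined response, which tends to be much less brittle across page layouts.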