Mediawiki-api May 2021

mediawiki-api@lists.wikimedia.org

5 participants
2 discussions

Need to extract abstract of a wikipedia page
by aditya srinivas 23 Nov '23


Hello,

I am writing a Java program to extract the abstract of a Wikipedia page, given the page's title. I have done some research and found that the abstract will be in rvsection=0. So, for example, if I want the abstract of the 'Eiffel Tower' wiki page, I query the API in the following way:

http://en.wikipedia.org/w/api.php?action=query&prop=revisions&titles=Eiffel…

I then parse the XML data that comes back and take the wikitext in the tag <rev xml:space="preserve">, which represents the abstract of the Wikipedia page. But this wikitext also contains the infobox data, which I do not need.

I would like to know if there is any way to remove the infobox data and get only the wikitext related to the page's abstract, or if there is an alternative method by which I can get the abstract of the page directly.

Looking forward to your help.

Thanks in advance,
Aditya Uppu
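Both options asked about here can be sketched in Java. The first method below builds a query against prop=extracts (the TextExtracts extension, which as far as I know is enabled on English Wikipedia) with exintro and explaintext, so the API returns the lead section as plain text and no infobox stripping is needed. The second is a fallback for the rvsection=0 wikitext: it drops every {{...}} template by counting brace depth. The class and method names are made up for illustration, and the brace matcher is a rough heuristic under stated assumptions, not a wikitext parser.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; not part of any MediaWiki client library.
final class WikiAbstractSketch {

    // Builds a URL asking for the intro of a page as plain text.
    // Assumes the TextExtracts extension (prop=extracts) is available
    // on the target wiki. Requires Java 10+ for this URLEncoder overload.
    static String buildIntroExtractUrl(String title) {
        return "https://en.wikipedia.org/w/api.php"
            + "?action=query&prop=extracts&exintro=1&explaintext=1&format=xml"
            + "&titles=" + URLEncoder.encode(title, StandardCharsets.UTF_8);
    }

    // Removes every {{...}} template, including nested ones, by tracking
    // how many "{{" are currently open. Text is kept only at depth 0.
    // Limitation: this also removes inline templates inside the abstract
    // and does not treat {{{param}}} triple braces specially.
    static String stripTemplates(String wikitext) {
        StringBuilder out = new StringBuilder();
        int depth = 0;
        int i = 0;
        while (i < wikitext.length()) {
            if (wikitext.startsWith("{{", i)) {
                depth++;
                i += 2;
            } else if (depth > 0 && wikitext.startsWith("}}", i)) {
                depth--;
                i += 2;
            } else {
                if (depth == 0) {
                    out.append(wikitext.charAt(i));
                }
                i++;
            }
        }
        return out.toString().trim();
    }
}
```

Calling WikiAbstractSketch.buildIntroExtractUrl("Eiffel Tower") gives a URL whose response carries the intro text in an extract element rather than a rev element, so the existing XML parsing would need a small adjustment. If the wikitext route is kept instead, stripTemplates can be applied to the content of the rev tag before further processing.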


Duplicate images in Wikimedia Commons
by Sreejith K. 20 May '21


Hi,

I am trying to compile a list of duplicate images in Wikimedia Commons. I am iterating through the list of images using the generator=allimages API and using the continue option to get the next set. But the API gets stuck at 𪎥-seal.svg: it returns neither the next set nor the continue option. Here is the URL I am using:

https://commons.wikimedia.org/w/api.php?action=query&generator=allimages&pr…

Can anyone help me with this? If there is an alternative, that would be great.

Thanks,
Sreejith Kulamgarath
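This does not diagnose why the listing stops at that file, but the continuation step described above can be sketched as follows. The MediaWiki API expects every key/value pair from the continue object of the previous response to be sent back along with the original parameters. One thing worth checking is that the continuation value is percent-encoded as UTF-8, since a file name starting with a character outside the Basic Multilingual Plane is easy to mangle. The class and method names are hypothetical, and the gaicontinue key is an assumption about what the response contains for generator=allimages.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical helper class; not part of any MediaWiki client library.
final class ContinueSketch {

    // Merges the original query parameters with the key/value pairs taken
    // from the "continue" object of the previous response, then builds the
    // next request URL. Continuation values override base values that
    // share a key. Requires Java 10+ for this URLEncoder overload.
    static String nextRequestUrl(String endpoint,
                                 Map<String, String> baseParams,
                                 Map<String, String> continueParams) {
        Map<String, String> merged = new LinkedHashMap<>(baseParams);
        merged.putAll(continueParams);

        StringJoiner query = new StringJoiner("&");
        for (Map.Entry<String, String> e : merged.entrySet()) {
            // Encode as UTF-8 so supplementary-plane characters in file
            // names survive the round trip to the server.
            query.add(URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8)
                + "=" + URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8));
        }
        return endpoint + "?" + query;
    }
}
```

In a loop, baseParams stays fixed (action=query, generator=allimages, and so on), while continueParams is replaced after each response; iteration ends when a response has no continue object at all. If the response for this file really contains neither results nor a continue object, the problem is likely on the server side and is not something this sketch would fix.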

