Mediawiki-api November 2013

mediawiki-api@lists.wikimedia.org

14 participants
13 discussions

Need to extract abstract of a wikipedia page
by aditya srinivas 23 Nov '23

23 Nov '23

Hello,

I am writing a Java program to extract the abstract of a Wikipedia page given the page title. I have done some research and found that the abstract will be in rvsection=0. So, for example, if I want the abstract of the 'Eiffel Tower' wiki page, I query the API in the following way:

http://en.wikipedia.org/w/api.php?action=query&prop=revisions&titles=Eiffel…

I then parse the XML data that comes back and take the wikitext in the tag <rev xml:space="preserve">, which represents the abstract of the Wikipedia page. But this wikitext also contains the infobox data, which I do not need.

I would like to know if there is any way to remove the infobox data and get only the wikitext related to the page's abstract, or if there is an alternative method by which I can get the abstract of the page directly.

Looking forward to your help.

Thanks in advance,
Aditya Uppu
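One possible alternative to stripping the infobox by hand is to ask the TextExtracts module (prop=extracts) for the lead section as plain text. The sketch below assumes that module is enabled on the target wiki. The helper names, the page id and the sample response are illustrative and do not come from the thread.

```python
from urllib.parse import urlencode

API = "https://en.wikipedia.org/w/api.php"


def build_intro_query(title):
    """Build a URL asking for the plain-text lead section of one page."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,      # only the text before the first section heading
        "explaintext": 1,  # plain text instead of HTML
        "redirects": 1,    # follow redirects to the target article
        "format": "json",
        "titles": title,
    }
    return API + "?" + urlencode(params)


def intro_from_response(data):
    """Return the extract of the first page in a decoded JSON response."""
    pages = data.get("query", {}).get("pages", {})
    for page in pages.values():
        return page.get("extract", "")
    return ""


# Made-up response in the shape the module returns; the page id is fake.
sample = {"query": {"pages": {"12345": {"title": "Eiffel Tower",
                                        "extract": "The Eiffel Tower is ..."}}}}
print(build_intro_query("Eiffel Tower"))
print(intro_from_response(sample))
```

The same two steps (build the URL, read query.pages.*.extract) carry over to a Java client with any HTTP and JSON library.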

4 3

Changes in API action=parse&mobileformat=...
by Max Semenik 09 Jun '14

09 Jun '14

Hi,

in response to bug 54607 [1], we've changed the semantics of the mobileformat parameter to action=parse.

== Summary ==
Previously, it accepted the strings 'html' or 'wml' (later just 'html') and modified the structure of the output (see below). This was problematic because you needed to retrieve the HTML from the output in different ways, depending on whether mobileformat was specified or not. Now, mobileformat is a boolean parameter: if there is a 'mobileformat' parameter in the request, it is treated as "the output should be mobile-friendly", regardless of its value, and the output structure is the same as for non-mobile parses. For compatibility with older callers, mobileformat=(html|wml) will be special-cased to return the older structure for at least 6 months from now.

These changes will start being rolled out to the WMF sites starting from tomorrow, Tuesday October 24th, and this process will be complete by October 31st.

== Examples ==
=== Non-mobile parse ===
api.php?action=parse&format=json
    {
        "parse": {
            "title": "...",
            "text": {
                "*": "foo"
            }
        }
    }

api.php?action=parse&format=xml
    <?xml version="1.0"?>
    <api>
        <parse title="..." displaytitle="...">
            <text xml:space="preserve">foo</text>
        </parse>
    </api>

=== Parse that outputs mobile HTML, old style ===
api.php?action=parse&format=json&mobileformat=html
    {
        "parse": {
            "title": "API",
            "text": "foo"
        }
    }

api.php?action=parse&format=xml&mobileformat=html
    <?xml version="1.0"?>
    <api>
        <parse title="..." text="foo" displaytitle="...">
        </parse>
    </api>

=== Parse that outputs mobile HTML, new style ===
api.php?action=parse&format=...&mobileformat
Same as for non-mobile parses.

== FAQ ==
Q: I didn't use mobileformat before, does anything change for me?
A: No.

Q: I use mobileformat=html, will my bot/tool be broken now?
A: No, you will have 6 months to switch to the new style.

Q: I'm only planning to use mobileformat, what should I do?
A: Just use the new style.

Q: How did this format discrepancy appear in the first place?
A: To err is human.

-----
[1] https://bugzilla.wikimedia.org/show_bug.cgi?id=54607

--
Best regards,
Max Semenik ([[User:MaxSem]])
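During the transition period a client may see either JSON structure described in the announcement. A minimal sketch of a reader that accepts both follows. The function name is hypothetical and the two sample results simply mirror the examples in the message.

```python
def parse_html(result):
    """Return the HTML from an action=parse JSON result, old or new style."""
    text = result["parse"]["text"]
    if isinstance(text, dict):
        # New style (and non-mobile): {"text": {"*": "<html>"}}
        return text["*"]
    # Old mobile style: {"text": "<html>"}
    return text


new_style = {"parse": {"title": "...", "text": {"*": "foo"}}}
old_style = {"parse": {"title": "API", "text": "foo"}}
print(parse_html(new_style), parse_html(old_style))
```

Once the compatibility window has closed, the old-style branch can be dropped.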

2 2

Wikipedia Page Counts for A word
by Muhidin Mohamed 10 Dec '13

10 Dec '13

Dear All,

I am working on an application which requires the total number of Wikipedia pages that contain a specific word or a pair of words. I have tried to use the advanced search to get these page counts, retrieving the HTML page and extracting the number from it. Please see the attached snapshot: is the figure indicated by the red arrow the actual total number of pages containing the name David Beckham?

Any advice is highly appreciated.

Cheers,
...................................................................
Muhidin A. Mohamed, School of Electrical, Electronic and Computer Engineering, University of Birmingham, Pritchatts Road, Edgbaston, B15 2SA
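Instead of scraping the search results page, the count can probably be read from the API's list=search module, which reports a totalhits figure when srinfo=totalhits is requested. The sketch below only builds the request and reads that field. The helper names are illustrative, the sample number is invented, and whether totalhits matches the figure in the screenshot is an assumption to verify.

```python
from urllib.parse import urlencode

API = "https://en.wikipedia.org/w/api.php"


def build_count_query(phrase):
    """Build a search URL for an exact phrase, asking only for the hit count."""
    params = {
        "action": "query",
        "list": "search",
        "srsearch": f'"{phrase}"',  # quotes request an exact-phrase match
        "srinfo": "totalhits",
        "srlimit": 1,               # the result list itself is not needed
        "format": "json",
    }
    return API + "?" + urlencode(params)


def total_hits(data):
    """Read the hit count from a decoded JSON search response."""
    return data["query"]["searchinfo"]["totalhits"]


# Invented response used only to show where the number lives.
sample = {"query": {"searchinfo": {"totalhits": 1234}, "search": []}}
print(build_count_query("David Beckham"))
print(total_hits(sample))
```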

3 3

Re: [Mediawiki-api] EXCERPT API : a fix?
by Luigi Assom 29 Nov '13

29 Nov '13

Hello, is there a minimum number of votes needed for a bug to be fixed? I am wondering what the average time is, if there is one, for a normal bug to be fixed, and whether I may help. How would it be possible to help without installing the MediaWiki software? (I am a bit skilled in Python, not in other languages.) Thanks!

2 1

EXCERPT API : a fix?
by Luigi Assom 27 Nov '13

27 Nov '13

Hello everybody,

I noticed a possible glitch in the Wikipedia API's prop=excerpts. In some cases, the parameter *exsentences* does not return the requested number of sentences, but the coordinates of an article. The parameter *exchars*, on the other hand, returns a number of characters and also the coordinates. E.g. please have a look here:

http://en.wikipedia.org/wiki/United_World_College_of_the_Adriatic
http://en.wikipedia.org/w/api.php?format=jsonfm&action=query&pageids=126666…
http://en.wikipedia.org/w/api.php?format=jsonfm&action=query&pageids=126666…

I would like to have just the intro with the sentences.

Another example, though I think this is not an error, is with names, e.g. H. P. Lovecraft: Against the World, Against Life

http://en.wikipedia.org/w/api.php?format=jsonfm&action=query&pageids=175459…

Here I obtained a truncated sentence, because the dots in the name force the sentence to be cut off.

Is this something that should be fixed? Coordinates should appear only with their own module, right?

--
Luigi Assom
Skype contact: oggigigi

2 2

MediaWiki API to list new articles as Special:Newpages
by Kenrick 26 Nov '13

26 Nov '13

Hi,

My name is Kenrick. I'm an Indonesian Wikipedia administrator experimenting with the MediaWiki API. I am currently working on a project to list new articles using the MediaWiki API with these parameters:

action = query
list = recentchanges
rctype = new
rcshow = !redirect

But this method also includes articles whose current revision is a redirect page. Is there any way to list new articles exactly as Special:Newpages does?

Thank you.
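One way to approach this, sketched under assumptions, is to use recentchanges as a generator and attach prop=info, which marks pages that are currently redirects with a "redirect" key. Those pages can then be dropped on the client side. The namespace restriction, the helper names and the sample response are illustrative, and the result is not guaranteed to match Special:Newpages exactly.

```python
from urllib.parse import urlencode

API = "https://id.wikipedia.org/w/api.php"


def build_new_pages_query(limit=50):
    """Build a URL listing newly created main-namespace pages with page info."""
    params = {
        "action": "query",
        "generator": "recentchanges",
        "grctype": "new",
        "grcshow": "!redirect",
        "grcnamespace": 0,   # assumption: only articles are wanted
        "grclimit": limit,
        "prop": "info",      # adds a "redirect" key to current redirects
        "format": "json",
    }
    return API + "?" + urlencode(params)


def current_non_redirects(data):
    """Return titles of pages whose current revision is not a redirect."""
    pages = data.get("query", {}).get("pages", {})
    return sorted(p["title"] for p in pages.values() if "redirect" not in p)


# Invented response: the second page was turned into a redirect after creation.
sample = {"query": {"pages": {
    "101": {"title": "Artikel baru"},
    "102": {"title": "Halaman lama", "redirect": ""},
}}}
print(build_new_pages_query())
print(current_non_redirects(sample))
```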

2 2

About "move/move" and "move/move_redir"
by Liu Chenheng 24 Nov '13

24 Nov '13

Hi all,

There are 2 move operations in recent changes: "move/move" and "move/move_redir". In "move/move" changes, there may be a tag suppressedredirect="", which means the page was moved without creating a redirect ( http://www.mediawiki.org/wiki/Help:Moving_a_page ).

Here is my question: is a "move/move" change without suppressedredirect="" equivalent to "move/move_redir", i.e. is it a move-over-redirect operation?

Thanks.

2 3

how to resolve image url
by Thomas 17 Nov '13

17 Nov '13

Hi,

I am writing a mobile client that can show Wikipedia content. My approach is to download the raw MediaWiki markup instead of the generated HTML. This gives me more control and lets me avoid using an HTML parser/viewer. The approach is quite successful except when I encounter images.

In the markup I can see something like:

File:1945-P-Jefferson-War-Nickel-Reverse.JPG

I use the API to fetch some metadata:

en.wikipedia.org/w/api.php?action=query&prop=imageinfo \
&iilimit=1&format=xml&iiprop=dimensions%7Cmime&titles=[foo]

The piece of the puzzle I am still missing is how to find out the actual download URL for any given image. I've seen image URLs start with

http://upload.wikimedia.org/wikipedia/en/6/6d/

and with

http://upload.wikimedia.org/wikipedia/commons/d/d0/

But I don't really understand how to decide which URL to prefix to my image name. Can anyone shed some light on this?

Thanks!
--
Thomas Zander
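Rather than reconstructing the prefix, the same imageinfo query can be asked for the full URL by adding url to iiprop. A minimal sketch of that idea follows. The helper names are illustrative and the sample response, including its URL and page id, is invented to show where the field sits.

```python
from urllib.parse import urlencode

API = "https://en.wikipedia.org/w/api.php"


def build_imageinfo_query(file_title):
    """Build an imageinfo URL that also requests the download URL."""
    params = {
        "action": "query",
        "prop": "imageinfo",
        "iiprop": "url|dimensions|mime",  # "url" is the addition
        "iilimit": 1,
        "format": "json",
        "titles": file_title,
    }
    return API + "?" + urlencode(params)


def image_urls(data):
    """Map each file title in a decoded JSON response to its download URL."""
    urls = {}
    for page in data.get("query", {}).get("pages", {}).values():
        info = page.get("imageinfo")
        if info:
            urls[page["title"]] = info[0]["url"]
    return urls


# Invented response; a real one carries the wiki's own URL for the file.
sample = {"query": {"pages": {"-1": {
    "title": "File:Example.jpg",
    "imageinfo": [{"url": "https://upload.wikimedia.org/wikipedia/commons/x/xx/Example.jpg",
                   "width": 100, "height": 80, "mime": "image/jpeg"}],
}}}}
print(build_imageinfo_query("File:1945-P-Jefferson-War-Nickel-Reverse.JPG"))
print(image_urls(sample))
```

Taking the URL from the response avoids having to know whether a file lives on the local wiki or on Commons.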

4 5

geodata and geohack
by Luigi Assom 14 Nov '13

14 Nov '13

Hi All,

we've been developing a product to access the knowledge in Wikipedia in other ways, and of course we make use of the MediaWiki API. We are using the GeoData extension to build a "nearby" feature similar to the wiki's, but I'd like to be able to retrieve articles beyond the 10000 radius limit: this is the case when I drag the map. For example, you can zoom out the map to the scale of nations, drag it around the globe, and see other articles pinned on the map.

I've seen something similar in Google Maps and other mobile apps, so I read the docs again:

http://www.mediawiki.org/wiki/Extension:GeoData

I figured out that this feature should be available from geohack, which is already included, through the "scale" parameter. However, I didn't understand how to use it; also, geohack is not mentioned in the main doc:

http://en.wikipedia.org/w/api.php

Should I pipe it with a generator? Is there any example of how to do it?

Once again, thank you so much for your help to all devs!

Luigi
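One workaround that stays within the radius limit is to cover the visible map area with several list=geosearch queries, one per grid cell. The sketch below is only an approximation under stated assumptions: a spherical Earth with roughly 111,320 m per degree of latitude, a bounding box that is not near the poles and does not cross the antimeridian, and hypothetical helper names. The number of requests grows quickly with the area, so it suits moderate zoom levels rather than a whole-nation view.

```python
import math
from urllib.parse import urlencode

API = "https://en.wikipedia.org/w/api.php"
MAX_RADIUS_M = 10000          # radius limit mentioned in the post
METERS_PER_DEGREE = 111320.0  # rough length of one degree of latitude


def geosearch_url(lat, lon, radius=MAX_RADIUS_M, limit=50):
    """Build one geosearch URL, clamping the radius to the allowed maximum."""
    params = {
        "action": "query",
        "list": "geosearch",
        "gscoord": f"{lat}|{lon}",
        "gsradius": min(radius, MAX_RADIUS_M),
        "gslimit": limit,
        "format": "json",
    }
    return API + "?" + urlencode(params)


def tile_centers(south, west, north, east, radius=MAX_RADIUS_M):
    """Return circle centres whose discs approximately cover the bounding box."""
    # A square cell of side r*sqrt(2) fits inside a circle of radius r.
    step_m = radius * math.sqrt(2)
    lat_step = step_m / METERS_PER_DEGREE
    centers = []
    lat = south + lat_step / 2
    while lat - lat_step / 2 < north:
        # Degrees of longitude shrink with latitude, so widen the step.
        lon_step = step_m / (METERS_PER_DEGREE * math.cos(math.radians(lat)))
        lon = west + lon_step / 2
        while lon - lon_step / 2 < east:
            centers.append((round(lat, 5), round(lon, 5)))
            lon += lon_step
        lat += lat_step
    return centers


centers = tile_centers(45.0, 7.0, 45.5, 7.5)
print(len(centers), "requests, first:", geosearch_url(*centers[0]))
```

Results from neighbouring cells overlap, so a client would deduplicate articles by page id before pinning them on the map.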

2 2

How to get around server latency for parsed html revisions & parallel requests
by jeph 12 Nov '13

12 Nov '13

Hi,

As part of the visualisation tool I'm building, I'm fetching the parsed revisions of an article. When the article is of considerable size, e.g. the latest revisions of Barack Obama, each request takes 10+ seconds. As the tool is interactive and shows the edits made to an article as an animation, the time taken by the server is a problem. (The requests are read-only.)

I'm currently not making parallel requests. What would be a reasonable degree of parallelism? Are there other ways to get around this latency issue?

https://meta.wikimedia.org/wiki/Grants:IEG/Replay_Edits talks about the tool and the project.

Thanks,
Jeph
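If parallel requests are used at all, a small fixed pool keeps the load on the server bounded while letting the animation prefetch a few revisions ahead. The sketch below assumes a caller-supplied fetch function (hypothetical here) that downloads the parsed HTML for one revision id. The default of three workers is an arbitrary, conservative choice and not a documented limit.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_all(rev_ids, fetch, max_workers=3):
    """Fetch revisions with at most max_workers requests in flight.

    Results are returned in the same order as rev_ids.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fetch, rev_ids))


def fake_fetch(rev_id):
    """Stand-in for a real HTTP call such as action=parse&oldid=<rev_id>."""
    return f"<p>parsed revision {rev_id}</p>"


print(fetch_all([101, 102, 103], fake_fetch))
```

Because the results keep the input order, the animation can consume them sequentially while later revisions are still being fetched.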

4 5
