Hello,
I am writing a Java program to extract the abstract of a Wikipedia page
given the page's title. I have done some research and found out that the
abstract will be in rvsection=0.
So, for example, if I want the abstract of the "Eiffel Tower" wiki page, I
query the API in the following way:
http://en.wikipedia.org/w/api.php?action=query&prop=revisions&titles=Eiffel…
I then parse the XML data that is returned and take the wikitext in the tag
<rev xml:space="preserve">, which represents the abstract of the Wikipedia page.
But this wikitext also contains the infobox data, which I do not need. I
would like to know if there is any way in which I can remove the infobox data
and get only the wikitext related to the page's abstract, or if there is any
alternative method by which I can get the abstract of the page directly.
Looking forward to your help.
Thanks in advance,
Aditya Uppu
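One way to approach the infobox problem described above is to drop every
balanced {{ ... }} template block from the section-0 wikitext before using it.
The following is only a minimal sketch under that assumption: the class name
WikitextTemplates, the method stripTemplates, and the sample wikitext are all
made up for illustration. It removes all templates, not just the infobox, and
it does not handle triple-brace parameters or tables.

```java
public class WikitextTemplates {

    /**
     * Removes every balanced {{ ... }} template block, including nested
     * templates, and returns the remaining text with outer whitespace trimmed.
     */
    public static String stripTemplates(String wikitext) {
        StringBuilder out = new StringBuilder();
        int depth = 0; // how many {{ are currently open
        int i = 0;
        while (i < wikitext.length()) {
            if (wikitext.startsWith("{{", i)) {
                depth++;
                i += 2;
            } else if (depth > 0 && wikitext.startsWith("}}", i)) {
                depth--;
                i += 2;
            } else {
                // Keep characters only when we are outside all templates.
                if (depth == 0) {
                    out.append(wikitext.charAt(i));
                }
                i++;
            }
        }
        return out.toString().trim();
    }

    public static void main(String[] args) {
        // Illustrative input only, not real API output.
        String sample = "{{Infobox building\n"
                + "| name = Eiffel Tower\n"
                + "| height = {{convert|300|m}}\n"
                + "}}\n"
                + "The '''Eiffel Tower''' is a tower in Paris.";
        System.out.println(stripTemplates(sample));
    }
}
```

The remaining text still contains wiki markup such as ''' and [[links]], so
further cleanup would be needed to get plain text.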
Hi,
I am trying to compile a list of duplicate images on Wikimedia Commons. I
am iterating through the list of images using the generator=allimages API
and using the continue option to get the next set. But the API gets stuck
at 𪎥-seal.svg: it returns neither the next set nor the continue option.
Here is the URL I am using:
https://commons.wikimedia.org/w/api.php?action=query&generator=allimages&pr…
Can anyone help me with this? If there is an alternative, that would be
great.
Thanks,
Sreejith Kulamgarath.
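When iterating with the continue option, the continuation values returned by
the API are passed back as parameters on the next request. One thing that may
be worth checking, and this is only a guess at the cause, is whether those
values are percent-encoded as UTF-8 when they contain non-ASCII characters.
The sketch below assumes Java 10 or later; the class ContinueUrl, the method
withContinuation, the base URL, and the sample continuation values are all
hypothetical and are not taken from the truncated URL above.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class ContinueUrl {

    /**
     * Appends each continuation parameter to the base URL, percent-encoding
     * both key and value as UTF-8 so non-ASCII titles survive the round trip.
     */
    public static String withContinuation(String baseUrl, Map<String, String> cont) {
        StringBuilder url = new StringBuilder(baseUrl);
        for (Map.Entry<String, String> e : cont.entrySet()) {
            url.append('&')
               .append(URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8))
               .append('=')
               .append(URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8));
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // Hypothetical base query, used only to demonstrate the encoding step.
        String base = "https://commons.wikimedia.org/w/api.php"
                + "?action=query&generator=allimages&format=json";

        // Sample continuation values; in practice these would be copied from
        // the "continue" element of the previous response.
        Map<String, String> cont = new LinkedHashMap<>();
        // An arbitrary code point outside the Basic Multilingual Plane.
        String title = new String(Character.toChars(0x2A3A5)) + "-seal.svg";
        cont.put("gaicontinue", title);
        cont.put("continue", "gaicontinue||");

        System.out.println(withContinuation(base, cont));
    }
}
```

Printing the resulting URL and comparing it with the one actually being sent
can show whether the title is arriving at the server intact.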