Hello,
I am writing a Java program to extract the abstract of the wikipedia page
given the title of the wikipedia page. I have done some research and found
out that the abstract with be in rvsection=0
So for example if I want the abstract of 'Eiffel Tower" wiki page then I am
querying using the api in the following way.
http://en.wikipedia.org/w/api.php?action=query&prop=revisions&titles=Eiffel…
and parse the XML data which we get and take the wikitext in the tag <rev
xml:space="preserve"> which represents the abstract of the wikipedia page.
But this wiki text also contains the infobox data which I do not need. I
would like to know if there is anyway in which I can remove the infobox data
and get only the wikitext related to the page's abstract Or if there is any
alternative method by which I can get the abstract of the page directly.
Looking forward to your help.
Thanks in Advance
Aditya Uppu
Hi,
I've posted this also on the API:Edit talk page
https://www.mediawiki.org/wiki/API_talk:Edit#How_does_%22recreate%22_parame…
Apparently, without this "recreate" parameter, an article can be recreated
even if it was deleted since the retrieval of the basetimestamp. Is this an
expected behavior? From the current description (*Override any errors about
the page having been deleted in the meantime*) I understand that you have
to set it to true if you want to be able to recreate the page, so no giving
this parameter should raise an error. Can you clarify the document and
behavior?
Discussion on enwiki :
https://en.wikipedia.org/w/index.php?title=User_talk:GoingBatty&diff=995218…
Thanks
Nico