I am writing a Java program to extract the abstract of the wikipedia page
given the title of the wikipedia page. I have done some research and found
out that the abstract with be in rvsection=0
So for example if I want the abstract of 'Eiffel Tower" wiki page then I am
querying using the api in the following way.
and parse the XML data which we get and take the wikitext in the tag <rev
xml:space="preserve"> which represents the abstract of the wikipedia page.
But this wiki text also contains the infobox data which I do not need. I
would like to know if there is anyway in which I can remove the infobox data
and get only the wikitext related to the page's abstract Or if there is any
alternative method by which I can get the abstract of the page directly.
Looking forward to your help.
Thanks in Advance
Per the below, protocol-relative URLs are now enabled on
test.wikipedia.org and will be rolled out to the rest of the wikis
over the course of the next few weeks. What this means is that URLs
used in the interface will now look like //example.com instead of
http://example.com , so we can support both HTTP and HTTPS without
splitting our cache.
The API, in most cases, will not output protocol-relative URLs, but
will continue to output http:// URLs no matter whether you call it
over HTTP or HTTPS. This is because we don't expect API clients to be
able to resolve these correctly, and that the context of these URLs
(which is needed to resolve them) will frequently get lost along the
way. And we don't wanna go breaking clients, now, do we? :)
The exceptions to this, as far as I am aware, are:
* HTML produced by the parser will have protocol-relative URLs in <a
href="..."> tags etc.
* prop=extlinks and list=exturlusage will output URLs verbatim as they
appear in the article, which means they may output protocol-relative
If you are getting protocol-relative URLs in some other place, that's
probably a bug (or maybe it's intentional and I forgot to list it
here), so please let me know, or e-mail this list, or file bug, if you
see that happening.
Roan Kattouw (Catrope)
---------- Forwarded message ----------
From: Ryan Lane <rlane32(a)gmail.com>
Date: Thu, Jul 14, 2011 at 8:55 PM
Subject: [Wikitech-l] Protocol-relative URLs enabled on test.wikipedia.org
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
Over the past couple days Roan Kattouw and I have been pushing out
changes to enable protocol-relative URL support. We've gotten to a
point where we think it is stable and working.
We've enabled this on test.wikipedia.org, and plan on running it for
two weeks before enabling it elsewhere. Please test if everything is
working properly, especially with regards to the API and bots. Report
bugs in bugzilla if any are found.
Wikitech-l mailing list
Mediawiki-api-announce mailing list
i want to login to my mediawiki site from an external java class.
I already have successfully sent the first request and received the token.
After that i have sent the second request with a generated cookie.
I received the success message as result. But im still not loged in.
I tryed to send the three cookies with username, id and token after the
success message as a third request,
but it didnt work. I think there is something wrong with my third
request, could you please describe me how i have to
send these three cookies after receiving the success message.
(Oh and sorry for my bad english)
best regards Berninger Ingo
First, thank you for viewing my post. I'm working on product to get
information from wiki and show them in office application. And when I am
working with MediaWiki APIs, I found a problem. I use C# to build my
application and I get the query result from MediaWiki by Json format. And my
problem is that Some APIs will return a json object which cannot be
deserialized to a C# object.
For example, when I try to get the image list of
request url is this<http://en.wikipedia.org/w/api.php?action=query&prop=images&imlimit=6&titles…>)
result will be:
Please check the red box in the picture above. As we know that JSON is a
key/value based format and in the red box, this key is dynamically generated
according to the page id. in this way, this json is unable to be
a C# object since I have to map a C# property to a const key/ID in code. I
just wonder why MediaWiki do this? if MediaWiki APIs wants to support to
show multipage in query result, probably it should use Array to show all
How can I fetch links which are shown in the external links section of every
article. It's possible to fetch all external links inside a given page using
But I want to fetch links in External Links section only. What query should
I use ?
"A belief may be larger than a fact."