http://en.wikipedia.org/w/api.php?format=xml&action=query&titles=Th…
<http://en.wikipedia.org/w/api.php?format=xml&action=query&titles=The_Social_Network&prop=revisions&rvprop=content>
or in json
http://en.wikipedia.org/w/api.php?format=json&action=query&titles=T…
<http://en.wikipedia.org/w/api.php?format=json&action=query&titles=The_Social_Network&prop=revisions&rvprop=content>
or you can request multiple pages
Just save and open in notepad to see the content format in json.
Definitely play with the API Sandbox,
http://en.wikipedia.org/wiki/Special:ApiSandbox
You can also request more than one page at time.
http://en.wikipedia.org/w/api.php?action=query&format=xml&prop=revi…
<http://en.wikipedia.org/w/api.php?action=query&format=xml&prop=revisions&rvprop=content&titles=The_Social_Network%7CTitanic_%281997_film%29>
what the API will generate
/w/api.php?action=query&prop=revisions&format=xml&rvprop=content&titles=The_Social_Network%7CTitanic_(1997_film)
<http://en.wikipedia.org/w/api.php?action=query&prop=revisions&format=xml&rvprop=content&titles=The_Social_Network%7CTitanic_%281997_film%29>
You just need to add in the
http://en.wikipedia.org or whatever language
sub domain you need like:
http://de.wikipedia.org/w/api.php?action=query&prop=revisions&forma…
<http://de.wikipedia.org/w/api.php?action=query&prop=revisions&format=xml&rvprop=content&rvlimit=5&titles=The_Social_Network>
gives you the German translation.
Tips on the API Sandbox is:
1. Format=choose it (format=xml)
2. Action - Set to Query (action=query)
3. Action - Right below, use drop down to choose prop value
(prop=revisions).
4. Choose what you want, content (rvprop=content)
5. Loose the revisions to return (rvlimit), clear the input box. I don't
know why they set that 10 automatically, let the user decide or else
you'll get a snapshot of the last 10 versions of the page.
6. Pick the title, you can get multiple titles by separating them with
the pipe | character.
The problem you will have to address is parsing out the wiki markup if
you just want pure text content. You'll have to find a sanitizer to
strip out all the wiki markup.
Tom
------------------------------------------------------------------------
On 4/27/2012 6:38 AM, Krishna Kranth wrote:
Thanks for the link provided. I tried my best to
understand how to
extract data from Wikipedia using APIs. But I failed miserably. Please
can someone help me out to give me a simpler link or example showing
the data from Wikipedia.
Something like this :
http://api.rottentomatoes.com/api/public/v1.0/movies.json?apikey=e8qkmhd7s4…
<http://api.rottentomatoes.com/api/public/v1.0/movies.json?apikey=e8qkmhd7s4hambanqsbwy9ua&q=The+Social+Network>
Click on the above link and check how simpler it is to get data from
Rotten Tomatoes. Is there any link in that way?
Thanks in advance.
Regards
M.Krishna Kranth
Team works means more We less Me.
On Thu, Apr 26, 2012 at 6:13 PM, Timothy Helck
<timothy.helck(a)gmail.com <mailto:timothy.helck@gmail.com>> wrote:
Krishna,
Try this page:
http://en.wikipedia.org/w/api.php
I find some of the other documentation a bit confusing. The above
link seems to have everything in one place. It takes a while to go
through it. Once you gain some familiarity with the API you will
be able to formulate more specific questions, then you will find
the people on this mailing list to be very helpful.
Good luck!
Tim Helck
On Thu, Apr 26, 2012 at 8:18 AM, Krishna Kranth
<krishnakranth.m(a)gmail.com <mailto:krishnakranth.m@gmail.com>> wrote:
Dear Team,
I am planning to start a new website containing celebrities
from different areas like in sports, movies and politics. For
this I am planning to exract data from Wikipedia using API in
XML or JSON format. Can someone send me Wikipedia API and the
format of response I get? I am pretty much new to this type of
technical stuff. I tried going through the document but could
not understand much in detail. An example format will be of
great use.
Thanks in advance.
Regards
M.Krishna Kranth
Team works means more We less Me.
_______________________________________________
Mediawiki-api mailing list
Mediawiki-api(a)lists.wikimedia.org
<mailto:Mediawiki-api@lists.wikimedia.org>
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
_______________________________________________
Mediawiki-api mailing list
Mediawiki-api(a)lists.wikimedia.org
<mailto:Mediawiki-api@lists.wikimedia.org>
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
_______________________________________________
Mediawiki-api mailing list
Mediawiki-api(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api