http://code.google.com/p/jwpl/ this looks also good JWPL (Java Wikipedia Library) is a free, Java-based application programming interface that allows to access all information contained in Wikipedia.
have not tried that , but you said you wanted to do it in java. mike
On Sun, Nov 27, 2011 at 6:20 PM, Mike Dupont jamesmikedupont@googlemail.com wrote:
look, for 100 articles, just create a list of them, and export them as xml. or use the book creator. http://en.wikipedia.org/wiki/Help:Books
also there is a json api to pull single articles. http://www.barattalo.it/2010/08/29/php-bot-to-get-wikipedia-definitions/
mike
On Sun, Nov 27, 2011 at 6:15 PM, Khalida BEN SIDI AHMED send.to.khalida@gmail.com wrote:
I have already read the responses given in this post.
I want to the extract the first paragraph (or the first sentence) for a list of <100 articles. I could not use JWPL beacause I don't have a big hard disk space to create the DB. I try to use JSoup but I need examples. _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
-- James Michael DuPont Member of Free Libre Open Source Software Kosova http://flossk.org