http://code.google.com/p/jwpl/ this looks also good
JWPL (Java Wikipedia Library) is a free, Java-based application
programming interface that allows to access all information contained
in Wikipedia.
have not tried that , but you said you wanted to do it in java.
mike
On Sun, Nov 27, 2011 at 6:20 PM, Mike Dupont
<jamesmikedupont(a)googlemail.com> wrote:
look,
for 100 articles, just create a list of them, and export them as xml.
or use the book creator.
http://en.wikipedia.org/wiki/Help:Books
also there is a json api to pull single articles.
http://www.barattalo.it/2010/08/29/php-bot-to-get-wikipedia-definitions/
mike
On Sun, Nov 27, 2011 at 6:15 PM, Khalida BEN SIDI AHMED
<send.to.khalida(a)gmail.com> wrote:
I have already read the responses given in this
post.
I want to the extract the first paragraph (or the first sentence) for a
list of <100 articles.
I could not use JWPL beacause I don't have a big hard disk space to create
the DB. I try to use JSoup but I need examples.
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
--
James Michael DuPont
Member of Free Libre Open Source Software Kosova
http://flossk.org
--
James Michael DuPont
Member of Free Libre Open Source Software Kosova
http://flossk.org