Really interesting contribution.
I'll give it a try and let you know :).
Thanks!
F --
--- El mié, 18/11/09, Delip Rao <deliprao(a)gmail.com> escribió:
De: Delip Rao <deliprao(a)gmail.com>
Asunto: [Wiki-research-l] Java API for reading Wikipedia XML dumps
Para: wiki-research-l(a)lists.wikimedia.org
Fecha: miércoles, 18 de noviembre, 2009 18:20
Hello!We have been working on a Java API
for reading Wikipedia XML dumps for sometime and it's
now reasonably functional. Check out:
http://code.google.com/p/wikixmlj/Features:
Easy
access to important elements of a Wikipedia
pageAlso provides interfaces for Wiki text
parsing.Memory efficient
SAX interface for parsingLazy loading of files
for DOMCallback support with
DOMDirectly operate on compressed wikipedia
dumps (gzip/bzip2/native xml
supported)Best,
Delip
-----Adjunto en línea a continuación-----
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l