Really interesting contribution.
I'll give it a try and let you know :).
Thanks!
F --
--- El mié, 18/11/09, Delip Rao deliprao@gmail.com escribió:
De: Delip Rao deliprao@gmail.com Asunto: [Wiki-research-l] Java API for reading Wikipedia XML dumps Para: wiki-research-l@lists.wikimedia.org Fecha: miércoles, 18 de noviembre, 2009 18:20 Hello!We have been working on a Java API for reading Wikipedia XML dumps for sometime and it's now reasonably functional. Check out:
http://code.google.com/p/wikixmlj/Features: Easy access to important elements of a Wikipedia pageAlso provides interfaces for Wiki text parsing.Memory efficient
SAX interface for parsingLazy loading of files for DOMCallback support with DOMDirectly operate on compressed wikipedia dumps (gzip/bzip2/native xml supported)Best,
Delip
-----Adjunto en línea a continuación-----
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l