On 31/03/14 22:46, Lydia Pintscher wrote:
On Mon, Mar 31, 2014 at 3:47 PM, Markus Krötzsch markus@semantic-mediawiki.org wrote:
Dear all,
I am happy to announce the very first release of Wikidata Toolkit [1], the Java library for programming with Wikidata and Wikibase. This initial release can download and parse Wikidata dump files for you, so as to process all Wikidata content in a streaming fashion. An example program is provided [2]. The library can also be used with MediaWiki dumps generated by other Wikibase installations (if you happen to work in EAGLE ;-).
Maven users can get the library directly from Maven Central (see [1]); this is the preferred method of installation. There is also an all-in-one JAR at GitHub [3] and of course the sources [4].
Version 0.1.0 is of course alpha, but the code that we have is already well-tested and well-documented. Improvements that are planned for the next release include:
- Faster and more robust loading of Wikibase dumps
- Support for various serialization formats, such as JSON and RDF
- Initial support for Wikibase API access
Nevertheless, you can already give it a try now. In later releases, it is also planned to support more advanced processing after loading, especially for storing and querying the data.
Feedback is welcome. Developers are also invited to contribute via GitHub.
Congrats, Markus! Great to see a first release. Want to post something on Wikidata:Project chat too?
Yes, good point. Posted the message there too now.
Cheers
Markus